Fluorescence In Situ Sequencing (FISSEQ)

14 October 2015 - videos (review7 on vimeo, lesson8 on vimeo) - Evan Daugharthy and George Church (Harvard/MIT)

class slides here (dropbox link)

Introduction

Why do we need analytic tools for synthetic projects? The tools for synthetic biology have grown incredibly powerful: DNA synthesis, genome engineering, synthetic cells, directed evolution, cell-free systems, metabolic engineering, and nanomaterial science. However, these tools only cover the second half of the “read/write” cycle. In this class, we will discuss the rationale for developing measurement technologies (“read”) to complement these engineering tools (“write”), so that we can understand the effects of our bioengineering efforts and make new products that resemble real biological systems.

We will review various approaches to molecular measurements, including DNA and RNA sequencing, proteomics, and 3D structural morphometry. We will focus predominantly on in situ detection of single molecules (in situ is latin for “in place,” referring to detection of molecules inside cells). Finally, we will discuss applications of these technologies to fibroblast wound healing, understanding how the brain works, and to developing new organoids to further our understanding of biological development and create new biomedical interventions to advance human health.

Readings

Background Reading:

The FISSEQ Method: Lee J, Daugharthy E, Scheiman J, Kalhor R, Yang JL, Ferrente TC, Terry R, Jeanty SSF, Li C,Amamoto R, Peters DT, Turczyk BM, Marblestone A, Inverso S, Bernard A, Mali P, Rios X, Aach J, Church GM (2014) Highly multiplexed three-dimensional subcellular transcriptome sequencing in situ. Science 343(6177):1360-3.

Theory of RNA and Cellular Molecular State: Kim, Junhyong, and James Eberwine (2010) RNA: state memory and mediator of cellular phenotype. Trends in cell biology 20.6:311-318.

Additional Reviews of FISSEQ and Single-Cell Sequencing

Ginart, Paul, and Arjun Raj (2014) RNA sequencing in situ. Nature biotechnology 32.6:543-544.

Mignardi, Marco, and Mats Nilsson (2014) Fourth-generation sequencing in the cell and the clinic. Genome medicine 6.4:31.

Avital, Gal, Tamar Hashimshony, and Itai Yanai (2014) Seeing is believing: new methods for in situ single-cell transcriptomics. Genome biology 15.3:110.

Additional Theory

Eberwine, James, and Junhyong Kim (2015) Cellular Deconstruction: Finding Meaning in Individual Cell Variation. Trends in cell biology 25.10:569-578.

Trapnell, Cole (2015) Defining cell types and states with single-cell genomics. Genome research 25.10:1491-1498.

Homework

Lab Homework Assignment: Create an in situ sequencing library inside a polyacrylamide hydrogel, and detect the sequencing amplicons using fluorescent sequencing by hybridization.

Materials:

3×1“, 1 mm Thick Gold Seal Microscope Slides (EMS 63710-05, 1 gross)
PTFE Printed Slides 1 Oval 24.4×16.7mm (EMS 63416-32, 72)
CultureWell Chambered Coverglass Inserts with 6 mm diameter (EMS 70461-2R2)
200 Proof Ethanol
Nuclease-free/Ultrapure H2O (e.g. Millipore)
1X PBS pH ~7.4 (ideally DNase-free)
1M Tris-HCl Buffer stock pH ~7.4
2X SSC Buffer stock pH ~7.4
Glacial acetic acid
GE Healthcare Life Sciences PlusOne Bind-Silane (17-1330-01)
GE Healthcare Life Sciences PlusOne Repel-Silane ES (17-1332-01)
40% Acrylamide/Bis Solution 19:1 (Bio-Rad 1610144)
TEMED (Bio-Rad 1610800)
Ammonium Persulfate (APS) (Bio-Rad 1610700, 10g bottle)
T4 DNA Ligase (NEB M0202S)
25 mM dNTP Solution Mix (Enzymatics N2050F)
Phi29 DNA Polymerase (Enzymatics P7020-LC-L)
DNA Oligonucleotides (IDT):
Template Species #1: /5phos/TCACGGACCTGCGCGACACATTCAACCCAACACTCCTCCAACCACCGCGCAGGTCCGTGATCTCGAGTGACCACGCGTGGTCACTCGAGA
Splint Ligation/RCA Primer #1: /5Acryd/CGCGCAGGTCCGTGATCTCGAGTGACCAC*G
Sequencing Primer #1: /5dye/CGTGGTCACTCGAGATCACGGACCTGCGCG (choose dyes based on microscope configuration)
Template Species #2: /5phos/TAGACTGGGCATCTCACACATTCAACCCAACACTCCTCCAACCACCGTCGAATGAAGCAG
Splint Ligation/RCA Primer #2: /5Acryd/GAGATGCCCAGTCTACTGCTTCATTCGAC*G
Sequencing Primer #1: /5dye/CGTCGAATGAAGCAGTAGACTGGGCATCTC (chose dyes based on microscope configuration)

Equipment:

Pipettes & Tips (20, 200, 1000 uL)
Binder clips (any size)
Razor blade
Clean glass or plastic beakers (large enough to submerge slides)
PCR machine or heat block
Fluorescence microscope, ideally with >=40X objective and at least 2 channels of fluorescence detection (e.g. Cy3 and Cy5)
Chemical hood
Optional: 30 deg C incubator
Optional: Vacuum line for aspiration

Instructions:

Note: “Washing” the sample just involves aspirating any fluid currently on the sample (gently using a pipette, or using a vacuum line), then gently pipetting the wash buffer or next reagent onto the sample.

Prepare the glass surfaces:

The BindSilane treatment covalently attaches groups to the glass that will cross-link with the polyacrylamide matrix, so that the gel will be well attached to the glass surface. The RepelSilane treatment forms a water repellant film on the other glass surface so that the gel does not stick. Creating a sandwich with a gel between these two surfaces allows you to pour very thin polyacrylamide gels (~50 um thickness).

Wash the glass slides and PTFE printed slides thoroughly with Ultrapure H2O and then ethanol until slides are completely clean with no visible residue.
Mix BindSilane Reagent (scale recipe to fill beaker enough to cover a glass slide completely):
- 8 mL Ethanol
- 200 uL glacial acetic acid
- 1.8 mL Ultrapure H2O
- 5 uL BindSilane
Note: Do BindSilane treatment inside a chemical hood!
Dip glass slides into the BindSilane Reagent and incubate for 10 seconds, then remove the slide and let air dry.
Wash glass slides thoroughly with ethanol and let air dry.
Pipet RepelSilane over the exposed oval glass surface of the PTFE printed slide and incubate for 10 seconds, then let air dry.
Note: Do RepelSilane treatment inside a chemical hood!
Wash PTFE slides thoroughly with ethanol and let air dry.

Pour the sequencing gel:

Here we will pour a thin polyacrylamide gel, embedding the DNA sequencing templates inside the gel.

Mix 1 mL of gel and bring to room temperature
- 850 uL Ultrapure H2O
- 100 uL 40% Acrylamide/Bis Solution 19:1 (final 4%)
- 40 uL 1 M Tris-HCl pH
Prepare 10% APS solution and 10% TEMED solution on ice
Hybridize the sequencing templates with the splint ligation probes
- Add equal moles of each sequencing template and splint ligation probes in 2X SSC (see the next step for molar concentration)
- Heat to 90 deg C for 30 seconds, then gradually cool to room temperature at 0.1 deg C/sec in a PCR machine or by moving the tube from the heat block to room temperature and incubating for 30 minutes
Add 1 uL of pre-hybridized sequencing template mix to the gel mix such that the final concentration of all DNA complexes (sets of sequencing template + splint ligation probe) are at 0.16 nM, which gives on average 1 sequencing amplicon in each 10 um^3 volume of the gel (e.g. 1 mL / 10 um^3 molecules in 1 mL)
Note: If the density is too high or too low, try again adding a different amount of DNA complexes.
Mix thoroughly by pipetting, being careful not to introduce bubbles (oxygen inhibits polymerization).
Add 5 uL 10% TEMED and mix thoroughly by pipetting, being careful not to introduce bubbles (oxygen inhibits polymerization).
Add 5 uL 10% APS and mix thoroughly by pipetting, being careful not to introduce bubbles (oxygen inhibits polymerization).
Quickly pipet 10 uL of gel mixture onto the BindSilane glass slide.
Carefully place a RepelSilane-treated PTFE slide on top of the gel droplet, sandwiching the gel between the two surfaces with the PTFE layer acting as a thin spacer.
Use 1+ binder clips to secure the glass slides together
Incubate for >20 minutes at room temperature, or until the gel has polymerized fully.
Note: It is helpful to pour a bunch of these so you can test whether the gel has set up on an extra slide.
Carefully remove the PTFE slide without stretching or deforming the gel too much. The gel should remain attached to the BindSilane-treated glass slide.
Take a CultureWell Chambered Coverglass insert, and using a razor blade cut across the insert to connect the two wells into a single, large oval area, which should be larger than the oval of the PTFE slide. Position the CultureWell over the gel and press down to create a tight seal with the glass slide surrounding the gel. If the gel extends into the area covered by the CultureWell Insert, use a razor blade to scrape away excess gel, leaving only an area of gel that can fit completely within the CultureWell Insert. Be sure to keep both the glass slide and CultureWell Insert very clean (e.g. no fingerprints, minimal handling) to create a leak-free seal.
Wash the gel twice for 1 minute each with 1X PBS, making sure the CultureWell Insert is not leaking.

Use DNA Ligase to circularize the two DNA species and use rolling circle amplification to generate an in situ sequencing amplicon:

The DNA splint is modified on the 5' end with an Acrydite, which covalently tethers it into the gel, while the 3' end is modified with phosphorothioate bonds to prevent Phi29 from digesting it using Phi29's 3'→5' exonuclease activity. The linear DNA template is complementary to the DNA splint on both ends, so that it looks like a circle. T4 DNA Ligase will seal the nick in the DNA template, and then the DNA splint acts as a primer for phi29 amplification.

Prepare Splint Ligation Mix by adding in order on ice (Recipe for 200 uL reaction volume, scale as necessary):
- 20 uL 10X Ligase Buffer
- 175 uL Ultrapure H2O
- 5 uL T4 DNA Ligase
Wash the sample once for 2 minutes with 1X T4 DNA Ligase Buffer in water.
Add Splint Ligation Mix and incubate at room temperature for 1 hour.
Wash the sample twice for 2 minutes each with 1X PBS.
Prepare the Rolling Circle Amplification Mix (RCA, Recipe for 200 uL reaction volume, scale as necessary):
- 20 uL 10X Phi29 Buffer
- 177 uL Ultrapure H2O
- 2 uL 25 mM dNTP Mix
- 1 uL Phi29 enzyme
Wash the sample once for 2 minutes with 1X Phi29 Polymerase Buffer in water.
Add RCA Mix and incubate at 30 deg C (room temperature is OK) 4 hours to overnight (put the slide inside a plastic bag or pipette tip box and seal with plastic wrap to minimize evaporation).
Wash the sample twice for 2 minutes each with 1X PBS.

Use sequencing by hybridization to determine the identity of each amplicon inside the gel:

Each RCA amplicon inside the gel is one of two “species.” Hybridize the fluorescent probes to the sample, which will light up each amplicon species in one of the two colors for microscopy. Note, we call this “sequencing by hybridization” because the process of DNA hybridization reveals the sequence of the template! In other words, at room temperature the fluorescent probe will only hybridize to other DNA sequences that are very similar, so if the fluorescent probe binds to the RCA amplicon, that tells us a lot about the sequence of the amplicon.

Prepare each fluorescent probe at 1 mM concentration in 2X SSC.
Add to sample and incubate at room temperature for 20 minutes.
Wash the sample five times for 4 minutes each with 1X PBS.
Image on a fluorescence microscope. Each RCA amplicon appears as a fluorescent dot between 100-1000 nm in diameter in one of the two colors!

Congratulations: You have identified single molecules in situ, inside a hydrogel!

Computational Homework Assignment: Analyze a FISSEQ dataset and find some in situ sequences.

Requirements:

Computer (best with 16 GB RAM)
MATLAB
| Python 2.7
| Fiji ImageJ Distribution
| Bio-Formats plug-ins for Fiji/ImageJ
| Bowtie 1.0
R and RStudio ()

Instructions:

These instructions are adapted from Lee, Je Hyuk, et al. (2015) “Fluorescent in situ sequencing (FISSEQ) of RNA for gene expression profiling in intact cells and tissues.” Nature protocols 10.3: 442-458.

Download a free academic version of Canopy Python 2.7 and follow the | installation instructions.
Download and unzip the files from | 2014 FISSEQ Nature Protocols
Download and unzip the RefSeq-to-Gene ID Conversion Table] - Download and unzip the [[ftp://ftp.ncbi.nlm.nih.gov/refseq/H_sapiens/mRNA_Prot/human.rna.fna.gz | human RefSeq RNA FASTA file
Build the reference index in Bowtie (see the Bowtie instruction manual):
$ bowtie-build -C -f human.rna.fna refseq_human
Start MATLAB and add the downloaded folders to the search path:
» addpath('∼/fisseq', '∼/fisseq/bfmatlab')
Define the input and output directories for Image Registration, then run the Image Registration script. Set the mumber of blocks per axis for local registration (default = 10); set the fraction overlap between neighboring blocks (default = 0.1); and adjust the alignment precision, where 10 will register images to 1/10 of a pixel (default = 1).
» input_dir='decon_images/' » output_dir='registered_images/' » register_FISSEQ_images(input_dir,output_dir,10,0.1,1)
Question: What happens when you use different values for the parameters? How does it affect the image registration quality? Open the results in Fiji and take a look! Note, you may have to adjust the contrast in Fiji to get a good look at the images.
Start python and run the script to generate base calls to the file read_data_*.csfasta. The maximum number of missing base calls allowed per read is 6 by default. (* denotes an automatically generated time stamp.)
$ python » import FISSEQ » FISSEQ.ImageData('registered_images','.',6) » quit()'
Question: Take a look at the reads in the resulting .csfasta file. How do they look? What happens to the number of reads if you change the value for maximum number of missing base calls ('6' in the command line).
Align reads to refseq_human using Bowtie 1.0, and write mapped reads to bowtie_output.txt. Note: Use the exact name of read_data_*.csfasta!
$ bowtie -C -n 3 -l 15 -e 240 -a -p 12 -m 20 –chunkmbs 200 -f –best –strata –refidx refseq_human read_data_*.csfasta bowtie_output.txt'
Spatially cluster the Bowtie reads to annotate clusters using gene2refseq, and write to results.tsv. The default kernel size of 3 performs a 3 × 3 dilation before clustering.
$ python » import FISSEQ » G = FISSEQ.ImageData('registered_images',None,6) » FISSEQ.AlignmentData('bowtie_output.txt',3,G,'results.tsv', 'human.rna.fna','gene2refseq','9606') » quit()
Question: Take a look at the output. What happens if you change the size of the kernel to something less than 3? To something much greater than 3?
Open the FISSEQ RStudio project file (Menu → File → Open project…).
Find the HISTORY tab on the upper right console window, and double-click on individual commands in order to re-execute the previous R session, and learn how to: import and filter data using a specific criterion (i.e., cluster size); plot a distribution of reads by a specific criterion (i.e., RNA classes and strands); convert a table of reads into a table of gene expression level; correlate gene expression from different images; and find statistically enriched genes in different regions.
Task: Are there any correlations between the features of FISSEQ clusters? E.g., is cluster size correlated with cluster quality?
Task: Find some clusters of different size and quality, and then look at the first image in Fiji and see if you can see the FISSEQ amplicon associated with that cluster. (Note: X/Y is inverted in the clustering file.)

Design Homework Assignment: Think back to your experience so far with HTGAA. Were there any experiments where in situ data of RNA, DNA, protein, or other cellular features would be helpful in understanding the engineering process? You should try to answer the following questions:

What are some reasons in situ data could be better than bulk data for this experiment? Try to think of cases where a bulk measurement would cause you to miss some insight.
What kinds of molecules would you like to detect? E.g. what species of RNA? How would you go about targeting those molecules?
What factors would limit your ability to detect the things you are interested in? There are probably lots of these! For example, if you are interested in RNA expression in e. coli, each e. coli is only big enough for a few FISSEQ amplicons, so at most you could only detect 2-3 RNA molecules!! Try to think of strategies to overcome these limitations.

Try to be as detailed as possible and think creatively! These are the kinds of questions we ask every day and that come up as we talk to other scientists who want to use FISSEQ. These questions drive our technology development process!