Publications

1. A consensus definition for deep layer 6 excitatory neurons in mouse somatosensory, visual, and motor cortex. (2025)

Kim SJ, Babola TA, Lee K, Spiegel AC, Lee K, Matney CJ, Liew MH, Schulteis EM, Coye AE, Proskurin M, Kang H, Kim JA, Chevee M, Lee J, Cabalinan AG, An SY, Bronckers SP, Kanold PO, Goff LA, Kim J, Brown SP

Cell Rep. 2025 Sep 23;44(9):116167. doi: 10.1016/j.celrep.2025.116167. Epub 2025 Aug 21. PMID: 40844876

To understand neocortical function, it is helpful to define cortical cell types. Recent studies indicate that neurons in the deepest cortical layer play roles in mediating thalamocortical interactions and modulating brain state and are implicated in neuropsychiatric disease. However, understanding the functions of deep layer 6 (layer 6b [L6b]) neurons has been hampered by the lack of agreed-upon definitions for these neuron types. We compared commonly used methods for defining L6b neurons, including molecular, transcriptional, and morphological approaches and transgenic mouse lines, and identified a core population of L6b neurons. This population does not innervate the thalamus, unlike layer 6 corticothalamic neurons (L6CThNs) in more superficial layer 6. Rather, single L6b neurons project ipsilaterally between cortical areas. Their intrinsic electrophysiological properties were stable after the first two postnatal weeks. The four methods we identify for defining L6b neurons enable comparisons across studies testing their contributions to cortical function.

10.1016/j.celrep.2025.116167

2. mNSF: multi-sample non-negative spatial factorization. (2025)

Wang Y, Woyshner K, Sriworarat C, Stein-O'Brien G, Goff LA, Hansen KD

Genome Biol. 2025 Jun 2;26(1):149. doi: 10.1186/s13059-025-03601-x. PMID: 40457480

Analyzing multi-sample spatial transcriptomics data requires accounting for biological variation. We present multi-sample non-negative spatial factorization (mNSF), an alignment-free framework extending single-sample spatial factorization to multi-sample datasets. mNSF incorporates sample-specific spatial correlation modeling and extracts low-dimensional data representations. Through simulations and real data analysis, we demonstrate mNSF's efficacy in identifying true factors, shared anatomical regions, and region-specific biological functions. mNSF's performance is comparable to alignment-based methods when alignment is feasible, while enabling analysis in scenarios where spatial alignment is unfeasible. mNSF shows promise as a robust method for analyzing spatially resolved transcriptomics data across multiple samples.

10.1186/s13059-025-03601-x

3. Transcriptional profiles of mouse oligodendrocyte precursor cells across the lifespan. (2025)

Heo D, Kim AA, Neumann B, Doze VN, Xu YKT, Mironova YA, Slosberg J, Goff LA, Franklin RJM, Bergles DE

Nat Aging. 2025 Apr;5(4):675-690. doi: 10.1038/s43587-025-00840-2. Epub 2025 Mar 31. PMID: 40164771

Oligodendrocyte progenitor cells (OPCs) are highly dynamic, widely distributed glial cells of the central nervous system responsible for generating myelinating oligodendrocytes throughout life. However, the rates of OPC proliferation and differentiation decline dramatically with aging, which may impair homeostasis, remyelination and adaptive myelination during learning. To determine how aging influences OPCs, we generated a transgenic mouse line (Matn4-mEGFP) and performed single-cell RNA sequencing, providing enhanced resolution of transcriptional changes during key transitions from quiescence to proliferation and differentiation across the lifespan. We found that aging induces distinct transcriptomic changes in OPCs in different states, including enhanced activation of HIF-1alpha and WNT pathways. Pharmacological inhibition of these pathways in aged OPCs was sufficient to increase their ability to differentiate in vitro. Ultimately, Matn4-mEGFP mouse line and the sequencing dataset of cortical OPCs across ages will help to define the molecular changes guiding OPC behavior in various physiological and pathological contexts.

10.1038/s43587-025-00840-2

4. Retinal ganglion cell-derived semaphorin 6A segregates starburst amacrine cell dendritic scaffolds to organize the mouse inner retina. (2024)

James RE, Hamilton NR, Huffman LN, Brown MP, Neckles VN, Pasterkamp RJ, Goff LA, Kolodkin AL

Development. 2024 Nov 15;151(22):dev204293. doi: 10.1242/dev.204293. Epub 2024 Nov 26. PMID: 39495936

To form functional circuits, neurons must settle in their appropriate cellular locations, and then project and elaborate neurites to contact their target synaptic neuropils. Laminar organization within the vertebrate retinal inner plexiform layer (IPL) facilitates pre- and postsynaptic neurite targeting, yet the precise mechanisms underlying establishment of functional IPL subdomains are not well understood. Here, we explore mechanisms defining the compartmentalization of OFF and ON neurites generally, and OFF and ON direction-selective neurites specifically, within the developing mouse IPL. We show that semaphorin 6A (Sema6A), a repulsive axon guidance cue, is required for delineation of OFF versus ON circuits within the IPL: in the Sema6a null IPL, the boundary between OFF and ON domains is blurred. Furthermore, Sema6A expressed by retinal ganglion cells (RGCs) directs laminar segregation of OFF and ON starburst amacrine cell dendritic scaffolds, which themselves serve as a substrate upon which other retinal neurites elaborate. These results demonstrate that RGCs, the first type of neuron born within the retina, play an active role in functional specialization of the IPL.

10.1242/dev.204293

5. Conserved transcriptional regulation by BRN1 and BRN2 in neocortical progenitors drives mammalian neural specification and neocortical expansion. (2024)

Barao S, Xu Y, Llongueras JP, Vistein R, Goff LA, Nielsen KJ, Bae BI, Smith RS, Walsh CA, Stein-O'Brien G, Muller U

Nat Commun. 2024 Sep 14;15(1):8043. doi: 10.1038/s41467-024-52443-x. PMID: 39271675

The neocortex varies in size and complexity among mammals due to the tremendous variability in the number and diversity of neuronal subtypes across species. The increased cellular diversity is paralleled by the expansion of the pool of neocortical progenitors and the emergence of indirect neurogenesis during brain evolution. The molecular pathways that control these biological processes and are disrupted in neurological disorders remain largely unknown. Here we show that the transcription factors BRN1 and BRN2 have an evolutionary conserved function in neocortical progenitors to control their proliferative capacity and the switch from direct to indirect neurogenesis. Functional studies in mice and ferrets show that BRN1/2 act in concert with NOTCH and primary microcephaly genes to regulate progenitor behavior. Analysis of transcriptomics data from genetically modified macaques provides evidence that these molecular pathways are conserved in non-human primates. Our findings thus demonstrate that BRN1/2 are central regulators of gene expression programs in neocortical progenitors critical to determine brain size during evolution.

10.1038/s41467-024-52443-x

6. Establishment of a conditionally reprogrammed primary eccrine sweat gland culture for evaluation of tissue-specific CFTR function. (2024)

Eastman AC, Rosson G, Kim N, Kang S, Raraigh K, Goff LA, Merlo C, Lechtzin N, Cutting GR, Sharma N

J Cyst Fibros. 2024 Nov;23(6):1173-1179. doi: 10.1016/j.jcf.2024.06.013. Epub 2024 Jul 4. PMID: 38969603

BACKGROUND: Sweat chloride concentration is used both for CF diagnosis and for tracking CFTR modulator efficacy over time, but the relationship between sweat chloride and lung health is heterogeneous and informed by CFTR genotype. Here, we endeavored to characterize ion transport in eccrine sweat glands (ESGs). METHODS: First, ESGs were microdissected from a non-CF skin donor to analyze individual glands. We established primary cultures of ESG cells via conditional reprogramming for functional testing of ion transport by short circuit current measurement and examined cell composition by single-cell RNA-sequencing (scRNA-seq) comparing with whole dissociated ESGs. Secondly, we cultured nasal epithelial (NE) cells and ESGs from two people with CF (pwCF) to assess modulator efficacy. Finally, NEs and ESGs were grown from one person with the CFTR genotype F312del/F508del to explore genotype-phenotype heterogeneity. RESULTS: ESG primary cells from individuals without CF demonstrated robust ENaC and CFTR function. scRNA-seq demonstrated both secretory and ductal ESG markers in cultured ESG cells. In both NEs and ESGs from pwCF homozygous for F508del, minimal baseline CFTR function was observed, and treatment with CFTR modulators significantly enhanced function. Notably, NEs from an individual bearing F312del/F508del exhibited significant baseline CFTR function, whereas ESGs from the same person displayed minimal CFTR function, consistent with observed phenotype. CONCLUSIONS: This study has established a novel primary culture technique for ESGs that allows for functional ion transport measurement to assess modulator efficacy and evaluate genotype-phenoytpe heterogeneity. To our knowledge, this is the first reported application of conditional reprogramming and scRNA-seq of microdissected ESGs.

10.1016/j.jcf.2024.06.013

7. Age-associated changes in lineage composition of the enteric nervous system regulate gut health and disease. (2023)

Kulkarni S, Saha M, Slosberg J, Singh A, Nagaraj S, Becker L, Zhang C, Bukowski A, Wang Z, Liu G, Leser JM, Kumar M, Bakhshi S, Anderson MJ, Lewandoski M, Vincent E, Goff LA, Pasricha PJ

Elife. 2023 Dec 18;12:RP88051. doi: 10.7554/eLife.88051. PMID: 38108810

The enteric nervous system (ENS), a collection of neural cells contained in the wall of the gut, is of fundamental importance to gastrointestinal and systemic health. According to the prevailing paradigm, the ENS arises from progenitor cells migrating from the neural crest and remains largely unchanged thereafter. Here, we show that the lineage composition of maturing ENS changes with time, with a decline in the canonical lineage of neural-crest derived neurons and their replacement by a newly identified lineage of mesoderm-derived neurons. Single cell transcriptomics and immunochemical approaches establish a distinct expression profile of mesoderm-derived neurons. The dynamic balance between the proportions of neurons from these two different lineages in the post-natal gut is dependent on the availability of their respective trophic signals, GDNF-RET and HGF-MET. With increasing age, the mesoderm-derived neurons become the dominant form of neurons in the ENS, a change associated with significant functional effects on intestinal motility which can be reversed by GDNF supplementation. Transcriptomic analyses of human gut tissues show reduced GDNF-RET signaling in patients with intestinal dysmotility which is associated with reduction in neural crest-derived neuronal markers and concomitant increase in transcriptional patterns specific to mesoderm-derived neurons. Normal intestinal function in the adult gastrointestinal tract therefore appears to require an optimal balance between these two distinct lineages within the ENS.

10.7554/eLife.88051

8. Inferring cellular and molecular processes in single-cell data with non-negative matrix factorization using Python, R and GenePattern Notebook implementations of CoGAPS. (2023)

Johnson JAI, Tsang AP, Mitchell JT, Zhou DL, Bowden J, Davis-Marcisak E, Sherman T, Liefeld T, Loth M, Goff LA, Zimmerman JW, Kinny-Koster B, Jaffee EM, Tamayo P, Mesirov JP, Reich M, Fertig EJ, Stein-O'Brien GL

Nat Protoc. 2023 Dec;18(12):3690-3731. doi: 10.1038/s41596-023-00892-x. Epub 2023 Nov 21. PMID: 37989764

Non-negative matrix factorization (NMF) is an unsupervised learning method well suited to high-throughput biology. However, inferring biological processes from an NMF result still requires additional post hoc statistics and annotation for interpretation of learned features. Here, we introduce a suite of computational tools that implement NMF and provide methods for accurate and clear biological interpretation and analysis. A generalized discussion of NMF covering its benefits, limitations and open questions is followed by four procedures for the Bayesian NMF algorithm Coordinated Gene Activity across Pattern Subsets (CoGAPS). Each procedure will demonstrate NMF analysis to quantify cell state transitions in a public domain single-cell RNA-sequencing dataset. The first demonstrates PyCoGAPS, our new Python implementation that enhances runtime for large datasets, and the second allows its deployment in Docker. The third procedure steps through the same single-cell NMF analysis using our R CoGAPS interface. The fourth introduces a beginner-friendly CoGAPS platform using GenePattern Notebook, aimed at users with a working conceptual knowledge of data analysis but without a basic proficiency in the R or Python programming language. We also constructed a user-facing website to serve as a central repository for information and instructional materials about CoGAPS and its application programming interfaces. The expected timing to setup the packages and conduct a test run is around 15 min, and an additional 30 min to conduct analyses on a precomputed result. The expected runtime on the user's desired dataset can vary from hours to days depending on factors such as dataset size or input parameters.

10.1038/s41596-023-00892-x

9. Pumping the brakes on RNA velocity by understanding and interpreting RNA velocity estimates. (2023)

Zheng SC, Stein-O'Brien G, Boukas L, Goff LA, Hansen KD

Genome Biol. 2023 Oct 26;24(1):246. doi: 10.1186/s13059-023-03065-x. PMID: 37885016

BACKGROUND: RNA velocity analysis of single cells offers the potential to predict temporal dynamics from gene expression. In many systems, RNA velocity has been observed to produce a vector field that qualitatively reflects known features of the system. However, the limitations of RNA velocity estimates are still not well understood. RESULTS: We analyze the impact of different steps in the RNA velocity workflow on direction and speed. We consider both high-dimensional velocity estimates and low-dimensional velocity vector fields mapped onto an embedding. We conclude the transition probability method for mapping velocity estimates onto an embedding is effectively interpolating in the embedding space. Our findings reveal a significant dependence of the RNA velocity workflow on smoothing via the k-nearest-neighbors (k-NN) graph of the observed data. This reliance results in considerable estimation errors for both direction and speed in both high- and low-dimensional settings when the k-NN graph fails to accurately represent the true data structure; this is an unknown feature of real data. RNA velocity performs poorly at estimating speed in both low- and high-dimensional spaces, except in very low noise settings. We introduce a novel quality measure that can identify when RNA velocity should not be used. CONCLUSIONS: Our findings emphasize the importance of choices in the RNA velocity workflow and highlight critical limitations of data analysis. We advise against over-interpreting expression dynamics using RNA velocity, particularly in terms of speed. Finally, we emphasize that the use of RNA velocity in assessing the correctness of a low-dimensional embedding is circular.

10.1186/s13059-023-03065-x

10. Normal and Sjogren's syndrome models of the murine lacrimal gland studied at single-cell resolution. (2023)

Rattner A, Heng JS, Winer BL, Goff LA, Nathans J

Proc Natl Acad Sci U S A. 2023 Oct 17;120(42):e2311983120. doi: 10.1073/pnas.2311983120. Epub 2023 Oct 9. PMID: 37812717

The lacrimal gland is of central interest in ophthalmology both as the source of the aqueous component of tear fluid and as the site of autoimmune pathology in the context of Sjogren's syndrome (SjS). To provide a foundational description of mouse lacrimal gland cell types and their patterns of gene expression, we have analyzed single-cell transcriptomes from wild-type (Balb/c) mice and from two genetically based SjS models, MRL/lpr and NOD (nonobese diabetic).H2b, and defined the localization of multiple cell-type-specific protein and mRNA markers. This analysis has uncovered a previously undescribed cell type, Car6+ cells, which are located at the junction of the acini and the connecting ducts. More than a dozen secreted polypeptides that are likely to be components of tear fluid are expressed by acinar cells and show pronounced sex differences in expression. Additional examples of gene expression heterogeneity within a single cell type were identified, including a gradient of Claudin4 along the length of the ductal system and cell-to-cell heterogeneity in transcription factor expression within acinar and myoepithelial cells. The patterns of expression of channels, transporters, and pumps in acinar, Car6+, and ductal cells make strong predictions regarding the mechanisms of water and electrolyte secretion. In MRL/lpr and NOD.H2b lacrimal glands, distinctive changes in parenchymal gene expression and in immune cell subsets reveal widespread interferon responses, a T cell-dominated infiltrate in the MRL/lpr model, and a mixed B cell and T cell infiltrate in the NOD.H2b model.

10.1073/pnas.2311983120

11. Ret deficiency decreases neural crest progenitor proliferation and restricts fate potential during enteric nervous system development. (2023)

Vincent E, Chatterjee S, Cannon GH, Auer D, Ross H, Chakravarti A, Goff LA,

Proc Natl Acad Sci U S A. 2023 Aug 22;120(34):e2211986120. doi: 10.1073/pnas.2211986120. Epub 2023 Aug 16. PMID: 37585461

The receptor tyrosine kinase RET plays a critical role in the fate specification of enteric neural crest-derived cells (ENCDCs) during enteric nervous system (ENS) development. RET loss of function (LoF) is associated with Hirschsprung disease (HSCR), which is marked by aganglionosis of the gastrointestinal (GI) tract. Although the major phenotypic consequences and the underlying transcriptional changes from Ret LoF in the developing ENS have been described, cell type- and state-specific effects are unknown. We performed single-cell RNA sequencing on an enriched population of ENCDCs from the developing GI tract of Ret null heterozygous and homozygous mice at embryonic day (E)12.5 and E14.5. We demonstrate four significant findings: 1) Ret-expressing ENCDCs are a heterogeneous population comprising ENS progenitors as well as glial- and neuronal-committed cells; 2) neurons committed to a predominantly inhibitory motor neuron developmental trajectory are not produced under Ret LoF, leaving behind a mostly excitatory motor neuron developmental program; 3) expression patterns of HSCR-associated and Ret gene regulatory network genes are impacted by Ret LoF; and 4) Ret deficiency leads to precocious differentiation and reduction in the number of proliferating ENS precursors. Our results support a model in which Ret contributes to multiple distinct cellular phenotypes during development of the ENS, including the specification of inhibitory neuron subtypes, cell cycle dynamics of ENS progenitors, and the developmental timing of neuronal and glial commitment.

10.1073/pnas.2211986120

12. Psychedelics reopen the social reward learning critical period. (2023)

Nardou R, Sawyer E, Song YJ, Wilkinson M, Padovan-Hernandez Y, de Deus JL, Wright N, Lama C, Faltin S, Goff LA, Stein-O'Brien GL, Dolen G

Nature. 2023 Jun;618(7966):790-798. doi: 10.1038/s41586-023-06204-3. Epub 2023 Jun 14. PMID: 37316665

Psychedelics are a broad class of drugs defined by their ability to induce an altered state of consciousness(1,2). These drugs have been used for millennia in both spiritual and medicinal contexts, and a number of recent clinical successes have spurred a renewed interest in developing psychedelic therapies(3-9). Nevertheless, a unifying mechanism that can account for these shared phenomenological and therapeutic properties remains unknown. Here we demonstrate in mice that the ability to reopen the social reward learning critical period is a shared property across psychedelic drugs. Notably, the time course of critical period reopening is proportional to the duration of acute subjective effects reported in humans. Furthermore, the ability to reinstate social reward learning in adulthood is paralleled by metaplastic restoration of oxytocin-mediated long-term depression in the nucleus accumbens. Finally, identification of differentially expressed genes in the 'open state' versus the 'closed state' provides evidence that reorganization of the extracellular matrix is a common downstream mechanism underlying psychedelic drug-mediated critical period reopening. Together these results have important implications for the implementation of psychedelics in clinical practice, as well as the design of novel compounds for the treatment of neuropsychiatric disease.

10.1038/s41586-023-06204-3

13. The transcription factor Tbx5 regulates direction-selective retinal ganglion cell development and image stabilization. (2022)

Al-Khindi T, Sherman MB, Kodama T, Gopal P, Pan Z, Kiraly JK, Zhang H, Goff LA, du Lac S, Kolodkin AL

Curr Biol. 2022 Oct 10;32(19):4286-4298.e5. doi: 10.1016/j.cub.2022.07.064. Epub 2022 Aug 22. PMID: 35998637

The diversity of visual input processed by the mammalian visual system requires the generation of many distinct retinal ganglion cell (RGC) types, each tuned to a particular feature. The molecular code needed to generate this cell-type diversity is poorly understood. Here, we focus on the molecules needed to specify one type of retinal cell: the upward-preferring ON direction-selective ganglion cell (up-oDSGC) of the mouse visual system. Single-cell transcriptomic profiling of up- and down-oDSGCs shows that the transcription factor Tbx5 is selectively expressed in up-oDSGCs. The loss of Tbx5 in up-oDSGCs results in a selective defect in the formation of up-oDSGCs and a corresponding inability to detect vertical motion. A downstream effector of Tbx5, Sfrp1, is also critical for vertical motion detection but not up-oDSGC formation. These results advance our understanding of the molecular mechanisms that specify a rare retinal cell type and show how disrupting this specification leads to a corresponding defect in neural circuitry and behavior.

10.1016/j.cub.2022.07.064

14. Postnatal Smad3 Inactivation in Murine Smooth Muscle Cells Elicits a Temporally and Regionally Distinct Transcriptional Response. (2022)

Bramel EE, Creamer TJ, Saqib M, Camejo Nunez WA, Bagirzadeh R, Roker LA, Goff LA, MacFarlane EG

Front Cardiovasc Med. 2022 Apr 8;9:826495. doi: 10.3389/fcvm.2022.826495. eCollection 2022. PMID: 35463747

Heterozygous, loss of function mutations in positive regulators of the Transforming Growth Factor-beta (TGF-beta) pathway cause hereditary forms of thoracic aortic aneurysm. It is unclear whether and how the initial signaling deficiency triggers secondary signaling upregulation in the remaining functional branches of the pathway, and if this contributes to maladaptive vascular remodeling. To examine this process in a mouse model in which time-controlled, partial interference with postnatal TGF-beta signaling in vascular smooth muscle cells (VSMCs) could be assessed, we used a VSMC-specific tamoxifen-inducible system, and a conditional allele, to inactivate Smad3 at 6 weeks of age, after completion of perinatal aortic development. This intervention induced dilation and histological abnormalities in the aortic root, with minor involvement of the ascending aorta. To analyze early and late events associated with disease progression, we performed a comparative single cell transcriptomic analysis at 10- and 18-weeks post-deletion, when aortic dilation is undetectable and moderate, respectively. At the early time-point, Smad3-inactivation resulted in a broad reduction in the expression of extracellular matrix components and critical components of focal adhesions, including integrins and anchoring proteins, which was reflected histologically by loss of connections between VSMCs and elastic lamellae. At the later time point, however, expression of several transcripts belonging to the same functional categories was normalized or even upregulated; this occurred in association with upregulation of transcripts coding for TGF-beta ligands, and persistent downregulation of negative regulators of the pathway. To interrogate how VSMC heterogeneity may influence this transition, we examined transcriptional changes in each of the four VSMC subclusters identified, regardless of genotype, as partly reflecting the proximal-to-distal anatomic location based on in situ RNA hybridization. The response to Smad3-deficiency varied depending on subset, and VSMC subsets over-represented in the aortic root, the site most vulnerable to dilation, most prominently upregulated TGF-beta ligands and pro-pathogenic factors such as thrombospondin-1, angiotensin converting enzyme, and pro-inflammatory mediators. These data suggest that Smad3 is required for maintenance of focal adhesions, and that loss of contacts with the extracellular matrix has consequences specific to each VSMC subset, possibly contributing to the regional susceptibility to dilation in the aorta.

10.3389/fcvm.2022.826495

15. Odorant-receptor-mediated regulation of chemosensory gene expression in the malaria mosquito Anopheles gambiae. (2022)

Maguire SE, Afify A, Goff LA, Potter CJ

Cell Rep. 2022 Mar 8;38(10):110494. doi: 10.1016/j.celrep.2022.110494. PMID: 35263579

Mosquitoes locate and approach humans based on the activity of odorant receptors (ORs) expressed on olfactory receptor neurons (ORNs). Olfactogenetic experiments in Anopheles gambiae mosquitoes revealed that the ectopic expression of an AgOR (AgOR2) in ORNs dampened the activity of the expressing neuron. This contrasts with studies in Drosophila melanogaster in which the ectopic expression of non-native ORs in ORNs confers ectopic neuronal responses without interfering with native olfactory physiology. RNA-seq analyses comparing wild-type antennae to those ectopically expressing AgOR2 in ORNs indicated that nearly all AgOR transcripts were significantly downregulated (except for AgOR2). Additional experiments suggest that AgOR2 protein rather than mRNA mediates this downregulation. Using in situ hybridization, we find that AgOR gene choice is active into adulthood and that AgOR2 expression inhibits AgORs from turning on at this late stage. Our study shows that the ORNs of Anopheles mosquitoes (in contrast to Drosophila) are sensitive to a currently unexplored mechanism of AgOR regulation.

10.1016/j.celrep.2022.110494

16. Follistatin promotes LIN28B-mediated supporting cell reprogramming and hair cell regeneration in the murine cochlea. (2022)

Li XJ, Morgan C, Goff LA, Doetzlhofer A

Sci Adv. 2022 Feb 11;8(6):eabj7651. doi: 10.1126/sciadv.abj7651. Epub 2022 Feb 11. PMID: 35148175

Hair cell (HC) loss within the inner ear cochlea is a leading cause for deafness in humans. Before the onset of hearing, immature supporting cells (SCs) in neonatal mice have some limited capacity for HC regeneration. Here, we show that in organoid culture, transient activation of the progenitor-specific RNA binding protein LIN28B and Activin antagonist follistatin (FST) enhances regenerative competence of maturing/mature cochlear SCs by reprogramming them into progenitor-like cells. Transcriptome profiling and mechanistic studies reveal that LIN28B drives SC reprogramming, while FST is required to counterbalance hyperactivation of transforming growth factor-beta-type signaling by LIN28B. Last, we show that LIN28B and FST coactivation enhances spontaneous cochlear HC regeneration in neonatal mice and that LIN28B may be part of an endogenous repair mechanism that primes SCs for HC regeneration. These findings indicate that SC dedifferentiation is critical for HC regeneration and identify LIN28B and FST as main regulators.

10.1126/sciadv.abj7651

17. Universal prediction of cell-cycle position using transfer learning. (2022)

Zheng SC, Stein-O'Brien G, Augustin JJ, Slosberg J, Carosso GA, Winer B, Shin G, Bjornsson HT, Goff LA, Hansen KD

Genome Biol. 2022 Jan 31;23(1):41. doi: 10.1186/s13059-021-02581-y. PMID: 35101061

BACKGROUND: The cell cycle is a highly conserved, continuous process which controls faithful replication and division of cells. Single-cell technologies have enabled increasingly precise measurements of the cell cycle both as a biological process of interest and as a possible confounding factor. Despite its importance and conservation, there is no universally applicable approach to infer position in the cell cycle with high-resolution from single-cell RNA-seq data. RESULTS: Here, we present tricycle, an R/Bioconductor package, to address this challenge by leveraging key features of the biology of the cell cycle, the mathematical properties of principal component analysis of periodic functions, and the use of transfer learning. We estimate a cell-cycle embedding using a fixed reference dataset and project new data into this reference embedding, an approach that overcomes key limitations of learning a dataset-dependent embedding. Tricycle then predicts a cell-specific position in the cell cycle based on the data projection. The accuracy of tricycle compares favorably to gold-standard experimental assays, which generally require specialized measurements in specifically constructed in vitro systems. Using internal controls which are available for any dataset, we show that tricycle predictions generalize to datasets with multiple cell types, across tissues, species, and even sequencing assays. CONCLUSIONS: Tricycle generalizes across datasets and is highly scalable and applicable to atlas-level single-cell RNA-seq data.

10.1186/s13059-021-02581-y

18. Differential Expression Levels of Sox9 in Early Neocortical Radial Glial Cells Regulate the Decision between Stem Cell Maintenance and Differentiation. (2021)

Fabra-Beser J, Alves Medeiros de Araujo J, Marques-Coelho D, Goff LA, Costa MR, Muller U, Gil-Sanz C

J Neurosci. 2021 Aug 18;41(33):6969-6986. doi: 10.1523/JNEUROSCI.2905-20.2021. Epub 2021 Jul 15. PMID: 34266896

Radial glial progenitor cells (RGCs) in the dorsal telencephalon directly or indirectly produce excitatory projection neurons and macroglia of the neocortex. Recent evidence shows that the pool of RGCs is more heterogeneous than originally thought and that progenitor subpopulations can generate particular neuronal cell types. Using single-cell RNA sequencing, we have studied gene expression patterns of RGCs with different neurogenic behavior at early stages of cortical development. At this early age, some RGCs rapidly produce postmitotic neurons, whereas others self-renew and undergo neurogenic divisions at a later age. We have identified candidate genes that are differentially expressed among these early RGC subpopulations, including the transcription factor Sox9. Using in utero electroporation in embryonic mice of either sex, we demonstrate that elevated Sox9 expression in progenitors affects RGC cell cycle duration and leads to the generation of upper layer cortical neurons. Our data thus reveal molecular differences between progenitor cells with different neurogenic behavior at early stages of corticogenesis and indicates that Sox9 is critical for the maintenance of RGCs to regulate the generation of upper layer neurons.SIGNIFICANCE STATEMENT The existence of heterogeneity in the pool of RGCs and its relationship with the generation of cellular diversity in the cerebral cortex has been an interesting topic of debate for many years. Here we describe the existence of RGCs with reduced neurogenic behavior at early embryonic ages presenting a particular molecular signature. This molecular signature consists of differential expression of some genes including the transcription factor Sox9, which has been found to be a specific regulator of this subpopulation of progenitor cells. Functional experiments perturbing expression levels of Sox9 reveal its instructive role in the regulation of the neurogenic behavior of RGCs and its relationship with the generation of upper layer projection neurons at later ages.

10.1523/JNEUROSCI.2905-20.2021

19. An in vivo screen of noncoding loci reveals that Daedalus is a gatekeeper of an Ikaros-dependent checkpoint during haematopoiesis. (2021)

Harman CCD, Bailis W, Zhao J, Hill L, Qu R, Jackson RP, Shyer JA, Steach HR, Kluger Y, Goff LA, Rinn JL, Williams A, Henao-Mejia J, Flavell RA

Proc Natl Acad Sci U S A. 2021 Jan 19;118(3):e1918062118. doi: 10.1073/pnas.1918062118. PMID: 33446502

Haematopoiesis relies on tightly controlled gene expression patterns as development proceeds through a series of progenitors. While the regulation of hematopoietic development has been well studied, the role of noncoding elements in this critical process is a developing field. In particular, the discovery of new regulators of lymphopoiesis could have important implications for our understanding of the adaptive immune system and disease. Here we elucidate how a noncoding element is capable of regulating a broadly expressed transcription factor, Ikaros, in a lymphoid lineage-specific manner, such that it imbues Ikaros with the ability to specify the lymphoid lineage over alternate fates. Deletion of the Daedalus locus, which is proximal to Ikaros, led to a severe reduction in early lymphoid progenitors, exerting control over the earliest fate decisions during lymphoid lineage commitment. Daedalus locus deletion led to alterations in Ikaros isoform expression and a significant reduction in Ikaros protein. The Daedalus locus may function through direct DNA interaction as Hi-C analysis demonstrated an interaction between the two loci. Finally, we identify an Ikaros-regulated erythroid-lymphoid checkpoint that is governed by Daedalus in a lymphoid-lineage-specific manner. Daedalus appears to act as a gatekeeper of Ikaros's broad lineage-specifying functions, selectively stabilizing Ikaros activity in the lymphoid lineage and permitting diversion to the erythroid fate in its absence. These findings represent a key illustration of how a transcription factor with broad lineage expression must work in concert with noncoding elements to orchestrate hematopoietic lineage commitment.

10.1073/pnas.1918062118

20. Striking heterogeneity of somatic L1 retrotransposition in single normal and cancerous gastrointestinal cells. (2020)

Yamaguchi K, Soares AO, Goff LA, Talasila A, Choi JA, Ivenitsky D, Karma S, Brophy B, Devine SE, Meltzer SJ, Kazazian HH Jr

Proc Natl Acad Sci U S A. 2020 Dec 22;117(51):32215-32222. doi: 10.1073/pnas.2019450117. Epub 2020 Dec 4. PMID: 33277430

Somatic LINE-1 (L1) retrotransposition has been detected in early embryos, adult brains, and the gastrointestinal (GI) tract, and many cancers, including epithelial GI tumors. We previously found numerous somatic L1 insertions in paired normal and GI cancerous tissues. Here, using a modified method of single-cell analysis for somatic L1 insertions, we studied adenocarcinomas of colon, pancreas, and stomach, and found a variable number of somatic L1 insertions in tumors of the same type from patient to patient. We detected no somatic L1 insertions in single cells of 5 of 10 tumors studied. In three tumors, aneuploid cells were detected by FACS. In one pancreatic tumor, there were many more L1 insertions in aneuploid than in euploid tumor cells. In one gastric cancer, both aneuploid and euploid cells contained large numbers of likely clonal insertions. However, in a second gastric cancer with aneuploid cells, no somatic L1 insertions were found. We suggest that when the cellular environment is favorable to retrotransposition, aneuploidy predisposes tumor cells to L1 insertions, and retrotransposition may occur at the transition from euploidy to aneuploidy. Seventeen percent of insertions were also present in normal cells, similar to findings in genomic DNA from normal tissues of GI tumor patients. We provide evidence that: 1) The number of L1 insertions in tumors of the same type is highly variable, 2) most somatic L1 insertions in GI cancer tissues are absent from normal tissues, and 3) under certain conditions, somatic L1 retrotransposition exhibits a propensity for occurring in aneuploid cells.

10.1073/pnas.2019450117

21. Parallel Social Information Processing Circuits Are Differentially Impacted in Autism. (2020)

Lewis EM, Stein-O'Brien GL, Patino AV, Nardou R, Grossman CD, Brown M, Bangamwabo B, Ndiaye N, Giovinazzo D, Dardani I, Jiang C, Goff LA, Dolen G

Neuron. 2020 Nov 25;108(4):659-675.e6. doi: 10.1016/j.neuron.2020.10.002. Epub 2020 Oct 27. PMID: 33113347

Parallel processing circuits are thought to dramatically expand the network capabilities of the nervous system. Magnocellular and parvocellular oxytocin neurons have been proposed to subserve two parallel streams of social information processing, which allow a single molecule to encode a diverse array of ethologically distinct behaviors. Here we provide the first comprehensive characterization of magnocellular and parvocellular oxytocin neurons in male mice, validated across anatomical, projection target, electrophysiological, and transcriptional criteria. We next use novel multiple feature selection tools in Fmr1-KO mice to provide direct evidence that normal functioning of the parvocellular but not magnocellular oxytocin pathway is required for autism-relevant social reward behavior. Finally, we demonstrate that autism risk genes are enriched in parvocellular compared with magnocellular oxytocin neurons. Taken together, these results provide the first evidence that oxytocin-pathway-specific pathogenic mechanisms account for social impairments across a broad range of autism etiologies.

10.1016/j.neuron.2020.10.002

22. Developmental, cellular, and behavioral phenotypes in a mouse model of congenital hypoplasia of the dentate gyrus. (2020)

Rattner A, Terrillion CE, Jou C, Kleven T, Hu SF, Williams J, Hou Z, Aggarwal M, Mori S, Shin G, Goff LA, Witter MP, Pletnikov M, Fenton AA, Nathans J

Elife. 2020 Oct 21;9:e62766. doi: 10.7554/eLife.62766. PMID: 33084572

In the hippocampus, a widely accepted model posits that the dentate gyrus improves learning and memory by enhancing discrimination between inputs. To test this model, we studied conditional knockout mice in which the vast majority of dentate granule cells (DGCs) fail to develop - including nearly all DGCs in the dorsal hippocampus - secondary to eliminating Wntless (Wls) in a subset of cortical progenitors with Gfap-Cre. Other cells in the Wls(fl/-);Gfap-Cre hippocampus were minimally affected, as determined by single nucleus RNA sequencing. CA3 pyramidal cells, the targets of DGC-derived mossy fibers, exhibited normal morphologies with a small reduction in the numbers of synaptic spines. Wls(fl/-);Gfap-Cre mice have a modest performance decrement in several complex spatial tasks, including active place avoidance. They were also modestly impaired in one simpler spatial task, finding a visible platform in the Morris water maze. These experiments support a role for DGCs in enhancing spatial learning and memory.

10.7554/eLife.62766

23. Screening non-MAPT genes of the Chr17q21 H1 haplotype in Parkinson's disease. (2020)

Soto-Beasley AI, Walton RL, Valentino RR, Hook PW, Labbe C, Heckman MG, Johnson PW, Goff LA, Uitti RJ, McLean PJ, Springer W, McCallion AS, Wszolek ZK, Ross OA

Parkinsonism Relat Disord. 2020 Sep;78:138-144. doi: 10.1016/j.parkreldis.2020.07.022. Epub 2020 Aug 1. PMID: 32829096

INTRODUCTION: The microtubule-associated protein tau (MAPT) gene is considered a strong genetic risk factor for Parkinson's disease (PD) in Caucasians. MAPT is located within an inversion region of high linkage disequilibrium designated as H1 and H2 haplotype, and contains eight other genes which have been implicated in neurodegeneration. The aim of the current study was to identify common coding variants in strong linkage disequilibrium (LD) within the associated loci on chr17q21 harboring MAPT. METHODS: Sanger sequencing of coding exons in 90 Caucasian late-onset PD (LOPD) patients was performed. Specific gene sequencing for LRRC37A, LRRC37A2, ARL17A and ARL17B was not possible given the high homology, presence of pseudogenes and copy number variants that are in the region, and therefore four genes (NSF, KANSL1, SPPL2C, and CRHR1) were included in the analysis. Coding variants from these four genes that did not perfectly tag (r(2) = 1) the MAPT H1/H2 haplotype were genotyped in an independent replication series of Caucasian PD cases (N = 851) and controls (N = 730). RESULTS: In the 90 LOPD cases we identified 30 coding variants. Eleven non-synonymous variants tagged the MAPT H1/H2 haplotype, including two SPPL2C variants (rs12185233 and rs12373123) that had high pathogenic combined annotation dependent depletion (CADD) scores of >20. In the replication series, the non-synonymous KANSL1 rs17585974 variant was in very strong LD with MAPT H1/H2 and had a high CADD score of 24.7. CONCLUSION: We have identified several non-synonymous variants across neighboring genes of MAPT that may warrant further genetic and functional investigation within the biological etiology of PD.

10.1016/j.parkreldis.2020.07.022

24. Single-Cell Analysis of Human Retina Identifies Evolutionarily Conserved and Species-Specific Mechanisms Controlling Development. (2020)

Lu Y, Shiau F, Yi W, Lu S, Wu Q, Pearson JD, Kallman A, Zhong S, Hoang T, Zuo Z, Zhao F, Zhang M, Tsai N, Zhuo Y, He S, Zhang J, Stein-O'Brien GL, Sherman TD, Duan X, Fertig EJ, Goff LA, Zack DJ, Handa JT, Xue T, Bremner R, Blackshaw S, Wang X, Clark BS

Dev Cell. 2020 May 18;53(4):473-491.e9. doi: 10.1016/j.devcel.2020.04.009. Epub 2020 May 7. PMID: 32386599

The development of single-cell RNA sequencing (scRNA-seq) has allowed high-resolution analysis of cell-type diversity and transcriptional networks controlling cell-fate specification. To identify the transcriptional networks governing human retinal development, we performed scRNA-seq analysis on 16 time points from developing retina as well as four early stages of retinal organoid differentiation. We identified evolutionarily conserved patterns of gene expression during retinal progenitor maturation and specification of all seven major retinal cell types. Furthermore, we identified gene-expression differences between developing macula and periphery and between distinct populations of horizontal cells. We also identified species-specific patterns of gene expression during human and mouse retinal development. Finally, we identified an unexpected role for ATOH7 expression in regulation of photoreceptor specification during late retinogenesis. These results provide a roadmap to future studies of human retinal development and may help guide the design of cell-based therapies for treating retinal dystrophies.

10.1016/j.devcel.2020.04.009

25. Mitoregulin Controls beta-Oxidation in Human and Mouse Adipocytes. (2020)

Friesen M, Warren CR, Yu H, Toyohara T, Ding Q, Florido MHC, Sayre C, Pope BD, Goff LA, Rinn JL, Cowan CA

Stem Cell Reports. 2020 Apr 14;14(4):590-602. doi: 10.1016/j.stemcr.2020.03.002. Epub 2020 Apr 2. PMID: 32243843

We previously discovered in mouse adipocytes an lncRNA (the homolog of human LINC00116) regulating adipogenesis that contains a highly conserved coding region. Here, we show human protein expression of a peptide within LINC00116, and demonstrate that this peptide modulates triglyceride clearance in human adipocytes by regulating lipolysis and mitochondrial beta-oxidation. This gene has previously been identified as mitoregulin (MTLN). We conclude that MTLN has a regulatory role in adipocyte metabolism as demonstrated by systemic lipid phenotypes in knockout mice. We also assert its adipocyte-autonomous phenotypes in both isolated murine adipocytes as well as human stem cell-derived adipocytes. MTLN directly interacts with the beta subunit of the mitochondrial trifunctional protein, an enzyme critical in the beta-oxidation of long-chain fatty acids. Our human and murine models contend that MTLN could be an avenue for further therapeutic research, albeit not without caveats, for example, by promoting white adipocyte triglyceride clearance in obese subjects.

10.1016/j.stemcr.2020.03.002

26. projectR: an R/Bioconductor package for transfer learning via PCA, NMF, correlation and clustering. (2020)

Sharma G, Colantuoni C, Goff LA, Fertig EJ, Stein-O'Brien G

Bioinformatics. 2020 Jun 1;36(11):3592-3593. doi: 10.1093/bioinformatics/btaa183. PMID: 32167521

MOTIVATION: Dimension reduction techniques are widely used to interpret high-dimensional biological data. Features learned from these methods are used to discover both technical artifacts and novel biological phenomena. Such feature discovery is critically importent in analysis of large single-cell datasets, where lack of a ground truth limits validation and interpretation. Transfer learning (TL) can be used to relate the features learned from one source dataset to a new target dataset to perform biologically driven validation by evaluating their use in or association with additional sample annotations in that independent target dataset. RESULTS: We developed an R/Bioconductor package, projectR, to perform TL for analyses of genomics data via TL of clustering, correlation and factorization methods. We then demonstrate the utility TL for integrated data analysis with an example for spatial single-cell analysis. AVAILABILITY AND IMPLEMENTATION: projectR is available on Bioconductor and at https://github.com/genesofeve/projectR. CONTACT: gsteinobrien@jhmi.edu or ejfertig@jhmi.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

10.1093/bioinformatics/btaa183

27. Comprehensive analysis of a mouse model of spontaneous uveoretinitis using single-cell RNA sequencing. (2019)

Heng JS, Hackett SF, Stein-O'Brien GL, Winer BL, Williams J, Goff LA, Nathans J

Proc Natl Acad Sci U S A. 2019 Dec 26;116(52):26734-26744. doi: 10.1073/pnas.1915571116. Epub 2019 Dec 16. PMID: 31843893

Autoimmune uveoretinitis is a significant cause of visual loss, and mouse models offer unique opportunities to study its disease mechanisms. Aire(-/-) mice fail to express self-antigens in the thymus, exhibit reduced central tolerance, and develop a spontaneous, chronic, and progressive uveoretinitis. Using single-cell RNA sequencing (scRNA-seq), we characterized wild-type and Aire(-/-) retinas to define, in a comprehensive and unbiased manner, the cell populations and gene expression patterns associated with disease. Based on scRNA-seq, immunostaining, and in situ hybridization, we infer that 1) the dominant effector response in Aire(-/-) retinas is Th1-driven, 2) a subset of monocytes convert to either a macrophage/microglia state or a dendritic cell state, 3) the development of tertiary lymphoid structures constitutes part of the Aire(-/-) retinal phenotype, 4) all major resident retinal cell types respond to interferon gamma (IFNG) by changing their patterns of gene expression, and 5) Muller glia up-regulate specific genes in response to IFN gamma and may act as antigen-presenting cells.

10.1073/pnas.1915571116

28. Increased expression of anion transporter SLC26A9 delays diabetes onset in cystic fibrosis. (2020)

Lam AN, Aksit MA, Vecchio-Pagan B, Shelton CA, Osorio DL, Anzmann AF, Goff LA, Whitcomb DC, Blackman SM, Cutting GR

J Clin Invest. 2020 Jan 2;130(1):272-286. doi: 10.1172/JCI129833. PMID: 31581148

Diabetes is a common complication of cystic fibrosis (CF) that affects approximately 20% of adolescents and 40%-50% of adults with CF. The age at onset of CF-related diabetes (CFRD) (marked by clinical diagnosis and treatment initiation) is an important measure of the disease process. DNA variants associated with age at onset of CFRD reside in and near SLC26A9. Deep sequencing of the SLC26A9 gene in 762 individuals with CF revealed that 2 common DNA haplotypes formed by the risk variants account for the association with diabetes. Single-cell RNA sequencing (scRNA-Seq) indicated that SLC26A9 is predominantly expressed in pancreatic ductal cells and frequently coexpressed with CF transmembrane conductance regulator (CFTR) along with transcription factors that have binding sites 5' of SLC26A9. These findings were replicated upon reanalysis of scRNA-Seq data from 4 independent studies. DNA fragments derived from the 5' region of SLC26A9-bearing variants from the low-risk haplotype generated 12%-20% higher levels of expression in PANC-1 and CFPAC-1 cells compared with the high- risk haplotype. Taken together, our findings indicate that an increase in SLC26A9 expression in ductal cells of the pancreas delays the age at onset of diabetes, suggesting a CFTR-agnostic treatment for a major complication of CF.

10.1172/JCI129833

29. A screen of 1,049 schizophrenia and 30 Alzheimer's-associated variants for regulatory potential. (2020)

Myint L, Wang R, Boukas L, Hansen KD, Goff LA, Avramopoulos D

Am J Med Genet B Neuropsychiatr Genet. 2020 Jan;183(1):61-73. doi: 10.1002/ajmg.b.32761. Epub 2019 Sep 10. PMID: 31503409

Recent genome-wide association studies (GWAS) identified numerous schizophrenia (SZ) and Alzheimer's disease (AD) associated loci, most outside protein-coding regions and hypothesized to affect gene transcription. We used a massively parallel reporter assay to screen, 1,049 SZ and 30 AD variants in 64 and nine loci, respectively for allele differences in driving reporter gene expression. A library of synthetic oligonucleotides assaying each allele five times was transfected into K562 chronic myelogenous leukemia lymphoblasts and SK-SY5Y human neuroblastoma cells. One hundred forty eight variants showed allelic differences in K562 and 53 in SK-SY5Y cells, on average 2.6 variants per locus. Nine showed significant differences in both lines, a modest overlap reflecting different regulatory landscapes of these lines that also differ significantly in chromatin marks. Eight of nine were in the same direction. We observe no preference for risk alleles to increase or decrease expression. We find a positive correlation between the number of SNPs in linkage disequilibrium and the proportion of functional SNPs supporting combinatorial effects that may lead to haplotype selection. Our results prioritize future functional follow up of disease associated SNPs to determine the driver GWAS variant(s), at each locus and enhance our understanding of gene regulation dynamics.

10.1002/ajmg.b.32761

30. Precocious neuronal differentiation and disrupted oxygen responses in Kabuki syndrome. (2019)

Carosso GA, Boukas L, Augustin JJ, Nguyen HN, Winer BL, Cannon GH, Robertson JD, Zhang L, Hansen KD, Goff LA, Bjornsson HT

JCI Insight. 2019 Oct 17;4(20):e129375. doi: 10.1172/jci.insight.129375. PMID: 31465303

Chromatin modifiers act to coordinate gene expression changes critical to neuronal differentiation from neural stem/progenitor cells (NSPCs). Lysine-specific methyltransferase 2D (KMT2D) encodes a histone methyltransferase that promotes transcriptional activation and is frequently mutated in cancers and in the majority (>70%) of patients diagnosed with the congenital, multisystem intellectual disability disorder Kabuki syndrome 1 (KS1). Critical roles for KMT2D are established in various non-neural tissues, but the effects of KMT2D loss in brain cell development have not been described. We conducted parallel studies of proliferation, differentiation, transcription, and chromatin profiling in KMT2D-deficient human and mouse models to define KMT2D-regulated functions in neurodevelopmental contexts, including adult-born hippocampal NSPCs in vivo and in vitro. We report cell-autonomous defects in proliferation, cell cycle, and survival, accompanied by early NSPC maturation in several KMT2D-deficient model systems. Transcriptional suppression in KMT2D-deficient cells indicated strong perturbation of hypoxia-responsive metabolism pathways. Functional experiments confirmed abnormalities of cellular hypoxia responses in KMT2D-deficient neural cells and accelerated NSPC maturation in vivo. Together, our findings support a model in which loss of KMT2D function suppresses expression of oxygen-responsive gene programs important to neural progenitor maintenance, resulting in precocious neuronal differentiation in a mouse model of KS1.

10.1172/jci.insight.129375

31. Differential Variation Analysis Enables Detection of Tumor Heterogeneity Using Single-Cell RNA-Sequencing Data. (2019)

Davis-Marcisak EF, Sherman TD, Orugunta P, Stein-O'Brien GL, Puram SV, Roussos Torres ET, Hopkins AC, Jaffee EM, Favorov AV, Afsari B, Goff LA, Fertig EJ

Cancer Res. 2019 Oct 1;79(19):5102-5112. doi: 10.1158/0008-5472.CAN-18-3882. Epub 2019 Jul 23. PMID: 31337651

Tumor heterogeneity provides a complex challenge to cancer treatment and is a critical component of therapeutic response, disease recurrence, and patient survival. Single-cell RNA-sequencing (scRNA-seq) technologies have revealed the prevalence of intratumor and intertumor heterogeneity. Computational techniques are essential to quantify the differences in variation of these profiles between distinct cell types, tumor subtypes, and patients to fully characterize intratumor and intertumor molecular heterogeneity. In this study, we adapted our algorithm for pathway dysregulation, Expression Variation Analysis (EVA), to perform multivariate statistical analyses of differential variation of expression in gene sets for scRNA-seq. EVA has high sensitivity and specificity to detect pathways with true differential heterogeneity in simulated data. EVA was applied to several public domain scRNA-seq tumor datasets to quantify the landscape of tumor heterogeneity in several key applications in cancer genomics such as immunogenicity, metastasis, and cancer subtypes. Immune pathway heterogeneity of hematopoietic cell populations in breast tumors corresponded to the amount of diversity present in the T-cell repertoire of each individual. Cells from head and neck squamous cell carcinoma (HNSCC) primary tumors had significantly more heterogeneity across pathways than cells from metastases, consistent with a model of clonal outgrowth. Moreover, there were dramatic differences in pathway dysregulation across HNSCC basal primary tumors. Within the basal primary tumors, there was increased immune dysregulation in individuals with a high proportion of fibroblasts present in the tumor microenvironment. These results demonstrate the broad utility of EVA to quantify intertumor and intratumor heterogeneity from scRNA-seq data without reliance on low-dimensional visualization. SIGNIFICANCE: This study presents a robust statistical algorithm for evaluating gene expression heterogeneity within pathways or gene sets in single-cell RNA-seq data.

10.1158/0008-5472.CAN-18-3882

32. Single-Cell RNA-Seq Analysis of Retinal Development Identifies NFI Factors as Regulating Mitotic Exit and Late-Born Cell Specification. (2019)

Clark BS, Stein-O'Brien GL, Shiau F, Cannon GH, Davis-Marcisak E, Sherman T, Santiago CP, Hoang TV, Rajaii F, James-Esposito RE, Gronostajski RM, Fertig EJ, Goff LA, Blackshaw S

Neuron. 2019 Jun 19;102(6):1111-1126.e5. doi: 10.1016/j.neuron.2019.04.010. Epub 2019 May 22. PMID: 31128945

Precise temporal control of gene expression in neuronal progenitors is necessary for correct regulation of neurogenesis and cell fate specification. However, the cellular heterogeneity of the developing CNS has posed a major obstacle to identifying the gene regulatory networks that control these processes. To address this, we used single-cell RNA sequencing to profile ten developmental stages encompassing the full course of retinal neurogenesis. This allowed us to comprehensively characterize changes in gene expression that occur during initiation of neurogenesis, changes in developmental competence, and specification and differentiation of each major retinal cell type. We identify the NFI transcription factors (Nfia, Nfib, and Nfix) as selectively expressed in late retinal progenitor cells and show that they control bipolar interneuron and Muller glia cell fate specification and promote proliferative quiescence.

10.1016/j.neuron.2019.04.010

33. Decomposing Cell Identity for Transfer Learning across Cellular Measurements, Platforms, Tissues, and Species. (2019)

Stein-O'Brien GL, Clark BS, Sherman T, Zibetti C, Hu Q, Sealfon R, Liu S, Qian J, Colantuoni C, Blackshaw S, Goff LA, Fertig EJ

Cell Syst. 2019 May 22;8(5):395-411.e8. doi: 10.1016/j.cels.2019.04.004. PMID: 31121116

Analysis of gene expression in single cells allows for decomposition of cellular states as low-dimensional latent spaces. However, the interpretation and validation of these spaces remains a challenge. Here, we present scCoGAPS, which defines latent spaces from a source single-cell RNA-sequencing (scRNA-seq) dataset, and projectR, which evaluates these latent spaces in independent target datasets via transfer learning. Application of developing mouse retina to scRNA-Seq reveals intrinsic relationships across biological contexts and assays while avoiding batch effects and other technical features. We compare the dimensions learned in this source dataset to adult mouse retina, a time-course of human retinal development, select scRNA-seq datasets from developing brain, chromatin accessibility data, and a murine-cell type atlas to identify shared biological features. These tools lay the groundwork for exploratory analysis of scRNA-seq data via latent space representations, enabling a shift in how we compare and identify cells beyond reliance on marker genes or ensemble molecular identity.

10.1016/j.cels.2019.04.004

34. Hypoxia tolerance in the Norrin-deficient retina and the chronically hypoxic brain studied at single-cell resolution. (2019)

Heng JS, Rattner A, Stein-O'Brien GL, Winer BL, Jones BW, Vernon HJ, Goff LA, Nathans J

Proc Natl Acad Sci U S A. 2019 Apr 30;116(18):9103-9114. doi: 10.1073/pnas.1821122116. Epub 2019 Apr 15. PMID: 30988181

The mammalian CNS is capable of tolerating chronic hypoxia, but cell type-specific responses to this stress have not been systematically characterized. In the Norrin KO (Ndp(KO) ) mouse, a model of familial exudative vitreoretinopathy (FEVR), developmental hypovascularization of the retina produces chronic hypoxia of inner nuclear-layer (INL) neurons and Muller glia. We used single-cell RNA sequencing, untargeted metabolomics, and metabolite labeling from (13)C-glucose to compare WT and Ndp(KO) retinas. In Ndp(KO) retinas, we observe gene expression responses consistent with hypoxia in Muller glia and retinal neurons, and we find a metabolic shift that combines reduced flux through the TCA cycle with increased synthesis of serine, glycine, and glutathione. We also used single-cell RNA sequencing to compare the responses of individual cell types in Ndp(KO) retinas with those in the hypoxic cerebral cortex of mice that were housed for 1 week in a reduced oxygen environment (7.5% oxygen). In the hypoxic cerebral cortex, glial transcriptome responses most closely resemble the response of Muller glia in the Ndp(KO) retina. In both retina and brain, vascular endothelial cells activate a previously dormant tip cell gene expression program, which likely underlies the adaptive neoangiogenic response to chronic hypoxia. These analyses of retina and brain transcriptomes at single-cell resolution reveal both shared and cell type-specific changes in gene expression in response to chronic hypoxia, implying both shared and distinct cell type-specific physiologic responses.

10.1073/pnas.1821122116

35. Linear models enable powerful differential activity analysis in massively parallel reporter assays. (2019)

Myint L, Avramopoulos DG, Goff LA, Hansen KD

BMC Genomics. 2019 Mar 12;20(1):209. doi: 10.1186/s12864-019-5556-x. PMID: 30866806

BACKGROUND: Massively parallel reporter assays (MPRAs) have emerged as a popular means for understanding noncoding variation in a variety of conditions. While a large number of experiments have been described in the literature, analysis typically uses ad-hoc methods. There has been little attention to comparing performance of methods across datasets. RESULTS: We present the mpralm method which we show is calibrated and powerful, by analyzing its performance on multiple MPRA datasets. We show that it outperforms existing statistical methods for analysis of this data type, in the first comprehensive evaluation of statistical methods on several datasets. We investigate theoretical and real-data properties of barcode summarization methods and show an unappreciated impact of summarization method for some datasets. Finally, we use our model to conduct a power analysis for this assay and show substantial improvements in power by performing up to 6 replicates per condition, whereas sequencing depth has smaller impact; we recommend to always use at least 4 replicates. An R package is available from the Bioconductor project. CONCLUSIONS: Together, these results inform recommendations for differential analysis, general group comparisons, and power analysis and will help improve design and analysis of MPRA experiments.

10.1186/s12864-019-5556-x

36. Transcriptional and epigenomic landscapes of CNS and non-CNS vascular endothelial cells. (2018)

Sabbagh MF, Heng JS, Luo C, Castanon RG, Nery JR, Rattner A, Goff LA, Ecker JR, Nathans J

Elife. 2018 Sep 6;7:e36187. doi: 10.7554/eLife.36187. PMID: 30188322

Vascular endothelial cell (EC) function depends on appropriate organ-specific molecular and cellular specializations. To explore genomic mechanisms that control this specialization, we have analyzed and compared the transcriptome, accessible chromatin, and DNA methylome landscapes from mouse brain, liver, lung, and kidney ECs. Analysis of transcription factor (TF) gene expression and TF motifs at candidate cis-regulatory elements reveals both shared and organ-specific EC regulatory networks. In the embryo, only those ECs that are adjacent to or within the central nervous system (CNS) exhibit canonical Wnt signaling, which correlates precisely with blood-brain barrier (BBB) differentiation and Zic3 expression. In the early postnatal brain, single-cell RNA-seq of purified ECs reveals (1) close relationships between veins and mitotic cells and between arteries and tip cells, (2) a division of capillary ECs into vein-like and artery-like classes, and (3) new endothelial subtype markers, including new validated tip cell markers.

10.7554/eLife.36187

37. Enter the Matrix: Factorization Uncovers Knowledge from Omics. (2018)

Stein-O'Brien GL, Arora R, Culhane AC, Favorov AV, Garmire LX, Greene CS, Goff LA, Li Y, Ngom A, Ochs MF, Xu Y, Fertig EJ

Trends Genet. 2018 Oct;34(10):790-805. doi: 10.1016/j.tig.2018.07.003. Epub 2018 Aug 22. PMID: 30143323

Omics data contain signals from the molecular, physical, and kinetic inter- and intracellular interactions that control biological systems. Matrix factorization (MF) techniques can reveal low-dimensional structure from high-dimensional data that reflect these interactions. These techniques can uncover new biological knowledge from diverse high-throughput omics data in applications ranging from pathway discovery to timecourse analysis. We review exemplary applications of MF for systems-level analyses. We discuss appropriate applications of these methods, their limitations, and focus on the analysis of results to facilitate optimal biological interpretation. The inference of biologically relevant features with MF enables discovery from high-throughput data beyond the limits of current biological knowledge - answering questions from high-dimensional data that we have not yet thought to ask.

10.1016/j.tig.2018.07.003

38. Single-Cell RNA-Seq of Mouse Dopaminergic Neurons Informs Candidate Gene Selection for Sporadic Parkinson Disease. (2018)

Hook PW, McClymont SA, Cannon GH, Law WD, Morton AJ, Goff LA, McCallion AS

Am J Hum Genet. 2018 Mar 1;102(3):427-446. doi: 10.1016/j.ajhg.2018.02.001. PMID: 29499164

Genetic variation modulating risk of sporadic Parkinson disease (PD) has been primarily explored through genome-wide association studies (GWASs). However, like many other common genetic diseases, the impacted genes remain largely unknown. Here, we used single-cell RNA-seq to characterize dopaminergic (DA) neuron populations in the mouse brain at embryonic and early postnatal time points. These data facilitated unbiased identification of DA neuron subpopulations through their unique transcriptional profiles, including a postnatal neuroblast population and substantia nigra (SN) DA neurons. We use these population-specific data to develop a scoring system to prioritize candidate genes in all 49 GWAS intervals implicated in PD risk, including genes with known PD associations and many with extensive supporting literature. As proof of principle, we confirm that the nigrostriatal pathway is compromised in Cplx1-null mice. Ultimately, this systematic approach establishes biologically pertinent candidates and testable hypotheses for sporadic PD, informing a new era of PD genetic research.

10.1016/j.ajhg.2018.02.001

39. Variation in Activity State, Axonal Projection, and Position Define the Transcriptional Identity of Individual Neocortical Projection Neurons. (2018)

Chevee M, Robertson JJ, Cannon GH, Brown SP, Goff LA,

Cell Rep. 2018 Jan 9;22(2):441-455. doi: 10.1016/j.celrep.2017.12.046. PMID: 29320739

Single-cell RNA sequencing has generated catalogs of transcriptionally defined neuronal subtypes of the brain. However, the cellular processes that contribute to neuronal subtype specification and transcriptional heterogeneity remain unclear. By comparing the gene expression profiles of single layer 6 corticothalamic neurons in somatosensory cortex, we show that transcriptional subtypes primarily reflect axonal projection pattern, laminar position within the cortex, and neuronal activity state. Pseudotemporal ordering of 1,023 cellular responses to sensory manipulation demonstrates that changes in expression of activity-induced genes both reinforced cell-type identity and contributed to increased transcriptional heterogeneity within each cell type. This is due to cell-type biased choices of transcriptional states following manipulation of neuronal activity. These results reveal that axonal projection pattern, laminar position, and activity state define significant axes of variation that contribute both to the transcriptional identity of individual neurons and to the transcriptional heterogeneity within each neuronal subtype.

10.1016/j.celrep.2017.12.046

40. Group 1 Innate Lymphoid Cell Lineage Identity Is Determined by a cis-Regulatory Element Marked by a Long Non-coding RNA. (2017)

Mowel WK, McCright SJ, Kotzin JJ, Collet MA, Uyar A, Chen X, DeLaney A, Spencer SP, Virtue AT, Yang E, Villarino A, Kurachi M, Dunagin MC, Pritchard GH, Stein J, Hughes C, Fonseca-Pereira D, Veiga-Fernandes H, Raj A, Kambayashi T, Brodsky IE, O'Shea JJ, Wherry EJ, Goff LA, Rinn JL, Williams A, Flavell RA, Henao-Mejia J

Immunity. 2017 Sep 19;47(3):435-449.e8. doi: 10.1016/j.immuni.2017.08.012. PMID: 28930659

Commitment to the innate lymphoid cell (ILC) lineage is determined by Id2, a transcriptional regulator that antagonizes T and B cell-specific gene expression programs. Yet how Id2 expression is regulated in each ILC subset remains poorly understood. We identified a cis-regulatory element demarcated by a long non-coding RNA (lncRNA) that controls the function and lineage identity of group 1 ILCs, while being dispensable for early ILC development and homeostasis of ILC2s and ILC3s. The locus encoding this lncRNA, which we termed Rroid, directly interacted with the promoter of its neighboring gene, Id2, in group 1 ILCs. Moreover, the Rroid locus, but not the lncRNA itself, controlled the identity and function of ILC1s by promoting chromatin accessibility and deposition of STAT5 at the promoter of Id2 in response to interleukin (IL)-15. Thus, non-coding elements responsive to extracellular cues unique to each ILC subset represent a key regulatory layer for controlling the identity and function of ILCs.

10.1016/j.immuni.2017.08.012

41. Changes in the Excitability of Neocortical Neurons in a Mouse Model of Amyotrophic Lateral Sclerosis Are Not Specific to Corticospinal Neurons and Are Modulated by Advancing Disease. (2017)

Kim J, Hughes EG, Shetty AS, Arlotta P, Goff LA, Bergles DE, Brown SP

J Neurosci. 2017 Sep 13;37(37):9037-9053. doi: 10.1523/JNEUROSCI.0811-17.2017. Epub 2017 Aug 17. PMID: 28821643

Cell type-specific changes in neuronal excitability have been proposed to contribute to the selective degeneration of corticospinal neurons in amyotrophic lateral sclerosis (ALS) and to neocortical hyperexcitability, a prominent feature of both inherited and sporadic variants of the disease, but the mechanisms underlying selective loss of specific cell types in ALS are not known. We analyzed the physiological properties of distinct classes of cortical neurons in the motor cortex of hSOD1(G93A) mice of both sexes and found that they all exhibit increases in intrinsic excitability that depend on disease stage. Targeted recordings and in vivo calcium imaging further revealed that neurons adapt their functional properties to normalize cortical excitability as the disease progresses. Although different neuron classes all exhibited increases in intrinsic excitability, transcriptional profiling indicated that the molecular mechanisms underlying these changes are cell type specific. The increases in excitability in both excitatory and inhibitory cortical neurons show that selective dysfunction of neuronal cell types cannot account for the specific vulnerability of corticospinal motor neurons in ALS. Furthermore, the stage-dependent alterations in neuronal function highlight the ability of cortical circuits to adapt as disease progresses. These findings show that both disease stage and cell type must be considered when developing therapeutic strategies for treating ALS.SIGNIFICANCE STATEMENT It is not known why certain classes of neurons preferentially die in different neurodegenerative diseases. It has been proposed that the enhanced excitability of affected neurons is a major contributor to their selective loss. We show using a mouse model of amyotrophic lateral sclerosis (ALS), a disease in which corticospinal neurons exhibit selective vulnerability, that changes in excitability are not restricted to this neuronal class and that excitability does not increase monotonically with disease progression. Moreover, although all neuronal cell types tested exhibited abnormal functional properties, analysis of their gene expression demonstrated cell type-specific responses to the ALS-causing mutation. These findings suggest that therapies for ALS may need to be tailored for different cell types and stages of disease.

10.1523/JNEUROSCI.0811-17.2017

42. A ketogenic diet rescues hippocampal memory defects in a mouse model of Kabuki syndrome. (2017)

Benjamin JS, Pilarowski GO, Carosso GA, Zhang L, Huso DL, Goff LA, Vernon HJ, Hansen KD, Bjornsson HT

Proc Natl Acad Sci U S A. 2017 Jan 3;114(1):125-130. doi: 10.1073/pnas.1611431114. Epub 2016 Dec 20. PMID: 27999180

Kabuki syndrome is a Mendelian intellectual disability syndrome caused by mutations in either of two genes (KMT2D and KDM6A) involved in chromatin accessibility. We previously showed that an agent that promotes chromatin opening, the histone deacetylase inhibitor (HDACi) AR-42, ameliorates the deficiency of adult neurogenesis in the granule cell layer of the dentate gyrus and rescues hippocampal memory defects in a mouse model of Kabuki syndrome (Kmt2d(+/betaGeo)). Unlike a drug, a dietary intervention could be quickly transitioned to the clinic. Therefore, we have explored whether treatment with a ketogenic diet could lead to a similar rescue through increased amounts of beta-hydroxybutyrate, an endogenous HDACi. Here, we report that a ketogenic diet in Kmt2d(+/betaGeo) mice modulates H3ac and H3K4me3 in the granule cell layer, with concomitant rescue of both the neurogenesis defect and hippocampal memory abnormalities seen in Kmt2d(+/betaGeo) mice; similar effects on neurogenesis were observed on exogenous administration of beta-hydroxybutyrate. These data suggest that dietary modulation of epigenetic modifications through elevation of beta-hydroxybutyrate may provide a feasible strategy to treat the intellectual disability seen in Kabuki syndrome and related disorders.

10.1073/pnas.1611431114

43. The DPYSL2 gene connects mTOR and schizophrenia. (2016)

Pham X, Song G, Lao S, Goff LA, Zhu H, Valle D, Avramopoulos D

Transl Psychiatry. 2016 Nov 1;6(11):e933. doi: 10.1038/tp.2016.204. PMID: 27801893

We previously reported a schizophrenia-associated polymorphic CT di-nucleotide repeat (DNR) at the 5'-untranslated repeat (UTR) of DPYSL2, which responds to mammalian target of Rapamycin (mTOR) signaling with allelic differences in reporter assays. Now using microarray analysis, we show that the DNR alleles interact differentially with specific proteins, including the mTOR-related protein HuD/ELAVL4. We confirm the differential binding to HuD and other known mTOR effectors by electrophoretic mobility shift assays. We edit HEK293 cells by CRISPR/Cas9 to carry the schizophrenia risk variant (13DNR) and observe a significant reduction of the corresponding CRMP2 isoform. These edited cells confirm the response to mTOR inhibitors and show a twofold shortening of the cellular projections. Transcriptome analysis of these modified cells by RNA-seq shows changes in 12.7% of expressed transcripts at a false discovery rate of 0.05. These transcripts are enriched in immunity-related genes, overlap significantly with those modified by the schizophrenia-associated gene, ZNF804A, and have a reverse expression signature from that seen with antipsychotic drugs. Our results support the functional importance of the DPYSL2 DNR and a role for mTOR signaling in schizophrenia.

10.1038/tp.2016.204

44. The long non-coding RNA Morrbid regulates Bim and short-lived myeloid cell lifespan. (2016)

Kotzin JJ, Spencer SP, McCright SJ, Kumar DBU, Collet MA, Mowel WK, Elliott EN, Uyar A, Makiya MA, Dunagin MC, Harman CCD, Virtue AT, Zhu S, Bailis W, Stein J, Hughes C, Raj A, Wherry EJ, Goff LA, Klion AD, Rinn JL, Williams A, Flavell RA, Henao-Mejia J

Nature. 2016 Sep 8;537(7619):239-243. doi: 10.1038/nature19346. Epub 2016 Aug 15. PMID: 27525555

Neutrophils, eosinophils and 'classical' monocytes collectively account for about 70% of human blood leukocytes and are among the shortest-lived cells in the body. Precise regulation of the lifespan of these myeloid cells is critical to maintain protective immune responses and minimize the deleterious consequences of prolonged inflammation. However, how the lifespan of these cells is strictly controlled remains largely unknown. Here we identify a long non-coding RNA that we termed Morrbid, which tightly controls the survival of neutrophils, eosinophils and classical monocytes in response to pro-survival cytokines in mice. To control the lifespan of these cells, Morrbid regulates the transcription of the neighbouring pro-apoptotic gene, Bcl2l11 (also known as Bim), by promoting the enrichment of the PRC2 complex at the Bcl2l11 promoter to maintain this gene in a poised state. Notably, Morrbid regulates this process in cis, enabling allele-specific control of Bcl2l11 transcription. Thus, in these highly inflammatory cells, changes in Morrbid levels provide a locus-specific regulatory mechanism that allows rapid control of apoptosis in response to extracellular pro-survival signals. As MORRBID is present in humans and dysregulated in individuals with hypereosinophilic syndrome, this long non-coding RNA may represent a potential therapeutic target for inflammatory disorders characterized by aberrant short-lived myeloid cell lifespan.

10.1038/nature19346

45. Investigating long noncoding RNAs using animal models. (2016)

Feyder M, Goff LA,

J Clin Invest. 2016 Aug 1;126(8):2783-91. doi: 10.1172/JCI84422. Epub 2016 Aug 1. PMID: 27479747

The number of long noncoding RNAs (lncRNAs) has grown rapidly; however, our understanding of their function remains limited. Although cultured cells have facilitated investigations of lncRNA function at the molecular level, the use of animal models provides a rich context in which to investigate the phenotypic impact of these molecules. Promising initial studies using animal models demonstrated that lncRNAs influence a diverse number of phenotypes, ranging from subtle dysmorphia to viability. Here, we highlight the diversity of animal models and their unique advantages, discuss the use of animal models to profile lncRNA expression, evaluate experimental strategies to manipulate lncRNA function in vivo, and review the phenotypes attributable to lncRNAs. Despite a limited number of studies leveraging animal models, lncRNAs are already recognized as a notable class of molecules with important implications for health and disease.

10.1172/JCI84422

46. Long noncoding RNAs: Central to nervous system development. (2016)

Hart RP, Goff LA,

Int J Dev Neurosci. 2016 Dec;55:109-116. doi: 10.1016/j.ijdevneu.2016.06.001. Epub 2016 Jun 11. PMID: 27296516

The development of the central nervous system (CNS) is a complex orchestration of stem cells, transcription factors, growth/differentiation factors, and epigenetic control. Noncoding RNAs have been identified, classified, and studied for their functional roles in many systems including the CNS. In particular, the class of long noncoding RNAs (lncRNAs) has generated both enthusiasm and skepticism due to the unexpected discovery, the diversity of mechanisms, and the lower level of expression than found in protein-coding RNAs. Here we describe evidence supporting the role of lncRNAs in driving CNS-specific differentiation. It is clear that lncRNAs exhibit a functional diversity that makes their study and compartmentalization more challenging than other classes of noncoding RNAs. We predict, however, that lncRNAs will be essential for the characterization of discrete neuronal cell types in the age of single-cell transcriptomics and that these regulatory RNAs contribute to the multitude of functional mechanisms during CNS differentiation that will rival the diversities of protein-based mechanisms.

10.1016/j.ijdevneu.2016.06.001

47. Creation and characterization of an airway epithelial cell line for stable expression of CFTR variants. (2016)

Gottschalk LB, Vecchio-Pagan B, Sharma N, Han ST, Franca A, Wohler ES, Batista DA, Goff LA, Cutting GR

J Cyst Fibros. 2016 May;15(3):285-94. doi: 10.1016/j.jcf.2015.11.010. Epub 2015 Dec 13. PMID: 26694805

BACKGROUND: Analysis of the functional consequences and treatment response of rare CFTR variants is challenging due to the limited availability of primary airways cells. METHODS: A Flp recombination target (FRT) site for stable expression of CFTR was incorporated into an immortalized CF bronchial epithelial cell line (CFBE41o-). CFTR cDNA was integrated into the FRT site. Expression was evaluated by western blotting and confocal microscopy and function measured by short circuit current. RNA sequencing was used to compare the transcriptional profile of the resulting CF8Flp cell line to primary cells and tissues. RESULTS: Functional CFTR was expressed from integrated cDNA at the FRT site of the CF8Flp cell line at levels comparable to that seen in native airway cells. CF8Flp cells expressing WT-CFTR have a stable transcriptome comparable to that of primary cultured airway epithelial cells, including genes that play key roles in CFTR pathways. CONCLUSION: CF8Flp cells provide a viable substitute for primary CF airway cells for the analysis of CFTR variants in a native context.

10.1016/j.jcf.2015.11.010

48. Linking RNA biology to lncRNAs. (2015)

Goff LA, Rinn JL

Genome Res. 2015 Oct;25(10):1456-65. doi: 10.1101/gr.191122.115. PMID: 26430155

The regulatory potential of RNA has never ceased to amaze: from RNA catalysis, to RNA-mediated splicing, to RNA-based silencing of an entire chromosome during dosage compensation. More recently, thousands of long noncoding RNA (lncRNA) transcripts have been identified, the majority with unknown function. Thus, it is tempting to think that these lncRNAs represent a cadre of new factors that function through ribonucleic mechanisms. Some evidence points to several lncRNAs with tantalizing physiological contributions and thought-provoking molecular modalities. However, dissecting the RNA biology of lncRNAs has been difficult, and distinguishing the independent contributions of functional RNAs from underlying DNA elements, or the local act of transcription, is challenging. Here, we aim to survey the existing literature and highlight future approaches that will be needed to link the RNA-based biology and mechanisms of lncRNAs in vitro and in vivo.

10.1101/gr.191122.115

49. Spatiotemporal expression and transcriptional perturbations by long noncoding RNAs in the mouse brain. (2015)

Goff LA, Groff AF, Sauvageau M, Trayes-Gibson Z, Sanchez-Gomez DB, Morse M, Martin RD, Elcavage LE, Liapis SC, Gonzalez-Celeiro M, Plana O, Li E, Gerhardinger C, Tomassy GS, Arlotta P, Rinn JL

Proc Natl Acad Sci U S A. 2015 Jun 2;112(22):6855-62. doi: 10.1073/pnas.1411263112. PMID: 26034286

Long noncoding RNAs (lncRNAs) have been implicated in numerous cellular processes including brain development. However, the in vivo expression dynamics and molecular pathways regulated by these loci are not well understood. Here, we leveraged a cohort of 13 lncRNAnull mutant mouse models to investigate the spatiotemporal expression of lncRNAs in the developing and adult brain and the transcriptome alterations resulting from the loss of these lncRNA loci. We show that several lncRNAs are differentially expressed both in time and space, with some presenting highly restricted expression in only selected brain regions. We further demonstrate altered regulation of genes for a large variety of cellular pathways and processes upon deletion of the lncRNA loci. Finally, we found that 4 of the 13 lncRNAs significantly affect the expression of several neighboring proteincoding genes in a cis-like manner. By providing insight into the endogenous expression patterns and the transcriptional perturbations caused by deletion of the lncRNA locus in the developing and postnatal mammalian brain, these data provide a resource to facilitate future examination of the specific functional relevance of these genes in neural development, brain function, and disease.

Article

50. DeCoN: genome-wide analysis of in vivo transcriptional dynamics during pyramidal neuron fate selection in neocortex. (2015)

Molyneaux BJ, Goff LA, Brettler AC, Chen HH, Hrvatin S, Rinn JL, Arlotta P

Neuron. 2015 Jan 21;85(2):275-288. doi: 10.1016/j.neuron.2014.12.024. Epub 2014 Dec 31. PMID: 25556833

Neuronal development requires a complex choreography of transcriptional decisions to obtain specific cellular identities. Realizing the ultimate goal of identifying genome-wide signatures that define and drive specific neuronal fates has been hampered by enormous complexity in both time and space during development. Here, we have paired high-throughput purification of pyramidal neuron subclasses with deep profiling of spatiotemporal transcriptional dynamics during corticogenesis to resolve lineage choice decisions. We identified numerous features ranging from spatial and temporal usage of alternative mRNA isoforms and promoters to a host of mRNA genes modulated during fate specification. Notably, we uncovered numerous long noncoding RNAs with restricted temporal and cell-type-specific expression. To facilitate future exploration, we provide an interactive online database to enable multidimensional data mining and dissemination. This multifaceted study generates a powerful resource and informs understanding of the transcriptional regulation underlying pyramidal neuron diversity in the neocortex.

10.1016/j.neuron.2014.12.024

51. Gene co-regulation by Fezf2 selects neurotransmitter identity and connectivity of corticospinal neurons. (2014)

Lodato S, Molyneaux BJ, Zuccaro E, Goff LA, Chen HH, Yuan W, Meleski A, Takahashi E, Mahony S, Rinn JL, Gifford DK, Arlotta P

Nat Neurosci. 2014 Aug;17(8):1046-54. doi: 10.1038/nn.3757. Epub 2014 Jul 6. PMID: 24997765

The neocortex contains an unparalleled diversity of neuronal subtypes, each defined by distinct traits that are developmentally acquired under the control of subtype-specific and pan-neuronal genes. The regulatory logic that orchestrates the expression of these unique combinations of genes is unknown for any class of cortical neuron. Here, we report that Fezf2 is a selector gene able to regulate the expression of gene sets that collectively define mouse corticospinal motor neurons (CSMN). We find that Fezf2 directly induces the glutamatergic identity of CSMN via activation of Vglut1 (Slc17a7) and inhibits a GABAergic fate by repressing transcription of Gad1. In addition, we identify the axon guidance receptor EphB1 as a target of Fezf2 necessary to execute the ipsilateral extension of the corticospinal tract. Our data indicate that co-regulated expression of neuron subtype-specific and pan-neuronal gene batteries by a single transcription factor is one component of the regulatory logic responsible for the establishment of CSMN identity.

10.1038/nn.3757

52. A positive feedback mechanism that regulates expression of miR-9 during neurogenesis. (2014)

Davila JL, Goff LA, Ricupero CL, Camarillo C, Oni EN, Swerdel MR, Toro-Ramos AJ, Li J, Hart RP

PLoS One. 2014 Apr 8;9(4):e94348. doi: 10.1371/journal.pone.0094348. eCollection 2014. PMID: 24714615

MiR-9, a neuron-specific miRNA, is an important regulator of neurogenesis. In this study we identify how miR-9 is regulated during early differentiation from a neural stem-like cell. We utilized two immortalized rat precursor clones, one committed to neurogenesis (L2.2) and another capable of producing both neurons and non-neuronal cells (L2.3), to reproducibly study early neurogenesis. Exogenous miR-9 is capable of increasing neurogenesis from L2.3 cells. Only one of three genomic loci capable of encoding miR-9 was regulated during neurogenesis and the promoter region of this locus contains sufficient functional elements to drive expression of a luciferase reporter in a developmentally regulated pattern. Furthermore, among a large number of potential regulatory sites encoded in this sequence, Mef2 stood out because of its known pro-neuronal role. Of four Mef2 paralogs, we found only Mef2C mRNA was regulated during neurogenesis. Removal of predicted Mef2 binding sites or knockdown of Mef2C expression reduced miR-9-2 promoter activity. Finally, the mRNA encoding the Mef2C binding partner HDAC4 was shown to be targeted by miR-9. Since HDAC4 protein could be co-immunoprecipitated with Mef2C protein or with genomic Mef2 binding sequences, we conclude that miR-9 regulation is mediated, at least in part, by Mef2C binding but that expressed miR-9 has the capacity to reduce inhibitory HDAC4, stabilizing its own expression in a positive feedback mechanism.

10.1371/journal.pone.0094348

53. Topological organization of multichromosomal regions by the long intergenic noncoding RNA Firre. (2014)

Hacisuleyman E, Goff LA, Trapnell C, Williams A, Henao-Mejia J, Sun L, McClanahan P, Hendrickson DG, Sauvageau M, Kelley DR, Morse M, Engreitz J, Lander ES, Guttman M, Lodish HF, Flavell R, Raj A, Rinn JL

Nat Struct Mol Biol. 2014 Feb;21(2):198-206. doi: 10.1038/nsmb.2764. Epub 2014 Jan 26. PMID: 24463464

RNA, including long noncoding RNA (lncRNA), is known to be an abundant and important structural component of the nuclear matrix. However, the molecular identities, functional roles and localization dynamics of lncRNAs that influence nuclear architecture remain poorly understood. Here, we describe one lncRNA, Firre, that interacts with the nuclear-matrix factor hnRNPU through a 156-bp repeating sequence and localizes across an ~5-Mb domain on the X chromosome. We further observed Firre localization across five distinct trans-chromosomal loci, which reside in spatial proximity to the Firre genomic locus on the X chromosome. Both genetic deletion of the Firre locus and knockdown of hnRNPU resulted in loss of colocalization of these trans-chromosomal interacting loci. Thus, our data suggest a model in which lncRNAs such as Firre can interface with and modulate nuclear architecture across chromosomes.

10.1038/nsmb.2764

54. RNase-mediated protein footprint sequencing reveals protein-binding sites throughout the human transcriptome. (2014)

Silverman IM, Li F, Alexander A, Goff LA, Trapnell C, Rinn JL, Gregory BD

Genome Biol. 2014 Jan 7;15(1):R3. doi: 10.1186/gb-2014-15-1-r3. PMID: 24393486

Although numerous approaches have been developed to map RNA-binding sites of individual RNA-binding proteins (RBPs), few methods exist that allow assessment of global RBP-RNA interactions. Here, we describe PIP-seq, a universal, high-throughput, ribonuclease-mediated protein footprint sequencing approach that reveals RNA-protein interaction sites throughout a transcriptome of interest. We apply PIP-seq to the HeLa transcriptome and compare binding sites found using different cross-linkers and ribonucleases. From this analysis, we identify numerous putative RBP-binding motifs, reveal novel insights into co-binding by RBPs, and uncover a significant enrichment for disease-associated polymorphisms within RBP interaction sites.

10.1186/gb-2014-15-1-r3

55. Multiple knockout mouse models reveal lincRNAs are required for life and brain development. (2013)

Sauvageau M, Goff LA, Lodato S, Bonev B, Groff AF, Gerhardinger C, Sanchez-Gomez DB, Hacisuleyman E, Li E, Spence M, Liapis SC, Mallard W, Morse M, Swerdel MR, D'Ecclessis MF, Moore JC, Lai V, Gong G, Yancopoulos GD, Frendewey D, Kellis M, Hart RP, Valenzuela DM, Arlotta P, Rinn JL

Elife. 2013 Dec 31;2:e01749. doi: 10.7554/eLife.01749. PMID: 24381249

Many studies are uncovering functional roles for long noncoding RNAs (lncRNAs), yet few have been tested for in vivo relevance through genetic ablation in animal models. To investigate the functional relevance of lncRNAs in various physiological conditions, we have developed a collection of 18 lncRNA knockout strains in which the locus is maintained transcriptionally active. Initial characterization revealed peri- and postnatal lethal phenotypes in three mutant strains (Fendrr, Peril, and Mdgt), the latter two exhibiting incomplete penetrance and growth defects in survivors. We also report growth defects for two additional mutant strains (linc-Brn1b and linc-Pint). Further analysis revealed defects in lung, gastrointestinal tract, and heart in Fendrr(-/-) neonates, whereas linc-Brn1b(-/-) mutants displayed distinct abnormalities in the generation of upper layer II-IV neurons in the neocortex. This study demonstrates that lncRNAs play critical roles in vivo and provides a framework and impetus for future larger-scale functional investigation into the roles of lncRNA molecules. DOI: http://dx.doi.org/10.7554/eLife.01749.001.

10.7554/eLife.01749

56. Poly-combing the genome for RNA. (2013)

Goff LA, Rinn JL

Nat Struct Mol Biol. 2013 Dec;20(12):1344-6. doi: 10.1038/nsmb.2728. PMID: 24304912

An unresolved question in mammalian epigenetic regulation is how ubiquitously expressed chromatin-modifying complexes such as Polycomb group complex 2 (PRC2) find their specific target sites across an intricate choreography of localization events in time and space. Two recent studies now provide critical new insights into an intriguing genome-wide role for RNA in PRC2 regulation.

10.1038/nsmb.2728

57. DNMT1-interacting RNAs block gene-specific DNA methylation. (2013)

Di Ruscio A, Ebralidze AK, Benoukraf T, Amabile G, Goff LA, Terragni J, Figueroa ME, De Figueiredo Pontes LL, Alberich-Jorda M, Zhang P, Wu M, D'Alo F, Melnick A, Leone G, Ebralidze KK, Pradhan S, Rinn JL, Tenen DG

Nature. 2013 Nov 21;503(7476):371-6. doi: 10.1038/nature12598. Epub 2013 Oct 9. PMID: 24107992

DNA methylation was first described almost a century ago; however, the rules governing its establishment and maintenance remain elusive. Here we present data demonstrating that active transcription regulates levels of genomic methylation. We identify a novel RNA arising from the CEBPA gene locus that is critical in regulating the local DNA methylation profile. This RNA binds to DNMT1 and prevents CEBPA gene locus methylation. Deep sequencing of transcripts associated with DNMT1 combined with genome-scale methylation and expression profiling extend the generality of this finding to numerous gene loci. Collectively, these results delineate the nature of DNMT1-RNA interactions and suggest strategies for gene-selective demethylation of therapeutic targets in human diseases.

10.1038/nature12598

58. The microRNA miR-181 is a critical cellular metabolic rheostat essential for NKT cell ontogenesis and lymphocyte development and homeostasis. (2013)

Henao-Mejia J, Williams A, Goff LA, Staron M, Licona-Limon P, Kaech SM, Nakayama M, Rinn JL, Flavell RA

Immunity. 2013 May 23;38(5):984-97. doi: 10.1016/j.immuni.2013.02.021. Epub 2013 Apr 25. PMID: 23623381

Regulation of metabolic pathways in the immune system provides a mechanism to actively control cellular function, growth, proliferation, and survival. Here, we report that miR-181 is a nonredundant determinant of cellular metabolism and is essential for supporting the biosynthetic demands of early NKT cell development. As a result, miR-181-deficient mice showed a complete absence of mature NKT cells in the thymus and periphery. Mechanistically, miR-181 modulated expression of the phosphatase PTEN to control PI3K signaling, which was a primary stimulus for anabolic metabolism in immune cells. Thus miR-181-deficient mice also showed severe defects in lymphoid development and T cell homeostasis associated with impaired PI3K signaling. These results uncover miR-181 as essential for NKT cell development and establish this family of miRNAs as central regulators of PI3K signaling and global metabolic fitness during development and homeostasis.

10.1016/j.immuni.2013.02.021

59. Long noncoding RNAs regulate adipogenesis. (2013)

Sun L, Goff LA, Trapnell C, Alexander R, Lo KA, Hacisuleyman E, Sauvageau M, Tazon-Vega B, Kelley DR, Hendrickson DG, Yuan B, Kellis M, Lodish HF, Rinn JL

Proc Natl Acad Sci U S A. 2013 Feb 26;110(9):3387-92. doi: 10.1073/pnas.1222643110. Epub 2013 Feb 11. PMID: 23401553

The prevalence of obesity has led to a surge of interest in understanding the detailed mechanisms underlying adipocyte development. Many protein-coding genes, mRNAs, and microRNAs have been implicated in adipocyte development, but the global expression patterns and functional contributions of long noncoding RNA (lncRNA) during adipogenesis have not been explored. Here we profiled the transcriptome of primary brown and white adipocytes, preadipocytes, and cultured adipocytes and identified 175 lncRNAs that are specifically regulated during adipogenesis. Many lncRNAs are adipose-enriched, strongly induced during adipogenesis, and bound at their promoters by key transcription factors such as peroxisome proliferator-activated receptor gamma (PPARgamma) and CCAAT/enhancer-binding protein alpha (CEBPalpha). RNAi-mediated loss of function screens identified functional lncRNAs with varying impact on adipogenesis. Collectively, we have identified numerous lncRNAs that are functionally required for proper adipogenesis.

10.1073/pnas.1222643110

60. Differential analysis of gene regulation at transcript resolution with RNA-seq. (2013)

Trapnell C, Hendrickson DG, Sauvageau M, Goff LA, Rinn JL, Pachter L

Nat Biotechnol. 2013 Jan;31(1):46-53. doi: 10.1038/nbt.2450. Epub 2012 Dec 9. PMID: 23222703

Differential analysis of gene and transcript expression using high-throughput RNA sequencing (RNA-seq) is complicated by several sources of measurement variability and poses numerous statistical challenges. We present Cuffdiff 2, an algorithm that estimates expression at transcript-level resolution and controls for variability evident across replicate libraries. Cuffdiff 2 robustly identifies differentially expressed transcripts and genes and reveals differential splicing and promoter-preference changes. We demonstrate the accuracy of our approach through differential analysis of lung fibroblasts in response to loss of the developmental transcription factor HOXA1, which we show is required for lung fibroblast and HeLa cell cycle progression. Loss of HOXA1 results in significant expression level changes in thousands of individual transcripts, along with isoform switching events in key regulators of the cell cycle. Cuffdiff 2 performs robust differential analysis in RNA-seq experiments at transcript resolution, revealing a layer of regulation not readily observable with other high-throughput technologies.

10.1038/nbt.2450

61. Computational analysis of noncoding RNAs. (2012)

Washietl S, Will S, Hendrix DA, Goff LA, Rinn JL, Berger B, Kellis M

Wiley Interdiscip Rev RNA. 2012 Nov-Dec;3(6):759-78. doi: 10.1002/wrna.1134. Epub 2012 Sep 18. PMID: 22991327

Noncoding RNAs have emerged as important key players in the cell. Understanding their surprisingly diverse range of functions is challenging for experimental and computational biology. Here, we review computational methods to analyze noncoding RNAs. The topics covered include basic and advanced techniques to predict RNA structures, annotation of noncoding RNAs in genomic data, mining RNA-seq data for novel transcripts and prediction of transcript structures, computational aspects of microRNAs, and database resources.

10.1002/wrna.1134

62. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. (2012)

Trapnell C, Roberts A, Goff LA, Pertea G, Kim D, Kelley DR, Pimentel H, Salzberg SL, Rinn JL, Pachter L

Nat Protoc. 2012 Mar 1;7(3):562-78. doi: 10.1038/nprot.2012.016. PMID: 22383036

Recent advances in high-throughput cDNA sequencing (RNA-seq) can reveal new genes and splice variants and quantify expression genome-wide in a single assay. The volume and complexity of data from RNA-seq experiments necessitate scalable, fast and mathematically principled analysis software. TopHat and Cufflinks are free, open-source software tools for gene discovery and comprehensive expression analysis of high-throughput mRNA sequencing (RNA-seq) data. Together, they allow biologists to identify new genes and new splice variants of known ones, as well as compare gene and transcript expression under two or more conditions. This protocol describes in detail how to use TopHat and Cufflinks to perform such analyses. It also covers several accessory tools and utilities that aid in managing data, including CummeRbund, a tool for visualizing RNA-seq analysis results. Although the procedure assumes basic informatics skills, these tools assume little to no background with RNA-seq analysis and are meant for novices and experts alike. The protocol begins with raw sequencing reads and produces a transcriptome assembly, lists of differentially expressed and regulated genes and transcripts, and publication-quality visualizations of analysis results. The protocol's execution time depends on the volume of transcriptome sequencing data and available computing resources but takes less than 1 d of computer time for typical experiments and approximately 1 h of hands-on time.

10.1038/nprot.2012.016

63. Expression profiling of synaptic microRNAs from the adult rat brain identifies regional differences and seizure-induced dynamic modulation. (2012)

Pichardo-Casas I, Goff LA, Swerdel MR, Athie A, Davila J, Ramos-Brossier M, Lapid-Volosin M, Friedman WJ, Hart RP, Vaca L

Brain Res. 2012 Feb 3;1436:20-33. doi: 10.1016/j.brainres.2011.12.001. Epub 2011 Dec 9. PMID: 22197703

In recent years, microRNAs or miRNAs have been proposed to target neuronal mRNAs localized near the synapse, exerting a pivotal role in modulating local protein synthesis, and presumably affecting adaptive mechanisms such as synaptic plasticity. In the present study we have characterized the distribution of miRNAs in five regions of the adult mammalian brain and compared the relative abundance between total fractions and purified synaptoneurosomes (SN), using three different methodologies. The results show selective enrichment or depletion of some miRNAs when comparing total versus SN fractions. These miRNAs were different for each brain region explored. Changes in distribution could not be attributed to simple diffusion or to a targeting sequence inside the miRNAs. In silico analysis suggest that the differences in distribution may be related to the preferential concentration of synaptically localized mRNA targeted by the miRNAs. These results favor a model of co-transport of the miRNA-mRNA complex to the synapse, although further studies are required to validate this hypothesis. Using an in vivo model for increasing excitatory activity in the cortex and the hippocampus indicates that the distribution of some miRNAs can be modulated by enhanced neuronal (epileptogenic) activity. All these results demonstrate the dynamic modulation in the local distribution of miRNAs from the adult brain, which may play key roles in controlling localized protein synthesis at the synapse.

10.1016/j.brainres.2011.12.001

64. Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses. (2011)

Cabili MN, Trapnell C, Goff LA, Koziol M, Tazon-Vega B, Regev A, Rinn JL

Genes Dev. 2011 Sep 15;25(18):1915-27. doi: 10.1101/gad.17446611. Epub 2011 Sep 2. PMID: 21890647

Large intergenic noncoding RNAs (lincRNAs) are emerging as key regulators of diverse cellular processes. Determining the function of individual lincRNAs remains a challenge. Recent advances in RNA sequencing (RNA-seq) and computational methods allow for an unprecedented analysis of such transcripts. Here, we present an integrative approach to define a reference catalog of >8000 human lincRNAs. Our catalog unifies previously existing annotation sources with transcripts we assembled from RNA-seq data collected from approximately 4 billion RNA-seq reads across 24 tissues and cell types. We characterize each lincRNA by a panorama of >30 properties, including sequence, structural, transcriptional, and orthology features. We found that lincRNA expression is strikingly tissue-specific compared with coding genes, and that lincRNAs are typically coexpressed with their neighboring genes, albeit to an extent similar to that of pairs of neighboring protein-coding genes. We distinguish an additional subset of transcripts that have high evolutionary conservation but may include short ORFs and may serve as either lincRNAs or small peptides. Our integrated, comprehensive, yet conservative reference catalog of human lincRNAs reveals the global properties of lincRNAs and will facilitate experimental studies and further functional classification of these genes.

10.1101/gad.17446611

65. Differential regulation of microRNA stability. (2010)

Bail S, Swerdel M, Liu H, Jiao X, Goff LA, Hart RP, Kiledjian M

RNA. 2010 May;16(5):1032-9. doi: 10.1261/rna.1851510. Epub 2010 Mar 26. PMID: 20348442

MicroRNAs (miRNAs) are endogenous single-stranded RNA molecules of about 21 nucleotides in length that are fundamental post-transcriptional regulators of gene expression. Although the transcriptional and processing events involved in the generation of miRNAs have been extensively studied, very little is known pertaining to components that regulate the stability of individual miRNAs. All RNAs have distinct inherent half-lives that dictate their level of accumulation and miRNAs would be expected to follow a similar principle. Here we demonstrate that although most miRNA appear to be stable, like mRNAs, miRNAs possess differential stability in human cells. In particular, we found that miR-382, a miRNA that contributes to HIV-1 provirus latency, is unstable in cells. To determine the region of miR-382 responsible for its rapid decay, we developed a cell-free system that recapitulated the observed cell-based-regulated miR-382 turnover. The system utilizes in vitro-processed mature miRNA derived from pre-miRNA and follows the decay of the processed miRNA. Using this system, we demonstrate that instability of miR-382 is driven by sequences outside its seed region and required the 3' terminal seven nucleotides where mutations in this region increased the stability of the RNA. Moreover, the exosome 3'-5' exoribonuclease complex was identified as the primary nuclease involved in miR-382 decay with a more modest contribution by the Xrn1 and no detectable contribution by Xrn2. These studies provide evidence for an miRNA element essential for rapid miRNA decay and implicate the exosome in this process. The development of a biochemically amendable system to analyze the mechanism of differential miRNA stability provides an important step in efforts to regulate gene expression by modulating miRNA stability.

10.1261/rna.1851510

66. Ago2 immunoprecipitation identifies predicted microRNAs in human embryonic stem cells and neural precursors. (2009)

Goff LA, Davila J, Swerdel MR, Moore JC, Cohen RI, Wu H, Sun YE, Hart RP

PLoS One. 2009 Sep 28;4(9):e7192. doi: 10.1371/journal.pone.0007192. PMID: 19784364

BACKGROUND: MicroRNAs are required for maintenance of pluripotency as well as differentiation, but since more microRNAs have been computationally predicted in genome than have been found, there are likely to be undiscovered microRNAs expressed early in stem cell differentiation. METHODOLOGY/PRINCIPAL FINDINGS: SOLiD ultra-deep sequencing identified >10(7) unique small RNAs from human embryonic stem cells (hESC) and neural-restricted precursors that were fit to a model of microRNA biogenesis to computationally predict 818 new microRNA genes. These predicted genomic loci are associated with chromatin patterns of modified histones that are predictive of regulated gene expression. 146 of the predicted microRNAs were enriched in Ago2-containing complexes along with 609 known microRNAs, demonstrating association with a functional RISC complex. This Ago2 IP-selected subset was consistently expressed in four independent hESC lines and exhibited complex patterns of regulation over development similar to previously-known microRNAs, including pluripotency-specific expression in both hESC and iPS cells. More than 30% of the Ago2 IP-enriched predicted microRNAs are new members of existing families since they share seed sequences with known microRNAs. CONCLUSIONS/SIGNIFICANCE: Extending the classic definition of microRNAs, this large number of new microRNA genes, the majority of which are less conserved than their canonical counterparts, likely represent evolutionarily recent regulators of early differentiation. The enrichment in Ago2 containing complexes, the presence of chromatin marks indicative of regulated gene expression, and differential expression over development all support the identification of 146 new microRNAs active during early hESC differentiation.

10.1371/journal.pone.0007192

67. Rapid induction of genes associated with tissue protection and neural development in contused adult spinal cord after radial glial cell transplantation. (2009)

Chang YW, Goff LA, Li H, Kane-Goldsmith N, Tzatzalos E, Hart RP, Young W, Grumet M

J Neurotrauma. 2009 Jul;26(7):979-93. doi: 10.1089/neu.2008.0762. PMID: 19257808

Cell-based therapy has been widely evaluated in spinal cord injury (SCI) animal models and shown to improve functional recovery. However, host response to cell transplants at gene expression level is rarely discussed. We reported previously that acute transplantation of radial glial cells RG3.6 following SCI promoted early locomotion improvement within 1 week post-injury. To identify rapid molecular changes induced by RG3.6 transplantation in the host tissue, distal spinal cord segments were subjected to microarray analysis. Although RG3.6 transplantation, reduced activity of macrophages as early as 1-2 weeks post-injury, the expression levels of inflammatory genes (e.g., IL-6, MIP-2, MCP-1) were not decreased by RG3.6 treatment as compared to medium or other cell controls at 6-12 h post-injury. However, genes associated with tissue protection (Hsp70 and Hsp32) and neural cell development (Foxg1, Top2a, Sox11, Nkx2.2, Vimentin) were found to be significantly up-regulated by RG3.6 transplants. Foxg1 was the most highly induced gene in the RG3.6-treated spinal cords, and its expression by immunocytochemistry was confirmed in the host tissue. Moreover, RG3.6 treatment boosted the number of Nkx2.2 cells in the spinal cord, and these cells frequently co-expressed NG2, which marks progenitor cells. Taken together, these results demonstrate that radial glial transplants induced rapid and specific gene expression in the injured host tissue, and suggest that these early responses are associated with mechanisms of tissue protection and activation of endogenous neural progenitor cells.

Article

68. Functional differentiation of a clone resembling embryonic cortical interneuron progenitors. (2008)

Li H, Han YR, Bi C, Davila J, Goff LA, Thompson K, Swerdel M, Camarillo C, Ricupero CL, Hart RP, Plummer MR, Grumet M

Dev Neurobiol. 2008 Dec;68(14):1549-64. doi: 10.1002/dneu.20679. PMID: 18814314

We have generated clones (L2.3 and RG3.6) of neural progenitors with radial glial properties from rat E14.5 cortex that differentiate into astrocytes, neurons, and oligodendrocytes. Here, we describe a different clone (L2.2) that gives rise exclusively to neurons, but not to glia. Neuronal differentiation of L2.2 cells was inhibited by bone morphogenic protein 2 (BMP2) and enhanced by Sonic Hedgehog (SHH) similar to cortical interneuron progenitors. Compared with L2.3, differentiating L2.2 cells expressed significantly higher levels of mRNAs for glutamate decarboxylases (GADs), DLX transcription factors, calretinin, calbindin, neuropeptide Y (NPY), and somatostatin. Increased levels of DLX-2, GADs, and calretinin proteins were confirmed upon differentiation. L2.2 cells differentiated into neurons that fired action potentials in vitro, and their electrophysiological differentiation was accelerated and more complete when cocultured with developing astroglial cells but not with conditioned medium from these cells. The combined results suggest that clone L2.2 resembles GABAergic interneuron progenitors in the developing forebrain.

10.1002/dneu.20679

69. Differentiating human multipotent mesenchymal stromal cells regulate microRNAs: prediction of microRNA regulation by PDGF during osteogenesis. (2008)

Goff LA, Boucher S, Ricupero CL, Fenstermacher S, Swerdel M, Chase LG, Adams CC, Chesnut J, Lakshmipathy U, Hart RP

Exp Hematol. 2008 Oct;36(10):1354-1369. doi: 10.1016/j.exphem.2008.05.004. Epub 2008 Jul 26. PMID: 18657893

OBJECTIVE: Human multipotent mesenchymal stromal cells (MSC) have the potential to differentiate into multiple cell types, although little is known about factors that control their fate. Differentiation-specific microRNAs may play a key role in stem cell self-renewal and differentiation. We propose that specific intracellular signaling pathways modulate gene expression during differentiation by regulating microRNA expression. MATERIALS AND METHODS: Illumina mRNA and NCode microRNA expression analyses were performed on MSC and their differentiated progeny. A combination of bioinformatic prediction and pathway inhibition was used to identify microRNAs associated with platelet-derived growth factor (PDGF) signaling. RESULTS: The pattern of microRNA expression in MSC is distinct from that in pluripotent stem cells, such as human embryonic stem cells. Specific populations of microRNAs are regulated in MSC during differentiation targeted toward specific cell types. Complementary mRNA expression analysis increases the pool of markers characteristic of MSC or differentiated progeny. To identify microRNA expression patterns affected by signaling pathways, we examined the PDGF pathway found to be regulated during osteogenesis by microarray studies. A set of microRNAs bioinformatically predicted to respond to PDGF signaling was experimentally confirmed by direct PDGF inhibition. CONCLUSION: Our results demonstrate that a subset of microRNAs regulated during osteogenic differentiation of MSCs is responsive to perturbation of the PDGF pathway. This approach not only identifies characteristic classes of differentiation-specific mRNAs and microRNAs, but begins to link regulated molecules with specific cellular pathways.

10.1016/j.exphem.2008.05.004

70. MicroRNA expression pattern of undifferentiated and differentiated human embryonic stem cells. (2007)

Lakshmipathy U, Love B, Goff LA, Jornsten R, Graichen R, Hart RP, Chesnut JD

Stem Cells Dev. 2007 Dec;16(6):1003-16. doi: 10.1089/scd.2007.0026. PMID: 18004940

Many of the currently established human embryonic stem (hES) cell lines have been characterized extensively in terms of their gene expression profiles and genetic stability in culture. Recent studies have indicated that microRNAs (miRNAs), a class of noncoding small RNAs that participate in the regulation of gene expression, may play a key role in stem cell self-renewal and differentiation. Using both microarrays and quantitative PCR, we report here the differences in miRNA expression between undifferentiated hES cells and their corresponding differentiated cells that underwent differentiation in vitro over a period of 2 weeks. Our results confirm the identity of a signature miRNA profile in pluripotent cells, comprising a small subset of differentially expressed miRNAs in hES cells. Examining both mRNA and miRNA profiles under multiple conditions using cross-correlation, we find clusters of miRNAs grouped with specific, biologically interpretable mRNAs. We identify patterns of expression in the progression from hES cells to differentiated cells that suggest a role for selected miRNAs in maintenance of the undifferentiated, pluripotent state. Profiling of the hES cell "miRNA-ome" provides an insight into molecules that control cellular differentiation and maintenance of the pluripotent state, findings that have broad implications in development, homeostasis, and human disease states.

Article

71. Bioinformatic analysis of neural stem cell differentiation. (2007)

Goff LA, Davila J, Jornsten R, Keles S, Hart RP

J Biomol Tech. 2007 Sep;18(4):205-12. PMID: 17916793

Regulated mRnAs during differentiation of rat neural stem cells were analyzed using the ABi1700 microarray platform. This microarray, while technically advanced, suffers from the difficulty of integrating hybridization results into public databases for systems-level analysis. This is particularly true for the rat array, since many of the probes were designed for transcripts based on predicted human and mouse homologs. using several strategies, we increased the public annotation of the 27,531 probes from 43% to over 65%. To increase the dynamic range of annotation, probes were mapped to numerous public keys from several data sources. consensus annotation from multiple sources was determined for well-scoring alignments, and a confidence-based ranking system established for probes with less agreement across multiple data sources. previous attempts at genomic interpretation using the celera annotation model resulted in poor overlap with expected genomic sequences. since the public keys are more precisely mapped to the genome, we could now analyze the relationships between predicted transcription-factor binding sites and expression clusters. Results collected from a differentiation time course of two neural stem cell clones were clustered using a model-based algorithm. Transcription-factor binding sites were predicted from upstream regions of mapped transcripts using position weight matrices from either JAspAR or TRAnsFAc, and the resulting scores were used to discriminate between observed expression clusters. A classification and regression tree analysis was conducted using cluster numbers as gene identifiers and TFBs scores as predictors, pruning back to obtain a tree with the lowest gene class prediction error rate. Results identify several transcription factors, the presence or absence of which are sufficient to describe clusters of mRnAs changing over time-those that are static, as well as clusters describing cell line differences. public annotation of the AB1700 rat genome array will be valuable for integrating results into future systems-level analyses.

Article

72. Rational probe optimization and enhanced detection strategy for microRNAs using microarrays. (2005)

Goff LA, Yang M, Bowers J, Getts RC, Padgett RW, Hart RP

RNA Biol. 2005 Jul-Sep;2(3):93-100. doi: 10.4161/rna.2.3.2059. Epub 2005 Jul 20. PMID: 17114923

MicroRNAs (miRNAs) are post-transcriptional regulators participating in biological processes ranging from differentiation to carcinogenesis. We developed a rational probe design algorithm and a sensitive labelling scheme for optimizing miRNA microarrays. Our microarray contains probes for all validated miRNAs from five species, with the potential for drawing on species conservation to identify novel miRNAs with homologous probes. These methods are useful for high-throughput analysis of micro RNAs from various sources, and allow analysis with limiting quantities of RNA. The system design can also be extended for use on Luminex beads or on 96-well plates in an ELISA-style assay. We optimized hybridization temperatures using sequence variations on 20 of the probes and determined that all probes distinguish wild-type from 2 nt mutations, and most probes distinguish a 1 nt mutation, producing good selectivity between closely-related small RNA sequences. Results of tissue comparisons on our microarrays reveal patterns of hybridization that agree with results from Northern blots and other methods.

Article

73. Evaluation of sense-strand mRNA amplification by comparative quantitative PCR. (2004)

Goff LA, Bowers J, Schwalm J, Howerton K, Getts RC, Hart RP

BMC Genomics. 2004 Oct 6;5:76. doi: 10.1186/1471-2164-5-76. PMID: 15469607

BACKGROUND: RNA amplification is required for incorporating laser-capture microdissection techniques into microarray assays. However, standard oligonucleotide microarrays contain sense-strand probes, so traditional T7 amplification schemes producing anti-sense RNA are not appropriate for hybridization when combined with conventional reverse transcription labeling methods. We wished to assess the accuracy of a new sense-strand RNA amplification method by comparing ratios between two samples using quantitative real-time PCR (qPCR), mimicking a two-color microarray assay. RESULTS: We performed our validation using qPCR. Three samples of rat brain RNA and three samples of rat liver RNA were amplified using several kits (Ambion messageAmp, NuGen Ovation, and several versions of Genisphere SenseAmp). Results were assessed by comparing the liver/brain ratio for 192 mRNAs before and after amplification. In general, all kits produced strong correlations with unamplified RNAs. The SenseAmp kit produced the highest correlation, and was also able to amplify a partially degraded sample accurately. CONCLUSION: We have validated an optimized sense-strand RNA amplification method for use in comparative studies such as two-color microarrays.

Article