MicroRNAs (miRNAs) are ∼22 nt RNAs that direct posttranscriptional repression of mRNA targets in diverse eukaryotic lineages. In humans and other mammals, these small RNAs help sculpt the expression of most mRNAs. This article reviews advances in our understanding of the defining features of metazoan miRNAs and their biogenesis, genomics, and evolution. It then reviews how metazoan miRNAs are regulated, how they recognize and cause repression of their targets, and the biological functions of this repression, with a compilation of knockout phenotypes that shows that important biological functions have been identified for most of the broadly conserved miRNAs of mammals.
Main Text
Introduction
MicroRNAs (miRNAs) are small regulatory RNAs that are processed from stem-loop regions of longer RNA transcripts. Hundreds of different miRNAs have been identified in humans, many of which are conserved in other animals, and these conserved miRNAs have preferentially conserved interactions with most human mRNAs (Friedman et al., 2009). This inferred regulation of most human mRNAs suggests that miRNAs influence essentially all developmental process and diseases. Indeed, loss-of-function studies disrupting miRNA genes in mice have revealed diverse phenotypes, including defects in the development of the skeleton, teeth, brain, eyes, neurons, muscle, heart, lungs, kidneys, vasculature, liver, pancreas, intestine, skin, fat, breast, ovaries, testes, placenta, thymus, and each hematopoietic lineage, as well as cellular, physiological, and behavioral defects. Many of these developmental and physiological defects affect embryonic or postnatal viability or cause other severe conditions, such as epilepsy, deafness, retinal degeneration, infertility, immune disorders, or cancer. In addition, some miRNA-knockout strains have altered susceptibility to infections, and many have differential responses to mouse models of diseases or injuries.
As with many discoveries of fundamental importance to mammalian development, physiology, and disease, the first known miRNA was not found in humans or other mammals but was instead found in an invertebrate model organism. Molecular geneticists studying the lin-4 and let-7 genes, which are each required for the proper timing of C. elegans development, found that instead of producing mRNAs, these genes produce noncoding RNAs (ncRNAs), including short RNAs ∼22 nt in length (Lee et al., 1993, Reinhart et al., 2000). The lin-4 and let-7 RNAs both had imperfect complementarity to conserved sites within the 3′ UTRs of genetically identified regulatory targets, which led to a model in which these small RNAs mediate translational repression through antisense interactions (Lee et al., 1993, Wightman et al., 1993, Moss et al., 1997, Olsen and Ambros, 1999, Reinhart et al., 2000). The let-7 RNA was subsequently recognized in humans and other bilaterian animals, with temporal expression resembling that observed in C. elegans (Pasquinelli et al., 2000). This discovery showed that these regulatory RNAs were not mere curiosities of worms and led to the idea that additional “small temporal RNAs” might exist to regulate the timing of other development transitions (Pasquinelli et al., 2000). Soon thereafter, molecular searches for endogenous small RNAs refined the identities of the lin-4 and let-7 RNAs and revealed that these RNAs were actually part of a much larger class of small RNAs (Lagos-Quintana et al., 2001, Lau et al., 2001, Lee and Ambros, 2001). Members of this class all resembled lin-4 and let-7 RNAs in their small size and potential to be processed from hairpin precursors, but most were not expressed in a temporal manner. Because they were identified without the help of genetics, their functions were not known—what was known is that they were small, and so they were called “microRNAs” (Lagos-Quintana et al., 2001, Lau et al., 2001, Lee and Ambros, 2001).
At this point, interest in these small regulatory RNAs surged. It was clear that there were hundreds of different miRNAs in humans, flies, nematodes, and presumably other animals, which implied that in each of these species there were hundreds of different mRNAs that were regulated. Moreover, these potential regulatory targets were not just mRNAs involved in the timing of development—any mRNA that a biologist was studying might be regulated by one or more miRNAs. In this review, I touch on what has since been learned about the miRNAs of animals, particularly with respect to their biogenesis, genomics, regulation, mechanisms of action, target recognition, and biological functions.
miRNA Biogenesis, Genomics, and Evolution
The metazoan miRNA pathway derived from a more basal RNA-silencing pathway known as RNA interference (RNAi), which appears to have been present in the last common ancestor of eukaryotes and continues to defend against viruses and transposons in many extant eukaryotes (Shabalina and Koonin, 2008). The hallmark innovation of the miRNA pathway is the use of short hairpins to produce defined guide RNAs appropriate for directing the silencing machinery to specific cellular mRNAs, whereas the ancestral RNAi pathway starts with longer double-stranded RNA (dsRNA) precursors that each produce a large diversity of small interfering RNAs (siRNAs) (Bartel, 2004). When considering the intrinsic advantages of short hairpins for generating defined guide RNAs appropriate for enlisting RNA silencing for endogenous gene regulation, miRNAs might have been expected to have arisen more than once in eukaryotic evolution. Indeed, miRNAs or miRNA-like RNAs have emerged independently in other diverse eukaryotic lineages, including land plants (Jones-Rhoades et al., 2006), green algae (Molnár et al., 2007, Zhao et al., 2007), brown algae (Cock et al., 2010), filamentous fungi (Lee et al., 2010), and slime mold (Avesson et al., 2012).
Biogenesis of Canonical miRNAs
In animals, canonical miRNAs are transcribed by RNA polymerase II (Pol II) as part of much longer RNAs called “pri-miRNAs” (Lee et al., 2002, Cai et al., 2004, Lee et al., 2004) (Figure 1). Each pri-miRNA has at least one region that folds back on itself to form a hairpin substrate for Microprocessor, a heterotrimeric complex containing one molecule of the Drosha endonuclease and two molecules of its partner protein, DGCR8 (named Pasha in flies and nematodes) (Nguyen et al., 2015). Drosha has two RNase III domains that each cut one strand of the stem of the pri-miRNA hairpin with a 2 bp offset, which liberates a ∼60 nt stem-loop called a “pre-miRNA” (Lee et al., 2003) (Figure 1A). Note that although all canonical pri-miRNAs have a 5′ cap, as expected for Pol II transcripts, they do not necessarily have a poly(A) tail because cotranscriptional processing by Microprocessor can sometimes trigger transcription termination in a manner that preempts normal 3′-end maturation (Ballarino et al., 2009).
After export to the cytoplasm through the action of Exportin 5 and RAN–GTP (Yi et al., 2003, Bohnsack et al., 2004, Lund et al., 2004), the pre-miRNA is further processed by Dicer (Grishok et al., 2001, Hutvágner et al., 2001). Like Drosha, Dicer is an endonuclease with two RNase III domains (Bernstein et al., 2001, Zhang et al., 2004), and like Drosha, it associates with a partner protein, although the Dicer partner protein (named TRBP in mammals and Loquacious in flies) is essential for pre-miRNA processing in flies but not mammals (Ha and Kim, 2014). Dicer cuts both strands near the loop to generate the miRNA duplex, which contains the miRNA paired to its passenger strand (often called the “miRNA∗,” pronounced “miRNA star”). This duplex has a ∼2 nt 3′ overhang on each end, resulting from the offset cuts made by both Drosha and Dicer (Lee et al., 2003, Zhang et al., 2004) (Figure 1A). Once formed, the miRNA duplex is loaded into an Argonaute protein with assistance from chaperone proteins (HSC70/HSP90), which use ATP to help Argonaute assume a high-energy, open conformation suitable for binding the rigid miRNA duplex (Iwasaki et al., 2010). Following loading of the duplex, relaxation of Argonaute back to its ground-state conformation is thought to promote expulsion of the miRNA∗ to form the mature silencing complex (Kawamata and Tomari, 2010).
The choice of which strand of the duplex usually becomes the miRNA, i.e., the guide strand of the silencing complex, and which one usually is discarded and degraded as the miRNA∗ depends on the preferred orientation by which the duplex binds Argonaute, and this orientation of the duplex depends on which strand of the duplex has a 5′ terminus most suitable for loading into the pocket within Argonaute that binds the 5′-nucleoside monophosphate of guide RNAs. This pocket prefers a 5′-terminal pU or pA (Frank et al., 2010, Suzuki et al., 2015) as well as the 5′-nucleoside monophosphate of the strand with the least stable 5′-terminal pairing (Khvorova et al., 2003, Schwarz et al., 2003). Because the preferred loading orientation is independent of the duplex orientation within the pre-miRNA, the strand from either arm of the hairpin can be retained to become the miRNA (Figure 1B). Once loaded into the silencing complex, the miRNA pairs to sites within mRNAs and other transcripts to direct their posttranscriptional repression (Figure 1A).
Genomics and Evolution of Canonical miRNAs
High-throughput sequencing of small RNAs has transformed miRNA gene discovery (Lu et al., 2005, Ruby et al., 2006). Of course, not every small RNA sequenced from the cell is a miRNA. Some are other types of small RNAs, such as piwi-interacting RNAs (piRNAs) and endogenous siRNAs, which derive from related RNA-silencing pathways (Malone and Hannon, 2009), and others are degradation fragments of longer RNAs. To prevent these other types of small RNAs from being misannotated as miRNAs, stringent analysis pipelines require that the annotated miRNA have a consistent 5′ terminus and map to a potential hairpin supported by reads corresponding to both strands of the miRNA duplex, with its diagnostic ∼2 nt 3′ overhangs (Ruby et al., 2006). Using these criteria, recent analyses have supported the authenticity of many previously annotated genes and identified some new genes, bringing tallies of confidently identified canonical miRNA genes to 147 in C. elegans (Jan et al., 2011), 164 in Drosophila (Fromm et al., 2015), 475 in mouse (Chiang et al., 2010), and 519 in human (Fromm et al., 2015).
Although lin-4, let-7, and a couple of other metazoan miRNAs were named based on their mutant phenotypes, the other miRNAs, which were identified before a mutant phenotype was known, were named with numbers, in the order of their discovery (Ambros et al., 2003). Some, albeit imperfect, attempt was also made to give orthologs from different species the same name. For example, 11 of the 12 human orthologs of C. elegans let-7 bear the let-7 name (the other one being mir-98). Likewise, an attempt was made to assign similar names to paralogs within a species, with letter suffixes (a, b, c, . . .) distinguishing genes producing similar mature miRNAs and number suffixes (-1, -2, -3, . . .) distinguishing those producing identical mature miRNAs.
Of the more than 100 studies that have sought to annotate miRNAs, not all have imposed stringent criteria for gene annotation, leading to many additional gene annotations. For example, miRBase v21 lists 1,193 miRNA gene annotations in mouse and 1,881 in human (Kozomara and Griffiths-Jones, 2014), raising the question of how many of these additional annotations represent authentic miRNAs missed by the stringent annotation criteria (which might happen, for instance, if a miRNA was not expressed at sufficient levels in the sequenced samples), and how many are false-positive annotations. Experimental evaluation of a subset of these additional annotations indicates that a large majority are false positives (Chiang et al., 2010), implying that biologists interested in exploring miRNA functions should focus on only miRNA annotations that satisfy the stringent criteria, based on either existing or newly acquired high-throughput sequencing data.
MicroRNAs are grouped into families based on their targeting properties, which depend primarily on the identity of their extended seed region (miRNA nucleotides 2–8) (Bartel, 2009). For example, mice and humans each have three members of the miR-1/206 seed family (miR-1-1, miR-1-2, and miR-206), which are paralogous miRNAs that arose through duplication of an ancestral gene inherited from a common ancestor of all bilaterian animals. Indeed, members of the same seed family are usually evolutionarily related, and evolutionarily related miRNAs are usually members of the same seed family. However, the use of the term “family” does not strictly denote common ancestry. For example, miR-32, which is not related to other members of the miR-25/32/92/363/367 seed family, is nonetheless an adopted member because it has converged on the same extended seed and thus has the same targeting preferences. Other miRNAs, such as miR-200a and miR-200b, which are clearly related, have a single-nucleotide difference in their seed regions that place them into different families because of their divergent targeting preferences. As with paralogous proteins, members of the same seed families often have at least partially redundant functions, with severe loss-of-function phenotypes apparent only after multiple family members are disrupted (Tables 1 and 2). However, phenotypes are often observed after disruption of a single member, particularly in contexts in which that member is preferentially expressed (Tables 1 and 2, e.g., miR-206, miR-7a-2, miR-9-3, etc.).
Table 1Abnormal Phenotypes Observed in Mice after Knocking Out One or More Members of a Very Broadly Conserved miRNA Family | ||
miRNA family (# of genes) | miRNA(s) removed | Phenotype |
miR-1/206 (3) | miR-1-1, miR-1-2 | Postnatal lethality before weaning, complete penetrance, due to heart defects (Heidersbach et al., 2013, Wei et al., 2014) |
miR-206 | Abnormal airway smooth muscle innervation (Radzikinas et al., 2011); reduced regeneration of muscle and neuromuscular junctions after injury; accelerated pathology in neuromuscular disease models (ALS, Duchenne muscular dystrophy) (Williams et al., 2009, Liu et al., 2012) | |
miR-7 (3) | miR-7a-2 | Male and female infertility with hypogonadism due to pituitary defect (Ahmed et al., 2017); abnormal function of pancreatic beta cells (Latreille et al., 2014) |
let-7/miR-98 (12) | let7c-2∼let7b | Modest intestinal hypertrophy, with increased crypt fission (Madison et al., 2013); increase in megakaryocyte–erythroid progenitors and decrease in granulocyte–monocyte progenitors (Rowe et al., 2016); faster liver regeneration after resection (Wu et al., 2015) |
miR-9 (3) | miR-9-2, miR-9-3 | Postnatal lethality, complete penetrance; abnormal brain development (Shibata et al., 2011) |
miR-9-3 | Male and female infertility (Shibata et al., 2011) | |
miR-10 (2) | miR-10a | Increased intestinal adenomas in sensitized background (Apc) (Stadthagen et al., 2013) |
miR-10b | Enlarged spleen with expansion of germinal centers; reduced tumorigenesis of breast cancer in sensitized background (MMTV-PyMT) (Kim et al., 2016b) | |
miR-22 (1) | miR-22 | Reduced response of Th17 cells (Lu et al., 2015b); stress-induced cardiac dilation and contractile dysfunction (Gurha et al., 2012, Huang et al., 2013); reduced pathology in two disease models |
miR-29 (4) | miR-29b-1∼29a, miR-29b-2∼29c | Embryonic lethality, partial penetrance, with death of survivors at ∼4 weeks (Dooley et al., 2016, Sassi et al., 2017) |
miR-29b-1∼29a | Reduced lifespan; decreased body weight; progressive locomotor impairment and ataxia with Purkinje cell defects; reduced numbers of hematopoietic stem and progenitor cells; premature thymic involution; perturbed development of Th1 cells and other lymphoid cells; reduced susceptibility to induced autoimmune disease (Papadopoulou et al., 2011, Smith et al., 2012, Hu et al., 2015, Papadopoulou et al., 2015); reduced pathology in models of obesity and arthritis (Dooley et al., 2016, van Nieuwenhuijze et al., 2017) | |
miR-29b-2∼29c | Reduced pathology in models of obesity, arthritis, and heart disease (Dooley et al., 2016, Sassi et al., 2017, van Nieuwenhuijze et al., 2017) | |
miR-31 (1) | miR-31 | Perturbed T cell response and function; improved recovery from lymphocytic choriomeningitis virus infection; reduced pathology in model of autoimmune encephalomyelitis (Zhang et al., 2015, Moffett et al., 2017); increased tumorigenesis in model of colorectal cancer (Liu et al., 2017) |
miR-33 (1) | miR-33 | Increased cholesterol efflux and serum HDL (Horie et al., 2010); increased ApoE lipidation and Aβ degradation in the brain (Kim et al., 2015); increased obesity and liver steatosis on high-fat diet (Horie et al., 2013); reduced fibrosis in model of heart disease (Nishiga et al., 2017) |
miR-34/449 (6) | miR-34a, miR-34b∼34c, miR-449c∼449b∼449a | Postnatal lethality, incomplete penetrance; growth retardation; male and female infertility and respiratory dysfunction due to defective motile cilia (Song et al., 2014) |
miR-34b∼34c, miR-449c∼449b∼449a | Postnatal lethality, incomplete penetrance; growth retardation; abnormal brain development; male and female infertility and pulmonary pathology due to defective motile cilia (Wu et al., 2014); mitotic spindle orientation defects, neurogenesis defects with increased radial glial cells and decreased cortical neurons; thinner brain cortex (Fededa et al., 2016) | |
miR-34a, miR-34b∼34c | Resilience to stress-induced anxiety (Andolina et al., 2016); increased tumorigenesis in several sensitized backgrounds | |
miR-34a | Increased bone resorption (Krzeszinski et al., 2014); increased tumorigenesis in several sensitized backgrounds | |
miR-25/32/92/363/367 (7) | miR-92a-1 | Embryonic lethality, partial penetrance; smaller size; minor skeletal defects (Penzkofer et al., 2014, Han et al., 2015); reduced pathology in model of kidney disease (Henique et al., 2017) |
miR-96 (1) | miR-96 | Progressive hearing loss caused by sensory hair-cell degeneration due to arrested development of hair cells and auditory nerve connections (Lewis et al., 2009, Mencía et al., 2009, Kuhn et al., 2011) |
miR-124 (3) | miR-124-1 | Postnatal lethality, incomplete penetrance; reduced brain size and abnormal brain development with increased neuronal apoptosis, aberrant axon sprouting in hippocampal dentate granule neurons, and reduced retinal cone photoreceptor cells (Sanuki et al., 2011) |
miR-125 (3) | miR-125a | Defective differentiation of Treg cells; disrupted immune homeostasis with increased inflammation in immune disease models (Pan et al., 2015); perturbed development of kidneys and seminal vesicles; myeloproliferative disorder (Tatsumi et al., 2016) |
miR-133 (3) | miR-133a-1, miR-133a-2 | Perinatal lethality, incomplete penetrance, due to heart defects; adult heart failure (Liu et al., 2008); defective skeletal muscle development and function (Liu et al., 2011); increased browning of white adipose tissue (Liu et al., 2013) |
miR-184 (1) | miR-184 | Increased pancreatic beta cell proliferation and mass; decreased fasting blood glucose levels and increased fasting plasma insulin levels (Tattikota et al., 2014); thicker epidermis due to increased epidermal cell proliferation (Nagosa et al., 2017) |
miR-200bc/429 (3) | miR-200b, miR-429 | Female infertility due to pituitary defect (Hasuwa et al., 2013) |
miR-429 | Liver inflammation and dysfunction (Chen et al., 2018) | |
miR-210 (1) | miR-210 | Increased spontaneous autoantibodies (Mok et al., 2013); increased differentiation of Th17 cells with increased pathology in immune disease model (colitis) (Wang et al., 2014); decreased pathology in pulmonary hypertension model (White et al., 2015) |
miR-219 (2) | miR-219-1, miR-219-2 | Mostly penetrant neonatal lethality; knockout in oligodendrocytes causes myelination defects with tremors beginning at week 3, progressing to severe seizures, ataxia, and death by ∼4 months (Wang et al., 2017) |
miR-219-2 | Tremors at adulthood (Wang et al., 2017) | |
miR-375 (1) | miR-375 | Increased pancreatic alpha cells and decreased beta cells; hyperglycemia (Poy et al., 2009); altered membrane potential of beta cells (Salunkhe et al., 2015) |
View Table in HTML
This table lists results for 20 of the 27 murine miRNA families conserved since the bilaterian ancestor (Table S1). With regard to the other seven families, for four (miR-183, miR-193, 216a, and miR-365), a member has been knocked out but only in the context of a deletion that also removes one or two miRNAs from other families (Table 3), which complicates attribution of the phenotypes, and for the remaining three, a mouse knockout has yet to be reported. Some strains have more than one miRNA gene disrupted; multiple miRNAs from a disrupted polycistronic locus are linked with a tilde (∼), whereas those from distant loci are separated with a comma.
Table 2Abnormal Phenotypes Observed in Mice after Knocking Out One or More Members of a Broadly Conserved miRNA Family | ||
miRNA family (# of genes) | miRNA(s) removed | Phenotype |
miR-15/16/195/322/497 (7) | miR-15a∼16-1 | Increased proliferation of B cells; development of lymphoproliferative disorders (Klein et al., 2010); arrested maturation of natural killer cells (Sullivan et al., 2015); increased phagocytosis of macrophages; reduced mortality in bacterial sepsis models (Moon et al., 2014) |
miR-15b∼16-2 | Development of B cell lymphoproliferative disorders (Lovat et al., 2015) | |
miR-17/20/93/106 (6) | miR-17, miR-20a | Perinatal lethality, incomplete penetrance; vertebral homeotic transformations and other skeletal defects; reduced body weight; reduced pre-B cells (Han et al., 2015) |
miR-18 (2) | miR-18a | Reduced body weight (Han et al., 2015) |
miR-19 (3) | miR-19a, miR-19b-1 | Perinatal lethality, incomplete penetrance; reduced body weight; reduced tumorigenesis in sensitized background (Myc) (Han et al., 2015) |
miR-21 (1) | miR-21 | Reduced growth and increased apoptosis of eosinophil progenitors (Lu et al., 2013b); altered macrophage polarization with more M2 macrophages and fewer M1 macrophages (Wang et al., 2015); increased bone mass with decreased bone resorption (Hu et al., 2017); altered pathology in >20 disease/injury models |
miR-23 (2) | miR-23a | Deletion in T cells causes reduced survival of activated T cells and increased susceptibility to bacterial infection (Listeria) (Zhang et al., 2016a) |
miR-122 (1) | miR-122 | Development of fatty liver, hepatitis, fibrosis, and hepatocellular carcinoma (Hsu et al., 2012, Tsai et al., 2012); smaller hepatocytes with substantially fewer polyploid cells (Hsu et al., 2016); increased susceptibility to acetaminophen toxicity (Chowdhary et al., 2017) |
miR-126 (1) | miR-126 | Embryonic lethality, incomplete penetrance, with hemorrhaging due to loss of vascular integrity and defects in endothelial cell proliferation, migration, and angiogenesis (Kuhnert et al., 2008, Wang et al., 2008); defects in innate immune response (Agudo et al., 2014); exacerbated atherosclerosis in sensitized background (Apoe) (Schober et al., 2014) |
miR-128 (2) | miR-128-2 | Development of fatal epilepsy, complete penetrance, preceded by hyperactivity and increased exploration (Tan et al., 2013) |
miR-130/301 (4) | miR-301a | Reduced pathology in models of colitis and colitis-associated cancer; reduced tumorigenesis of lung, thymus, and skin cancers in sensitized background (Kras) (Ma et al., 2015) |
miR-132/212 (2) | miR-212∼132 | Impaired memory formation and retention (Hernandez-Rapp et al., 2015, Hansen et al., 2016); reduced depth perception with impaired maturation of binocular cortical cells (Mazziotti et al., 2017); perturbed circadian regulation with reduced dendritic spines in neurons of suprachiasmatic nucleus (Kiessling et al., 2017, Mendoza-Viveros et al., 2017); perturbed cortical synaptic transmission and plasticity (Remenyi et al., 2013); increased endothelial vasodilatory function; increased angiogenic responses (Kumarswamy et al., 2014); altered survival, proliferation, and differentiation of hematopoietic stem cells (Mehta et al., 2015); altered pathology in six disease/injury models |
miR-137 (1) | miR-137 | Early embryonic lethality, complete penetrance (Crowley et al., 2015) |
miR-139 (1) | miR-139 | Increased pathology in models of colitis and colitis-associated cancer (Maoa et al., 2015, Zou et al., 2016) |
miR-140 (1) | miR-140 | Dwarfism and craniofacial deformities, with premature differentiation of chondrocytes; osteoarthritis (Miyaki et al., 2010, Nakamura et al., 2011); increased Leydig cells in developing testes (Rakoczy et al., 2013) |
miR-142 (1) | miR-142 | Reduced CD4+ dendritic cells (Mildner et al., 2013); decreased platelets due to impaired megakaryocyte maturation (Chapnik et al., 2014); reduced function of mast cells (Yamada et al., 2014); impaired T and B1B cell development and function with immunodeficiency (Kramer et al., 2015, Sun et al., 2015, Mildner et al., 2017); impaired erythropoiesis leading to anemia (Rivkin et al., 2017) |
miR-143 (1) | miR-143 | Arterial smooth muscle defects with perturbed actin stress fibers; less scarring in response to vascular injury (Xin et al., 2009) |
miR-145 (1) | miR-145 | Reduced vascular tone with reduced blood pressure; reduced heart mass; arterial smooth muscle defects with perturbed actin stress fibers; increased lethality and less scarring in response to vascular injury (Xin et al., 2009); reduced pathology in pulmonary disease model (fibrosis) (Yang et al., 2013) |
miR-146 (2) | miR-146a | Hyperresponsive macrophages, exaggerated inflammatory response, and myeloid cell proliferation; eventual autoimmune disorders, exhaustion of hematopoietic stem cells, and hematopoietic neoplasms (Lu et al., 2010, Boldin et al., 2011, Zhao et al., 2013); altered pathology in five disease models |
miR-146b | Enlarged spleens and increased myeloid cells, spontaneous intestinal inflammation, enhanced M1 macrophage polarization (Peng et al., 2016) | |
miR-148/152 (3) | miR-148a | Elevated serum and hepatic cholesterol; accelerated carcinogenesis in chemically induced liver cancer (Cheng et al., 2017) |
miR-150 (1) | miR-150 | Increased B1 cells and enhanced humoral immune response (Xiao et al., 2007); impaired maturation of natural killer T cells (Bezman et al., 2011, Zheng et al., 2012); impaired CD8+ T cell differentiation and function (Smith et al., 2015); reduced bone mass (Choi et al., 2015); altered pathology in eight disease models |
miR-155 (1) | miR-155 | Impaired development and function of B cells, T cells, and dendritic cells (Rodriguez et al., 2007, Thai et al., 2007, Kohlhaas et al., 2009, Lu et al., 2009, O’Connell et al., 2010, Dunand-Sauthier et al., 2011); perturbed function of macrophages and mast cells (O’Connell et al., 2009, Qayum et al., 2016); altered pathology in >60 disease/injury models |
miR-181 (6) | miR-181a-1∼181b-1, miR-181a-2∼181b-2, miR-181c∼181d | Embryonic lethality, complete penetrance (Fragoso et al., 2012, Henao-Mejia et al., 2013) |
miR-181a-1∼181b-1, miR-181a-2∼181b-2 | Postnatal lethality, incomplete penetrance; reduced body weight; defects in B and T cell development (Henao-Mejia et al., 2013) | |
miR-181a-1∼181b-1 | Defects in B and T cell development, loss of natural killer T cells (Fragoso et al., 2012, Henao-Mejia et al., 2013, Ziętara et al., 2013); reduced leukemia in sensitized background (Notch1) (Fragoso et al., 2012) | |
miR-182 (1) | miR-182 | Defective antibody response (Li et al., 2016); muscle loss; fast-to-slow fiber-type switching; impaired glucose metabolism (Zhang et al., 2016b); fewer circulating tumor cells and reduced metastasis in model of lung metastasis (Sachdeva et al., 2014) |
miR-191 (1) | miR-191 | Deletion in T cells causes reduced survival of T cells (Lykken and Li, 2016) |
miR-192/215 (2) | miR-192 | Reduced kidney pathology in diabetes model (Deshpande et al., 2013) |
miR-196 (3) | miR-196a, miR-196a-2, miR-196b | Vertebral homeotic transformations, increased vertebral number (Wong et al., 2015) |
miR-199 (3) | miR-199a-2 | Premature death, starting at 6 weeks; smaller body size; perturbed gait, body trembling, irregular breathing, and hypoactivity; smaller brain size with perturbed hippocampal and cortical neuron shape and density (Tsujimura et al., 2015) |
miR-203 (1) | miR-203 | Thickened epidermis, enhanced tumorigenesis in chemically induced skin cancer (Riemondy et al., 2015) |
miR-204/211 (2) | miR-204 | Improved glucose tolerance; reduced pathology in model of diabetes (Jo et al., 2018) |
miR-211 | Retinal cone degeneration and impaired vision (Barbato et al., 2017) | |
miR-205 (1) | miR-205 | Postnatal lethality, incomplete penetrance; fragile skin and impaired hair growth due to reduced neonatal expansion of progenitors and stem cells of hair follicles and skin (Farmer et al., 2013, Wang et al., 2013); no tear glands, 50% penetrance, or defective tear-gland development (Farmer et al., 2017); deletion in thymic epithelial cells causes of T cell defects (Hoover et al., 2016) |
miR-208 (2) | miR-208a | Abnormal atrial cardiac conduction (Callis et al., 2009); reduced cardiac pathology in response to stress and in model of hypothyroidism (van Rooij et al., 2007) |
miR-214 (1) | miR-214 | Increased pathology in heart disease model (Aurora et al., 2012) |
miR-218 (2) | miR-218-1, miR-218-2 | Neonatal lethality, complete penetrance, due to lack of respiration; failure of many motor neurons to establish neuromuscular junctions; motor neuron hyperexcitability and degeneration (Amin et al., 2015) |
miR-223 (1) | miR-223 | Perturbed development of myeloid lineages, with increased proliferation of progenitors and hyperactive neutrophils and macrophages; increased spontaneous and induced lung pathology (after endotoxin challenge); altered susceptibility to infection (Johnnidis et al., 2008, Zhuang et al., 2012, Lu et al., 2013a); increased serum cholesterol and triglycerides (Vickers et al., 2014); altered pathology in >12 disease models. |
miR-338 (1) | miR-338 | Heterozygous deletion causes disrupted synaptic transmission at auditory thalamocortical projections (Chun et al., 2017) |
miR-451 (1) | miR-451 | Mild anemia and splenomegaly due to impaired erythroblast maturation (Patrick et al., 2010, Rasmussen et al., 2010) |
View Table in HTML
This table lists results for 36 of the 62 families that are conserved among mammals and fish but emerged since the bilaterian ancestor (Table S1). For 2 of the remaining 26 families, a knockout has been generated and reported to have no observed phenotype; these are the miR-30 family, for which two of the six members (miR-30d ∼miR-30b) have been simultaneously deleted (Park et al., 2012), and the miR-144 family, for which the one-and-only member was deleted (Rasmussen et al., 2010). Some strains have more than one miRNA gene disrupted; multiple miRNAs from a disrupted polycistronic locus are linked with a tilde (∼), whereas those from distant loci are separated with a comma.
In some cases, the same pri-miRNA harbors multiple miRNA hairpins (Figure 1B). To simplify annotation, each miRNA hairpin is annotated as deriving from an individual miRNA gene, with clustered miRNAs processed from the same transcript thought of as polycistronic (Lagos-Quintana et al., 2001, Lau et al., 2001, Lee et al., 2002). Although many miRNAs are processed from long ncRNAs whose only known function is to produce the miRNA, some miRNA genes overlap other genes. For example, more than a quarter of the conserved miRNAs and more than half of the poorly conserved miRNAs of mammals appear to be processed from introns of protein-coding genes (Chiang et al., 2010). Thus, some pre-mRNAs are also pri-miRNAs and vice versa (Figure 1B).
Of the 500-plus canonical miRNA genes confidently identified in the human genome, 296 fall within 177 seed families conserved among placental mammals (Table S1). Among these, 200 genes fall within 89 seed families conserved since the vertebrate ancestor of humans and bony fish, and among these, 75 genes fall within 27 seed families conserved since the bilaterian ancestor of humans, flies and nematodes (Table S1). One of these seed families (miR-99/100) predates the emergence of bilaterian animals, in that a closely related miRNA is found in sea anemone (Grimson et al., 2008). As a class of regulatory molecules, metazoan miRNAs apparently emerged even earlier—with the advent of multicellular animals, as indicated by the presence of both miRNAs and the metazoan Microprocessor in sponge but not more deeply branching species (Grimson et al., 2008). Assuming both a common ancestry of sponge and bilaterian miRNA pathways and a phylogeny in which sponges are the most deeply branching animal lineage (Simion et al., 2017), miRNAs were lost in both the placozoan and ctenophore lineages (Grimson et al., 2008, Moroz et al., 2014).
Because conserved miRNA genes can be found computationally (Lim et al., 2003) and tend to be expressed at easily detectable levels (Landgraf et al., 2007, Ruby et al., 2007b), the lists of conserved miRNA genes are likely to be nearly complete. Indeed, with one exception, all human miRNA genes conserved at least to fish were found at least 12 years ago, before the era of high-throughput sequencing. The exception is the MIR1306 gene (Friedländer et al., 2008, Morin et al., 2008), which was subsequently shown to be a repressive control element that overlaps the DGCR8 coding sequence (Han et al., 2009). Weak accumulation of mature miR-1306 suggests that its conservation is primarily due to protein-coding constraints and a negative-feedback role for the pri-miR-1306 hairpin in controlling DGCR8 expression, in which excess Drosha–DGCR8 activity crops the hairpin, thereby severing the DGCR8 mRNA (Han et al., 2009).
Although the lists of conserved miRNA genes appear to have approached completion, the number of confidently identified but poorly conserved genes continues to grow as more samples are subjected to high-throughput small-RNA sequencing, allowing very lowly or narrowly expressed miRNAs, which also tend to be poorly conserved, to exceed the thresholds for confident annotation. Thus, the recent tally of 519 is only a lower bound on the number of canonical miRNA genes in the human genome, and the actual number seems likely to exceed 600, with the number of confidently identified but poorly conserved human genes likely to surpass the 296 human miRNA genes belonging to families that were present in the last common ancestor of placental mammals. However, most of the poorly conserved miRNAs that remain unannotated have avoided identification because of either their very low or narrow expression levels, and thus current profiles of miRNAs presumably fail to account for only a tiny fraction of the miRNA molecules in the cell or tissue—an omission that seems all the more tolerable when considering the evidence that detectable miRNA-mediated repression requires high miRNA levels (i.e., hundreds if not thousands of molecules representing the miRNA seed family in each cell) (Bosson et al., 2014, Denzler et al., 2016).
Most poorly conserved miRNAs are classified as such because they are relatively young, deriving either from duplication and divergence of an ancestral gene or from the de novo emergence of a new gene (Ruby et al., 2007b, Fromm et al., 2015). Some of the recently emergent miRNAs are expressed at levels that could mediate consequential repression, and some of these found in humans might have critical primate-specific or even species-specific functions—perhaps helping to determine what makes us human. Indeed, in the Drosophila clade, clusters of poorly conserved miRNAs expressed in the testes are evolving under positive selection (Mohammed et al., 2018). More generally, however, the recent origins of most poorly conserved miRNAs suggests that poorly conserved miRNAs have not only been rapidly gained but also rapidly lost over the course of metazoan evolution, which in turn suggests that most poorly conserved miRNAs in contemporary species have probably not yet acquired a fitness-enhancing targeting interaction and seem destined to be lost before they do acquire an interaction favoring their selective retention.
Defining Features of Canonical miRNA Genes
Although hundreds of different conserved and nonconserved hairpins enter the miRNA biogenesis pathway, many thousands of others do not. The gatekeeper of the canonical pathway, which discriminates between authentic pri-miRNAs and the other hairpin-containing transcripts, is Microprocessor. When choosing the hairpins that will enter the pathway, the mammalian Microprocessor prefers a stem of 35 ± 1 bp (Fang and Bartel, 2015), a terminal unstructured loop (Zeng et al., 2005), and single-stranded regions flanking the base of the hairpin (Zeng and Cullen, 2005, Han et al., 2006) (Figure 2). Although a few mismatches or small bulges are tolerated, pairing is preferred throughout the stem, except near the base of the stem, where a complex motif that centers on a mismatch at position 8 can modestly enhance processing (Fang and Bartel, 2015). In addition, three simple primary-sequence motifs—one at each end of the stem and another in the 3′ flanking region—enhance processing (Auyeung et al., 2013) (Figure 2).
Microprocessor measures from both ends of a 35 bp stem, making offset cuts 13 and 11 nt from one end of the stem and 22 and 24 nt from the other end (Han et al., 2006, Ma et al., 2013, Fang and Bartel, 2015) (Figure 2). The motifs interact with either Microprocessor or its auxiliary factors to break the symmetry of this recognition, ensuring that Microprocessor binds in the orientation that cleaves closer to the base of the hairpin rather than the one that cleaves closer to the loop (Fang and Bartel, 2015, Nguyen et al., 2015). Interactions with these motifs also compensate for the structural defects (e.g., sub-optimal stem length, mismatches, and small bulges) observed in most natural pri-miRNAs, enhancing recognition and helping to specify the cleavage site (Fang and Bartel, 2015). Only a subset of the four motifs are required to orient the cleavage and compensate for the structural imperfections of most mammalian miRNA hairpins, and thus very few mammalian pri-miRNAs have all four motifs.
Although Microprocessor takes a lead role in choosing the transcripts that will proceed down the canonical miRNA pathway, the specificity of downstream processes, such as nucleocytoplasmic transport, Dicer-catalyzed cleavage, and Argonaute-loading (Figure 1A), can impose secondary constraints on the hairpins that can ultimately give rise to miRNAs. For example, some artificial hairpins with all four motifs can be cleaved by Microprocessor even if their stems are too short to generate a product suitable for Dicer cleavage (Fang and Bartel, 2015). However, the ideal substrates of Microprocessor give rise to pre-miRNA hairpins that have a 22 bp stem with a 5′ phosphate and a 2 nt 3′ overhang (Fang and Bartel, 2015), which are ideal substrates of the mammalian Dicer (Zhang et al., 2004, Park et al., 2011, Gu et al., 2012). Indeed, artificial genes designed solely on Microprocessor-preferred features generate miRNAs that accumulate to higher levels than do miRNAs deriving from co-expressed natural miRNA hairpins, further illustrating how Microprocessor preferences are in sync with those of downstream processes and largely define the genes that give rise to miRNAs (Fang and Bartel, 2015).
In most other bilaterian animals, the defining features of canonical miRNA genes resemble those identified in mammals. In nematodes, however, these features have diverged to the point that little if any miRNA is produced when C. elegans pri-miRNAs are expressed within mammalian cells (Auyeung et al., 2013). The defining features of C. elegans miRNA genes, as well as those of miRNA genes from deeply branching animals (such as sponge or sea anemone) are mostly unknown.
Noncanoncial miRNA Genes
Some miRNA genes produce pri-miRNAs that bypass the requirement for either Drosha- or Dicer-catalyzed cleavage. For example, certain debranched introns enter the miRNA biogenesis pathway as pre-miRNA hairpins, with their ends having been defined by the spliceosome rather than by Drosha (Okamura et al., 2007, Ruby et al., 2007a) (Figure 3B). These RNAs that are both pre-miRNAs and introns are called “mirtrons” (Ruby et al., 2007a). In addition, some miRNA genes produce tailed mirtrons, which resemble classical mirtrons except they have a tail of flanking RNA at either the 5′ or 3′ end of the pre-miRNA hairpin, which is removed by non-Drosha nucleases before the pre-miRNA enters the pathway (Ruby et al., 2007a, Babiarz et al., 2008). Indeed, most mammalian mirtrons have a 5′ tail (Wen et al., 2015).
A second class of noncanonical miRNA genes produce endogenous short-hairpin RNAs (shRNAs) (Babiarz et al., 2008) (Figure 3C), which resemble the Drosha-independent type of artificial shRNAs designed for gene-knockout experiments (Tuschl, 2002). For a few endogenous shRNAs, namely those from two adenoviral hairpins transcribed by Pol III, transcriptional initiation and termination define the respective ends of the Dicer substrate (Andersson et al., 2005, Bellutti et al., 2015). However, for the best-characterized cellular endogenous shRNAs, which are Pol II products, the transcription start site defines the 5′ terminus of the Dicer substrate, and maturation of the 3′ terminus, although known to be Drosha independent, is uncharacterized (Babiarz et al., 2008, Xie et al., 2013). Definition of the 5′ terminus by Pol II has downstream ramifications, in that the 5′ cap structure prevents the 5′ (5p) strand of the miRNA duplex from entering the silencing complex, and thus mature miRNAs accumulate from only the 3p strand (Babiarz et al., 2008, Xie et al., 2013). Many additional noncanonical miRNAs derive from hairpins that map to the 5′ regions of mRNAs or their divergent antisense transcripts; for these endogenous shRNAs, maturation of the 5′ and 3′ termini of the Dicer substrate, although known to be Drosha independent, has not been characterized (Zamudio et al., 2014).
A third class of noncanonical miRNA genes are transcribed in tandem with or as part of another type of small-RNA gene, often depending on the transcriptional and processing determinants of the other RNA to help generate the hairpin substrate of Dicer (Figure 3D). For example, some miRNAs from murine γ-herpesvirus 68 (MHV68) derive from hairpins transcribed in tandem with tRNA-like molecules (Pfeffer et al., 2005), and miRNAs from a primate γ-herpesvirus derive from hairpins transcribed in tandem with Sm class small nuclear RNAs (snRNAs) (Cazalla et al., 2011). Other examples of these chimeric hairpin RNAs include miR-1839, a mammalian miRNA that derives from a hairpin that falls within the ACA45 small nucleolar RNA (snoRNA) (Ender et al., 2008), and miR-1983, a murine miRNA that derives from a hairpin that partially overlaps a tRNA (Babiarz et al., 2008).
The mammalian noncanonical miRNA with the most clearly established function is miR-451. miR-451 is one of the most highly expressed miRNAs in the erythrocytes of vertebrate animals, and mice lacking this miRNA have defects in erythroblast maturation (Patrick et al., 2010, Rasmussen et al., 2010). In contrast to the other known noncanonical miRNAs, which require Dicer but not Drosha for their biogenesis (Figures 3B–3D), miR-451 requires Drosha but not Dicer (Cheloufi et al., 2010, Cifuentes et al., 2010, Yang et al., 2010). Following Drosha cleavage, the pre-miR-451 hairpin is too short to be cleaved by Dicer and instead directly enters Argonaute2 (Ago2), which cleaves the 3′ arm of the hairpin to generate a product that is further resected by the PARN exonuclease to produce the mature miRNA (Cheloufi et al., 2010, Cifuentes et al., 2010, Yang et al., 2010, Yoda et al., 2013) (Figure 3E). The question as to why throughout all vertebrate species miR-451, and seemingly only miR-451, is generated through this unusual pathway is one of the curiosities of miRNA evolution.
As more high-throughput sequencing datasets are analyzed, loci encoding mirtrons, tailed mirtrons, and endogenous shRNAs are making up a growing fraction of the reported miRNA genes of animals. Indeed, in species with the deepest small-RNA–sequencing coverage, including human, the number of noncanonical miRNA loci appears to exceed the number of confidently annotated canonical miRNA genes (Zamudio et al., 2014, Wen et al., 2015). However, with a few exceptions, which include miR-63, a mirtron-derived miRNA highly expressed in C. elegans (Ruby et al., 2007a), miR-320, an endogenous shRNA highly expressed in mammals (Babiarz et al., 2008), and miR-451, noncanonical miRNAs of animals are both poorly conserved and lowly expressed, which suggests that most lack important regulatory functions. Indeed, in flies, a degradation pathway preferentially destroys mirtronic miRNAs, implying that fly mirtrons might generally be detrimental if left unchecked (Bortolamiol-Becet et al., 2015, Reimão-Pinto et al., 2015).
miRNA Isoforms
Alternative choices along the miRNA biogenesis pathway can generate multiple miRNA isoforms (sometimes called “isomiRs”) from the same miRNA gene. For example, inconsistent choice of the strand loaded into Argonaute can generate functional miRNAs from both strands of the miRNA duplex, each with different sets of regulatory targets. Furthermore, imprecise cleavage by Drosha or Dicer can generate miRNAs with heterogeneous 5′ or 3′ ends, with heterogeneity at the 5′ end, which produces miRNAs with different seeds, having a profound effect on target recognition. In addition, different miRNA genes can be transcribed from the two different strands of the same DNA locus.
For most conserved miRNA hairpins, evolutionary selection appears to have enhanced the precision of biogenesis so as to maximize targeting specificity. Despite frequent heterogeneity at miRNA 3′ ends, conserved miRNAs typically have >98% homogeneity with respect to both strand choice and 5′ end identity and even higher homogeneity with respect to the strand of the DNA that gives rise to the miRNA (Chiang et al., 2010). There are, however, some interesting exceptions in which alternative isoforms expand the regulatory repertoire of conserved miRNA genes. For example, isoforms from both the 5′ and the 3′ strand of the miR-10 duplex (named miR-10-5p and miR-10-3p, respectively) appear to have regulatory roles in Drosophila (Ruby et al., 2007b, Stark et al., 2007a), and both the major and the minor 5′ end isoform of miR-223 (named miR-223.1 and miR-223.2, respectively) repress their respective targets in mouse neutrophils (Chiang et al., 2010). Overall, among the 90 miRNA families conserved to fish (89 canonical families plus miR-451), 12 produce at least one additional isoform that both has unique targeting potential and is expressed in human, mouse and zebrafish at ≥33% the level of the dominant isoform (Table S1) (Agarwal et al., 2015). Although no more than a very low level of miRNA production appears to occur from transcription in the antisense orientation in mammals (Chiang et al., 2010), transcription from both strands does expand the regulatory functions of one Drosophila locus (Bender, 2008, Stark et al., 2008, Tyler et al., 2008).
For a few miRNAs, alternative isoforms are also generated from adenosine-to-inosine (A-to-I) RNA-editing, which primarily occurs in the brain and is more prevalent in the human than in the mouse (Landgraf et al., 2007). Because I forms a Watson–Crick pair with C, not U, these edits can impact targeting, especially when they alter a seed nucleotide in a large fraction of the miRNA molecules (Kawahara et al., 2007b), although this occurs only rarely. Interestingly, the few miRNA edits likely to impact targeting alter mostly miRNAs that derive from an imprinted locus that includes 36 miRNA genes (Kawahara et al., 2008, Chiang et al., 2010).
Additional alternative isoforms are generated from terminal-nucleotide–transferase or 3′-exonuclease activities, which extend or trim miRNA 3′ termini. Although much more prevalent than A-to-I edits (Kim et al., 2016c), these modifications are not expected to substantially influence target recognition because they occur at the 3′ ends of the miRNAs. Instead, these terminal modifications are sometimes associated with pathways that influence miRNA processing and stability (Ha and Kim, 2014).
Regulation of miRNA Expression
Regulation of miRNA Production
MicroRNAs processed from the introns of mRNAs are under the same transcriptional regulation as the mRNA of their host genes (Baskerville and Bartel, 2005). All other canonical and nearly all other noncanonical miRNA loci are also under Pol II transcriptional control, which is indistinguishable from that of protein-coding genes (O’Donnell et al., 2005, Marson et al., 2008, Ozsolak et al., 2008). Exceptions include the two adenovirus shRNAs and the MHV68 chimeric hairpin RNAs, which are Pol III transcripts (Andersson et al., 2005, Bogerd et al., 2010, Diebel et al., 2010) (Figure 3).
Once the loci have been transcribed, diverse regulatory factors (often with additional, previously established cellular functions) influence the stability or processing of individual hairpins (Ha and Kim, 2014, Du et al., 2015, Treiber et al., 2017). For example, in C. elegans, targeting of the let-7 pri-miRNA by the mature let-7 miRNA can enhance processing in a positive-feedback loop (Zisoulis et al., 2012). In addition, multiple regulatory mechanisms influence the accumulation and activity of both Microprocessor and Dicer (Ha and Kim, 2014). Although changing these core activities broadly impacts miRNA production, individual miRNAs are differentially sensitive, which causes expression of some miRNAs to be affected more than others.
Of the posttranscriptional regulatory pathways that influence pri- or pre-miRNA processing or stability, one of the best-characterized involves the Lin28-directed oligo(U) tailing of the let-7 family of pre-miRNAs. Lin28 binds to the terminal loop of let-7 pre-RNAs, which promotes oligo(U)-tailing at the 3′ end of these pre-miRNAs, through the recruitment of terminal uridylyl transferases, TUT4 and TUT7 (Heo et al., 2008, Hagan et al., 2009, Heo et al., 2009, Nam et al., 2011). The oligo(U) blocks Dicer processing and stimulates pre-miRNA decay through the action of the 3′–5′ exonuclease, DIS3L2 (Heo et al., 2008, Chang et al., 2013, Ustianenko et al., 2013). Highlighting the potential biological relevance of this pathway, the let-7 miRNAs promote cellular differentiation (Melton et al., 2010), and Lin28 is one of the factors that can help reverse this differentiation during induction of pluripotent stem cells (Yu et al., 2007).
In a few cases, addition of a single nucleotide to the end of a pre-miRNA acts to enhance processing rather than inhibit it (Heo et al., 2012). Another posttranscriptional modification that influences miRNA production is A-to-I editing of pri-miRNAs, which can inhibit pri-miRNA processing and stimulate pri-miRNA degradation (Yang et al., 2006, Kawahara et al., 2007a).
Regulation of miRNA Decay
Once loaded into the silencing complex, most miRNAs are very stable, with half-lives of days (van Rooij et al., 2007, Bail et al., 2010, Gantier et al., 2011). Indeed, for many cases in which miRNAs are processed from the same primary transcripts as mRNAs, the miRNA accumulates to a much higher level than that of the cotranscribed mRNA, implying that the miRNA is much more stable. However, not all miRNAs are so stable. For example, some neuronal miRNAs have variable stability (Krol et al., 2010), and a few miRNAs are constitutively unstable, enabling a more rapid response to transcriptional changes (Rissland et al., 2011).
Artificial targets with extensive pairing to the miRNA can trigger miRNA degradation accompanied by 3′-terminal tailing and trimming of the mature miRNA (Ameres et al., 2010). In mammals, this phenomenon of target RNA–directed miRNA degradation (TDMD) is most robust in neuronal cells (de la Mata et al., 2015). In contrast to target repression, TDMD is particularly sensitive to the extent of pairing to the 3′ end of the miRNA, suggesting that this pairing might pull the 3′ end of the miRNA from the protective confines of the Argonaute protein, where it can then be acted upon by terminal transferases and exonucleases. The occurrence of TDMD among diverse eukaryotic species would explain why many classes of small silencing RNAs that routinely recognize targets with extensive complementarity (including piRNAs and endogenous siRNAs of animals and miRNAs and siRNAs of plants) are methylated at their terminal 2′ oxygen, as this methylation prevents tailing (Li et al., 2005, Ameres et al., 2010).
Natural triggers of TDMD could in principle explain some of the differential stability observed among endogenous miRNAs (Marzi et al., 2016). Such triggers were first discovered among the ncRNAs of the herpesvirus family (Cazalla et al., 2010, Libri et al., 2012, Marcinowski et al., 2012, Lee et al., 2013), with pairing between the viral ncRNAs and host miRNAs shown to be critical for virus production, presumably because of the consequent TDMD (Marcinowski et al., 2012, Lee et al., 2013). More recently, transcripts that trigger TDMD have also been identified among cellular RNAs (Bitetti et al., 2018, Kleaveland et al., 2018). For example, in some tissues, the long ncRNA Cyrano reduces miR-7 levels by >97%, with the efficiency of this TDMD greatly exceeding that of artificial or viral transcripts (Kleaveland et al., 2018). This targeted degradation of miR-7 can derepress mRNA targets of miR-7, but more strikingly, it enables a circular RNA called “Cdr1as” to accumulate in processes and other cytoplasmic regions of mammalian neurons (Kleaveland et al., 2018). Cdr1as is required for proper neuronal activity, acting through an unknown mechanism thought to involve miR-7, as Cdr1as orthologs found throughout placental mammals each have many sites to miR-7 (e.g., 73 and 130 miR-7 sites in human and mouse Cdr1as, respectively) (Hansen et al., 2013, Memczak et al., 2013, Piwecka et al., 2017).
miRNA Regulatory Mechanisms
In animals, the recognition and repression of target transcripts can occur through two alternative modes (Figure 1A). If pairing between the miRNA and the target site is sufficiently extensive, and if the miRNA is associated with an Argonaute protein that has retained its ancestral ability to catalyze the endonucleolytic cleavage characteristic of RNA interference (RNAi) (Liu et al., 2004, Meister et al., 2004), then the miRNA directs slicing of the target transcript (Hutvágner and Zamore, 2002, Yekta et al., 2004) (Figure 1A). This slicing mode of repression is common for plant miRNAs (Jones-Rhoades et al., 2006) and might also play a widespread role in sea anemone (Moran et al., 2014). It is also the basis of siRNA-mediated mRNA-knockdown technologies that have transformed biomedical research and show promise in the clinic (Elbashir et al., 2001, Bobbin and Rossi, 2016). In humans and other mammals, however, miRNA-directed slicing has been reported for only 20 cellular transcripts (Yekta et al., 2004, Davis et al., 2005, Shin et al., 2010), one circular RNA (Hansen et al., 2011), and a few viral mRNAs (Sullivan et al., 2005, Barth et al., 2008), and examples of endogenous miRNA-directed slicing have yet to be reported in non-mammalian bilaterian animals. Thus, most bilaterian miRNAs do not direct the slicing of a single endogenous target.
The repression mode that dominates in humans and other bilaterian animals acts without slicing the mRNA and does not require extensive pairing to the miRNA (Figure 1A). Instead, it requires the adaptor protein TNRC6 (present in mammals as three paralogs, TNRC6A/B/C, which have a single ortholog in flies, named GW182, and two in nematodes, named AIN-1/2) (Ding et al., 2005, Rehwinkel et al., 2005, Jonas and Izaurralde, 2015). Recruited by Ago, TNRC6 interacts with the poly(A)-binding protein (PABPC) associated with the mRNA poly(A) tail and also recruits deadenylase complexes, i.e., the PAN2–PAN3 complex and most importantly the CCR4–NOT complex (Jonas and Izaurralde, 2015) (Figure 4). The deadenylases shorten the poly(A) tail, which in most systems causes mRNA destabilization through decapping and 5′-to-3′ exonucleolytic decay (Chen and Shyu, 2011). Recruitment of TNRC6 also causes the mRNA to be less efficiently translated, an effect on translation initiation thought to be mediated, at least in part, by CCR4-NOT and its recruitment of DDX6, a helicase that binds the decapping complex and is reported to inhibit translation (Chu and Rana, 2006, Jonas and Izaurralde, 2015) (Figure 4). Moreover, DDX6 interacts with eIF4E transporter (4E-T), which competes with eIF4G for binding to eIF4E and enhances both the decay and translational repression of miRNA targets (Kamenska et al., 2014, Nishimura et al., 2015, Ozgur et al., 2015, Kamenska et al., 2016).
The relative importance of mRNA decay and translational repression, the two repressive consequences of this TNRC6-mediated regulatory mode, depends on the developmental context of the cell. In all post-embryonic cells examined, mRNA decay dominates, as revealed by experiments comparing changes in mRNA levels with changes in either protein levels or translational efficiency after either introducing or knocking out a miRNA (Baek et al., 2008, Hendrickson et al., 2009, Guo et al., 2010, Eichhorn et al., 2014). Indeed, regardless of miRNA identity, cell type, growth condition, or translational state, mRNA destabilization explains most (66% to >90%) of the steady-state repression mediated by mammalian miRNAs (Eichhorn et al., 2014). Examination of the dynamics of translational repression and mRNA destabilization as a miRNA is induced shows that although translational repression occurs rapidly (Béthune et al., 2012, Djuranovic et al., 2012), its effect on endogenous mRNAs is relatively weak, such that by the time consequential repression ensues, the effect of mRNA destabilization dominates (Eichhorn et al., 2014). These results are welcome news for those seeking to determine the molecular consequences of a miRNA. In post-embryonic contexts, the miRNA can be perturbed, and the effects on target mRNA levels (which are easier to measure than effects on either protein levels or translational efficiency) provide a nearly quantitative readout of the miRNA-mediated repression.
In the early zebrafish embryo, however, translational repression is the only repressive consequence of the TNRC6-mediated regulation, and thus monitoring mRNA levels completely misses the effects of miRNAs (Bazzini et al., 2012, Subtelny et al., 2014). This difference in the consequences of miRNA repression occurs because the posttranscriptional regulatory context is very different in the early embryo compared to later embryos or post-embryonic cells, which leads to very different consequences of miRNA-mediated poly(A)-tail shortening. In early embryos, shortening the tail of an mRNA does not change its stability but dramatically decreases its translational efficiency, whereas at gastrulation and in post-embryonic cells, shortening the tail of an mRNA reduces its stability but does not change its translational efficiency (Subtelny et al., 2014).
miRNA Target Recognition
Canonical Regulatory Sites
For sites that promote TNRC6-mediated repression, target recognition is primarily through Watson–Crick pairing between the miRNA seed (miRNA nucleotides 2–7; distinguished in this review from the “extended seed” or the “seed region,” which include miRNA nucleotides 2–8) and sites within the 3′ UTRs of target mRNAs (Bartel, 2009). Augmenting the 6 nt seed match, most of these regulatory sites also have either an additional match to miRNA nucleotide 8 or an A across from miRNA nucleotide 1, or both, to make 7 or 8 nt sites, respectively (Lewis et al., 2005) (Figure 5A). These 7–8 nt sites mediate the bulk of the repression for each miRNA and are the sites identified by the most effective target-prediction tools (Bartel, 2009, Agarwal et al., 2015). Nonetheless, the 6 nt sites that either match only the seed or are offset by one nucleotide in either the 5′ or 3′ direction (Figure 5A) can sometimes also mediate detectable repression (Friedman et al., 2009, Jan et al., 2011, Kim et al., 2016a), as can 7–8 nt sites in open reading frames (ORFs) or 5′ UTRs (Lewis et al., 2005, Stark et al., 2007b, Schnall-Levin et al., 2010, Agarwal et al., 2017). Pairing to the 3′ region of the miRNA, particularly pairing involving miRNA nucleotides 13–16, can supplement pairing to the seed region (Bartel, 2009) (Figure 5B), but this 3′-supplementary pairing has surprisingly little influence on site affinity and efficacy (Grimson et al., 2007, Wee et al., 2012, Salomon et al., 2015), and only about 5% of the seed-matched regulatory sites undergoing purifying selection appear to include this additional pairing (Friedman et al., 2009).
The preference for an A across from the first nucleotide of the miRNA, which is observed regardless of the miRNA sequence (Figure 5A), implies that the miRNA does not pair with this target nucleotide but rather a protein prefers an A at this position (Lewis et al., 2005). Indeed, structural analyses show that the first nucleotide of the miRNA is buried within Ago, unable to interact with the target (Ma et al., 2005, Parker et al., 2005), and that Argonaute has a pocket that specifically binds an A at this position (Schirle et al., 2015).
The seed was originally thought to be a segment of the miRNA that was fully preorganized and accessible for target recognition (Bartel, 2004). Structural results have revised this idea, revealing that a guide-RNA kink and steric barrier imposed by Ago α-helix 7 leaves only miRNA nucleotides 2–5 suitably preorganized and accessible for the target search (Elkayam et al., 2012, Nakanishi et al., 2012, Schirle and MacRae, 2012) (Figure 6A, step 1), and single-molecule analyses demonstrate that this sub-seed segment is indeed most critical for target association (Chandradoss et al., 2015, Salomon et al., 2015). As pairing propagates to miRNA nucleotides 6–8, α-helix 7 is displaced and assumes a conformation that reinforces perfect pairing to nucleotides 6 and 7 (Schirle et al., 2014, Klum et al., 2018) (Figure 6, step 2). Also in this region of the protein is a series of serine residues that undergo a phosphorylation cycle that promotes proper targeting (Golden et al., 2017, Quévillon Huberdeau et al., 2017).
The seed was given its name with the idea that it nucleated pairing that could then propagate to other parts of the miRNA (Bartel, 2004), and for the extensively paired sites that can be sliced, pairing is still commonly thought to grow from the seed, propagating 5′-to-3′ along the miRNA (Wang et al., 2009). However, this idea faces some serious topological obstacles, as tight binding of the seed region would keep this region stationary during propagation, and thus contiguous propagation would require challenging conformational gymnastics to enable the guide strand to wrap two full turns around the target strand (Bartel, 2009). Here, I propose an alternative model, which is more parsimonious with these topological considerations as well as the preferred region of 3′-supplementary pairing. In this model, instead of propagating directly from the seed helix, pairing skips from the seed region to a second nucleation region, generating an intermediate that resembles the conformation observed with most 3′-supplementary sites (Figure 6, step 3). Pairing then forms on both sides of this second helix. Because the seed region is fixed within its binding channel, it remains stationary while the second helix rotates around its helical axis to enable pairing to propagate between the two helices (from either or both directions) to include miRNA nucleotides 9–12 (Figure 6, step 4). As the second helix begins to rotate and as pairing propagates beyond miRNA nucleotides 17, the miRNA 3′ terminus is pulled from its binding pocket, thereby helping to free the second helix for further rotation (Figure 6, step 4). In such a model, pairing to the miRNA central region and the accompanying conformational change required to position the Argonaute active site for target slicing would occur only for targets with suitable pairing to both the seed and the miRNA 3′ region. Importantly, all four steps are reversible, which also provides a pathway for the stepwise release of the passenger strand after a miRNA duplex has been loaded.
Additional Regulatory Sites and Binding Sites
Noncanonical sites, which lack a contiguous 6 nt match to the seed region, can also mediate repression, although only one type of noncanonical site is known to both mediate readily detectable repression and yield a significant signal above background in global analyses of conservation. This is the 3′-compensatory site, for which extensive pairing to the miRNA 3′ region compensates for imperfect pairing to a core seed nucleotide (Figure 5C) (Bartel, 2009). These 3′-compensatory sites account for less than 1% of the preferentially conserved miRNA sites in mammalian mRNAs, presumably because their large size compared to the size of canonical sites makes them correspondingly more difficult to acquire and retain in mRNA sequences (Friedman et al., 2009). Nonetheless, a potential advantage of these larger sites is that their strong dependence on pairing to miRNA nucleotides falling outside the seed region offers a mechanism to avoid the regulatory redundancy often observed for co-expressed miRNAs from the same seed family. For example, the need to respond with temporal precision explains why the lin-41 mRNA of nematodes has two 3′-compensatory sites that respond specifically to the let-7 miRNA and not to earlier-expressed paralogous miRNAs with the same seed (Brennecke et al., 2005, Lewis et al., 2005, Ecsedi et al., 2015).
Although miRNAs primarily function at canonical sites, their binding appears more promiscuous. High-throughput analyses of mRNA segments that crosslink to Argonaute show that in cells miRNA silencing complexes bind to many sites that lack contiguous 6 nt matches to the seed region, such that in aggregate these noncanonical sites account for approximately half of the intracellular crosslinks (Chi et al., 2009, Hafner et al., 2010, Loeb et al., 2012, Helwak et al., 2013, Grosswendt et al., 2014). However, the noncanonical binding sites identified by crosslinking do not function to mediate detectable repression, in that mRNAs with only these sites are no more responsive to miRNA perturbation than are other mRNAs without canonical sites (Agarwal et al., 2015). Presumably each miRNA typically samples many low-affinity noncanonical sites before encountering a site with sufficient dwell time to enact repression, such that in aggregate it spends at least as much time at the abundant noncanonical sites as at the functional canonical ones (Agarwal et al., 2015, Denzler et al., 2016).
miRNA Target Abundance
By virtue of their small size, seed-matched regulatory sites can readily emerge in 3′ UTRs, enabling each miRNA to have a widespread influence on mRNA regulation and 3′-UTR evolution (Farh et al., 2005, Stark et al., 2005). Indeed, within human 3′ UTRs, each of the 90 broadly conserved miRNA families has an average of >300 7–8 nt sites under selective pressure to maintain their pairing to the miRNA, and this average exceeds 500 when including the 6 nt sites (Friedman et al., 2009). Some 3′ UTRs are targeted more than once by the same miRNA, and 3′ UTRs with a preferentially conserved site to one miRNA family usually have preferentially conserved sites to other families (with each conserved miRNA target preferentially conserving sites to an average of 4–5 different miRNA families). Yet despite this concentration of targeting of some miRNAs compared to others, the average number of conserved targets per family exceeds 400, and 3′ UTRs of 57.8% ± 3.0% (95% confidence interval) of the human mRNAs are conserved targets of miRNAs (Friedman et al., 2009). When also considering other preferentially conserved sites, such as those observed in ORFs, albeit at a lower density than in 3′ UTRs (Lewis et al., 2005, Stark et al., 2007b, Schnall-Levin et al., 2010), and those observed for more recently emergent miRNA families, the estimate of conserved targets confidently exceeds 60% of human mRNAs (Friedman et al., 2009). For nematodes and flies, the respective estimates are lower, at 27% and 37% of annotated mRNAs, respectively (Jan et al., 2011, Agarwal et al., 2017). Experiments that globally examine the molecular effects of perturbing a miRNA family not only confirm repression of many mRNAs with conserved sites but also reveal repression of even more mRNAs with non-conserved seed-matched sites (Krützfeldt et al., 2005, Lim et al., 2005, Giraldez et al., 2006, Rodriguez et al., 2007). Thus, it would seem difficult for a developmental or physiological process to escape the influence of miRNAs.
Although low-affinity noncanoncial sites contribute little to repression, they do contribute, by virtue of their sheer numbers, to the effective number of sites in the cell that compete for miRNA binding. As a result, the effective target abundance (defined as the number of sites that must be added to achieve half-maximal de-repression of targets) in each cell typically exceeds the miRNA abundance—even for very highly expressed miRNAs such as miR-122 in hepatocytes, which is present at 120,000 molecules per cell, yet has an even greater effective target abundance (Denzler et al., 2014, Denzler et al., 2016).
The high effective target abundances have important ramifications. For example, with so many sites competing for miRNA binding, few if any sites are saturated, and thus even the most repressed targets are sensitive to increased miRNA levels. High effective target abundances also refute the plausibility of the competing endogenous RNA (ceRNA) hypothesis (Denzler et al., 2014, Denzler et al., 2016), which posits that changes in the expression of individual cellular targets commonly and perceptibly influence the repression of other cellular targets by modulating the amount of free miRNA (Salmena et al., 2011). Moreover, the high effective target abundances explains why lowly expressed miRNAs have negligible effects, which in turn seriously challenges the idea that the miRNAs delivered by exosomes might reach consequential levels in recipient cells.
miRNA Target-Prediction Strategies
Numerous algorithms have been developed to predict the regulatory targets of miRNAs, with at least 25 sets of target predictions from at least 16 different groups available for the human miRNAs (Bartel, 2009, Agarwal et al., 2015). To reliably predict targets of a miRNA, the first step is to search for 7–8 nt canonical sites in annotated 3′ UTRs. On its own, this simple text-search step generates predictions that are more responsive to the miRNA than those of some of the popular algorithms (Agarwal et al., 2015). However, not all mRNAs with a 7–8 nt canonical site respond to the miRNA (Baek et al., 2008, Selbach et al., 2008), and thus additional strategies that further differentiate among these predictions can add value. One such strategy is to choose 3′ UTRs with sites that appear to have been selectively conserved in evolution (Bartel, 2009), as mRNAs with conserved sites to a broadly conserved miRNA are about twice as likely to respond to the loss of that miRNA than those with poorly conserved sites (Baek et al., 2008). However, even when imposing a conservation cutoff, the lists of predicted targets typically include hundreds of unique mRNAs for each highly conserved miRNA family. If these lists are too large for experimental follow-up, one option is to apply a more stringent conservation cutoff to prioritize those with the greatest probability of conserved targeting, which tend to be even more responsive to the miRNA than mRNAs with lower probability of conserved targeting (Friedman et al., 2009).
Despite utility of prioritizing conserved targets, some of the most responsive targets are not among those with the most confidently detected site conservation. Moreover, the miRNAs that emerged more recently, including those conserved no further than placental mammals, have too few targets under selective conservation for any of the conserved targets to be confidently predicted above the background (Friedman et al., 2009). Thus, to supplement and complement the analysis of conservation, quantitative models have been developed that use the features of the sites and their contexts within 3′ UTRs to predict targeting efficacy. Features of sites that correlate with efficacy include the type of site (Figure 5A), the number of sites, the potential for 3′-supplementary pairing, and site context (Grimson et al., 2007). With respect to site context, the more effective sites tend to be near the edges of UTRs (but ≥15 nt downstream of the stop codon to avoid displacement of the silencing complex by the ribosome) and within regions predicted to be more structurally accessible (Bartel, 2009). Also correlating with efficacy are features of the miRNA, including its predicted seed-pairing stability and the number of its transcriptome sites competing for binding (Garcia et al., 2011), and features of the mRNAs, including UTR length, ORF length, presence of alternative 3′ UTR isoforms, and presence of additional marginal sites in 3′ UTRs and ORFs (Agarwal et al., 2015). Quantitative models that parameterize informative features and then train on high-throughput measurements of miRNA effects are becoming increasingly accurate at predicting the mRNAs most responsive to miRNA perturbation (Agarwal et al., 2015). Indeed, the most accurate of these quantitative models is as informative as the best high-throughput in vivo crosslinking approaches, even after filtering the crosslinking results to include only mRNAs with a canonical site to the miRNA (Agarwal et al., 2015). Despite matching the utility of the crosslinking methods, the computational approach still has room for improvement, with both false positives and false negatives. Nonetheless, because the experimental approach also yields false positives and false negatives, the computational model and experimental approach happen to perform similarly.
miRNA Biological Functions
Biological Importance of miRNAs
Systematic generation and analysis of miRNA knockouts in C. elegans created the initial impression that most metazoan miRNAs—even most miRNAs that are broadly conserved—have only subtle or redundant biological functions (Miska et al., 2007, Alvarez-Saavedra and Horvitz, 2010). Excluding genes that fall within the lin-4, let-7, and lsy-6 miRNA families, which already had genetically informed functions (Lee et al., 1993, Reinhart et al., 2000, Johnston and Hobert, 2003), (and also excluding mutations of 13 loci that corresponded to false-positive miRNA annotations), this effort examined 83 miRNA genes representing 48 seed families (including 38 families that are both conserved in other nematodes and for which every known member was simultaneously disrupted). Despite the scale of the effort, abnormal phenotypes were initially observed for mutants in only 5 of the 48 families (miR-35, miR-51, miR-58, miR-240, and miR-786) (Miska et al., 2007, Alvarez-Saavedra and Horvitz, 2010). With more thorough analyses, abnormal phenotypes have since been observed for mutants in another seven of these families when grown under normal lab conditions (miR-34, miR-71, miR-79, miR-83, miR-238, miR-246, miR-259) (de Lencastre et al., 2010, Boulias and Horvitz, 2012, Pedersen et al., 2013, Burke et al., 2015), and for mutants in another four when examined in the context of stress conditions (miR-63, miR-67, miR-86, and miR-231) (Nehammer et al., 2015, Yang et al., 2016, Ma et al., 2017). In addition, an abnormal phenotype has been observed after disrupting a member of a family not examined in the initial effort (miR-791) (Drexel et al., 2016). However, for most miRNA families disrupted in C. elegans, abnormal phenotypes have either not been detected or have been detected only in the context of sensitized genetic backgrounds (Brenner et al., 2010).
Interestingly, this notion that most individual miRNA families have only subtle or redundant biological functions seems to apply uniquely to nematodes, in that a very different conclusion has emerged for both flies and mammals. In a systematic analysis of miRNA knockout phenotypes in flies, 69 of 88 knockouts examined (78%) had severe mutant phenotypes scored with >99% confidence (Chen et al., 2014), and this fraction increased slightly to 65 of 81 (80%) when considering knockouts of only canonical miRNAs conserved at least throughout flies and to 23 of 28 (82%) for the subset of mutants that disrupt a miRNA in a family conserved since the ancestor of bilaterian animals.
With respect to mice, knockouts have been reported for members of 20 of the 27 mammalian miRNA families conserved since the bilaterian ancestor, and for each of these 20, abnormal phenotypes have been observed (Table 1). Abnormal knockout phenotypes also have been reported for another 36 miRNA families conserved at least to fish (Table 2). Thus, of the 90 mammalian miRNA families conserved to fish, 56 (62%) have been associated with an abnormal knockout phenotype. Among these 56, 4 have abnormalities observed only in the context of a disease or injury model (miR-130/301, miR-139, miR-192/215, and miR-214; Table 2). However, for the other 52, a broad diversity of defects are observed for mice examined under normal lab conditions, often with severe consequences. For example, mutants in 15 families have significantly reduced viability manifested either in utero or prior to weaning, and others cause fatal neurological disorders, infertility, blindness, deafness, immune disorders, or cancer (Tables 1 and 2). Analyses of the effects of simultaneously disrupting miRNAs from multiple families (typically expressed from polycistronic loci) attribute functions to as many as 13 additional miRNA families conserved at least to fish (Table 3). Thus, of the 90 mammalian miRNA families conserved to fish, 69 (77%) have been associated with an abnormal knockout phenotype, and many of the remaining 21 have yet to be examined.
Table 3Abnormal Phenotypes Observed in Mice after Knocking Out miRNAs Representing Multiple miRNA Families | ||
miRNA families (conservation, # of genes) | miRNA(s) removed | Phenotype |
miR-1/206 (bilat. 3), miR-133 (bilat. 3) | miR-1-1∼133a-2, miR-1-2∼133a-1 | Early embryonic lethality, complete penetrance, due to severe heart malformations (Wystub et al., 2013) |
miR-96 (bilat. 1), miR-183 (bilat. 1), miR-182 (vert. 1) | miR-183∼96∼182 | Retinal degeneration following progressive synaptic defects of photoreceptors; impaired balance (Lumayag et al., 2013); decreased corneal nerve density; increased neutrophil activity; decreased disease with Pseudomonas infection of the cornea (Muraleedharan et al., 2016) |
miR-96 (bilat. 1), miR-183 (bilat. 1) | miR-183∼96 | Defects in retinal cone development, progressive loss of photoreceptor fuction, retinal degeneration; impaired balance; impaired olfactory function (Xiang et al., 2017) |
miR-193 (bilat. 2), miR-365 (bilat. 2) | miR-193b∼365-1 | Increased mammary stem/progenitor cells; accelerated differentiation of mammary epithelium during puberty and pregnancy (Yoo et al., 2014) |
miR-25/32/92/363/367 (bilat. 7), miR-17/20/93/106 (vert. 6), miR-18 (vert. 2), miR-19 (vert. 3) | miR-17∼18∼19a∼20a∼19b-1∼92a-1 | Perinatal lethality, complete penetrance, due to lung and heart defects; vertebral homeotic transformations and other skeletal defects; reduced body weight; reduced pre-B cells (Ventura et al., 2008, Han et al., 2015); conditional deletion perturbs development and/or function in each of 18 different cell types |
miR-25/32/92/363/367 (bilat. 7), miR-17/20/93/106 (vert. 6), miR-18 (vert. 2), miR-19 (vert. 3) | miR-17∼18∼19a∼20a∼19b-1∼92a-1, miR-106b∼93∼25 | Embryonic lethality at midgestation, complete penetrance (Ventura et al., 2008) |
miR-25/32/92/363/367 (bilat. 7), miR-17/20/93/106 (vert. 6) | miR-106b∼93∼25 | Enhanced adiposity leading to insulin resistance (Cioffi et al., 2015); cardiac arrhythmia (Chiang et al., 2014) |
miR-23 (vert. 2), miR-24 (vert. 2), miR-27 (vert. 2) | miR-23a∼27a∼24-2, miR-23b∼27b∼24-1 | Hypersensitive T cells; increased T helper cell responses and tissue pathology in an immune disease model (asthma) (Pua et al., 2016); decreased hematopoietic stem cells and progenitors; skewed hematopoiesis, with B cells produced at the expense of myeloid cells (Kurkewich et al., 2016, Kurkewich et al., 2017) |
miR-200bc/429 (bilat. 3), miR-200a/141 (vert. 2) | miR-200c∼141 | Impaired enamel formation (Cao et al., 2013) |
miR-200bc/429 (bilat. 3), miR-200a/141 (vert. 2) | miR-200b∼200a∼429 | Adipocyte-specific knockout causes increased obesity with high-fat diet (Tao et al., 2016) |
miR-143 (vert. 1), miR-145 (vert. 1) | miR-143∼145 | Arterial smooth muscle defects with perturbed actin stress fibers; increased lethality and less scarring in response to vascular injury (Elia et al., 2009, Xin et al., 2009); swelling of the kidney due to impaired ureter function (Medrano et al., 2014); reduced intraocular pressure (Li et al., 2017); altered pathology in several disease/injury models |
miR-144 (vert. 1), miR-451 (vert. 1) | miR-144∼451 | Increased incidence of spontaneous B lymphoma and acute myeloid leukemia in aged mice (Ding et al., 2018) |
miR-216a (bilat. 1), miR-216b (vert. 1), miR-217 (vert. 1) | miR-216b∼216a∼217 | Embryonic lethality at about E9.5, complete penetrance (Azevedo-Pouly et al., 2017) |
miR-199 (vert. 3), miR-214 (vert. 1) | miR-199a-2∼214 | Neonatal and postnatal lethality, partial penetrance, smaller body size, skeletal abnormalities, partial paralysis (Watanabe et al., 2008) |
miR-291a/294/302abd (vert. 5), 4 other less broadly conserved families | miR-290∼291a∼292∼291b∼293∼294∼295 | Embryonic lethality, partial penetrance; female sterility with ovarian failure due to defects in primordial germ cell migration (Medeiros et al., 2011); smaller placenta with defective maternal–fetal transport (Paikari et al., 2017) |
miR-291a/294/302abd (vert. 5), miR-302c (vert. 1) | miR-302b∼302c∼302a∼302d | Late embryonic lethality, complete penetrance; failure of neural tube closure, with precocious neuronal differentiation and increased proliferation of neural progenitors; abnormal eye development (Parchem et al., 2015) |
miR-291a/294/302abd (vert. 5), miR-302c (vert. 1), 4 other less broadly conserved families | miR-290∼291a∼292∼291b∼293∼294∼295, miR-302b∼302c∼302a∼302d | Early embryonic lethality, complete penetrance (Parchem et al., 2015) |
miR-208 (vert. 2), miR-499 (vert. 1) | miR-208b, miR-499 | Reduced proportion of slow myofibers in skeletal muscle (van Rooij et al., 2009) |
miR-15/16/195/322/497 (vert. 7), miR-503 (euth. 1) | miR-322∼503 | Defects in mammary gland involution after weaning; increased body weight due to more white fat (Llobet-Navas et al., 2014); increased tumorigensis of breast cancer in sensitized background (Rodriguez-Barrueco et al., 2017); increased angiogenic response to inflammation (Lee et al., 2017) |
miR-15/16/195/322/497 (vert. 7), miR-503 (euth. 1), miR-351 (p.c. 1) | miR-322∼503∼351 | Deletion in cartilage causes perinatal lethality, partial penetrance, due to tracheal defects (observed specifically in pure C57BL/6N genetic background) (Bluhm et al., 2017) |
miR-431 (euth. 1), miR-433 (euth. 1), miR-127 (euth. 1), miR-434 (p.c. 1) | miR-431∼433∼127∼434 | Note that the following phenotypes are observed only when inheriting deletion from maternal allele (part of paternally imprinted cluster expressed from only maternal allele). Either neonatal lethality or slower growth after weaning, depending on genetic background; enlarged placenta with defects (Sekita et al., 2008) |
Cluster of 36 genes from 35 families; all but 2 of the 35 families have only a single murine gene (exceptions, miR-381/539 and miR-329/362); 27 families conserved among eutheria, the other 8 conserved less broadly | miR-379∼411∼299∼380∼1197∼323∼758∼329∼494∼679∼1193∼666∼543∼495∼667∼376c∼376b∼376a∼300∼381∼487b∼539∼544∼382∼134∼668∼485∼154∼496a∼377∼541∼409∼412∼369∼410∼3072 | Note that the following phenotypes are observed only when inheriting deletion from maternal allele (paternally imprinted cluster expressed from only maternal allele). Neonatal lethality, partial penetrance; transient postnatal growth retardation; postnatal metabolic defects, including inefficient mobilization of glycogen stores (Labialle et al., 2014); increased anxiety-related behavior in adult mice (Marty et al., 2016) |
View Table in HTML
For each miRNA family, its conservation is listed in parentheses, together with the number of murine genes that contribute to that family. The conservation categories are bilaterian (bilat., conserved since the bilaterian ancestor), vertebrate (vert., conserved among mammals and fish but emerged since the bilaterian ancestor), eutherian (euth., conserved among placental mammals but not to fish), and poorly conserved (p.c., not conserved among placental mammals). Each strain has more than one miRNA gene disrupted; multiple miRNAs from a disrupted polycistronic locus are linked with a tilde (∼), whereas those from distant loci are separated with a comma.
Why might broadly conserved miRNAs be less consequential in nematodes than in either flies or mammals? Perhaps much of nematode development is too rapid for optimal miRNA regulation. Mammalian miRNAs often take at least a day to reach the levels needed for consequential regulation, but C. elegans larvae hatch, with many cells already terminally differentiated, only 14 hr after fertilization. Regardless of the reason for this reduced role, the more frequent loss in nematodes of otherwise broadly conserved miRNA families supports the idea that in nematodes other regulatory processes have exapted ancestral miRNA functions. For example, the C. elegans lineage has lost 8 of the 31 miRNA families that non-nematode lineages have inherited from the bilaterian ancestor (the miR-7, -33, -96, -153, -184, -210, -219, and -278 families), and as many as 15 of the 31 have been lost in the nematode lineage that gave rise to Globodera pallida (Fromm et al., 2015, Sarkies et al., 2015). In contrast, the Drosophila lineage has lost only 2 of the 31 (miR-153 and miR-10, with retention of alternative miR-10 isoforms), and the mammalian lineage has lost only 4 (miR-76, -252, -278, and -281). The seemingly reduced role for miRNAs in nematodes makes the original discovery of the lin-4 RNA all the more impressive.
Although abnormal knockout phenotypes have revealed functions for most of the mammalian miRNA families conserved to fish, less is known of the biological functions of more recently emerged mammalian miRNAs. In humans, 96 miRNA genes from 88 families are conserved among placental mammals but not to fish (Table S1). Genes for a third of these families fall within two paternally imprinted miRNA clusters, which when disrupted in mouse, yield phenotypes typical of imprinted genes, including defective placental development, viability/growth defects, and anxiety-related behavior (Sekita et al., 2008, Labialle et al., 2014, Marty et al., 2016) (Table 3). Among the others, six have been individually disrupted (each the sole member of its family), and four have abnormal phenotypes (miR-188, miR-378, miR-486, and miR-505) (Carrer et al., 2012, Li et al., 2015, Yang et al., 2017, Jee et al., 2018), whereas two are reported to have no observable phenotype (miR-296 and miR-335) (Park et al., 2012, Hiramuki et al., 2015).
For the hundreds of other human miRNAs not conserved to mouse, confidently inferring function without the possibility of a mouse model is difficult. Indeed, even for the 88 pan-mammalian miRNA families, mouse models might be of more limited utility. These pan-mammalian miRNAs each have an average of only ∼11 targets conserved above background, which is far less than the average of >400 found for each of the more broadly conserved miRNAs (Friedman et al., 2009). Therefore, any abnormal phenotypes observed for knockouts of these miRNAs in mouse are more likely to be attributable to non-conserved targeting and thus less likely to report on their functions in humans.
One difference between flies and mice is that in flies abnormal phenotypes are usually apparent even when only one member of a conserved family is disrupted (Chen et al., 2014), whereas in mice, redundancy among family members often requires disruption of multiple family members before phenotypes become apparent. Thus, when considering some of the broadly conserved families for which only relatively subtle abnormalities have been reported, such as let-7 and miR-10 (Table 1), more severe abnormalities are expected when additional family members are simultaneously disrupted. In both flies and mice, animals heterozygous for a disrupted miRNA gene typically appear normal—the most notable exceptions being for mutations at the miR-96 and miR-17∼92 loci, which each cause haploinsufficient abnormalities in both mice and humans, with loss of one copy of miR-96 causing deafness (Lewis et al., 2009, Mencía et al., 2009) and loss of one copy of the miR-17∼92 miRNA cluster causing skeletal anomalies, growth defects, and learning disabilities (de Pontual et al., 2011, Fiori et al., 2015, Han et al., 2015).
Although the impact or full impact of many conserved miRNA families has yet to be explored, the results to date already implicate mammalian miRNAs in diverse developmental, cellular, and physiological processes. As mentioned in the opening paragraph of this review, many miRNAs are required for proper development, including that of the skeleton, teeth, brain, eyes, neurons, muscle, heart, lungs, kidneys, vasculature, liver, pancreas, intestine, skin, fat, breast, ovaries, testes, placenta, thymus, and each hematopoietic lineage (Tables 1, 2, and 3). In addition, many influence cellular phenomena and functions, including axon sprouting, synapse formation and function, mitotic spindle orientation, polyploidization, ciliogenesis, and diverse functions in various hematopoietic lineages (Tables 1, 2, and 3). Moreover, many miRNAs influence physiological processes, including cardiac conduction, blood pressure, lipid or cholesterol metabolism, insulin production, pituitary function, mobilization of glycogen, Aβ protein degradation, bone resorption, fibrosis, and the overall growth of embryos or pups (Tables 1, 2, and 3). In addition, many miRNA knockout strains have differential responses to models of neuronal, cardiac, muscular, pulmonary, vascular, endocrinological, hepatic, intestinal, renal, immunological, or metabolic diseases or injuries, and some have altered susceptibility to fungal or bacterial infections or altered propensity to develop tumors in cancer models (Tables 1, 2, and 3).
Sculptors of the Transcriptome
The reason that metazoan miRNAs are so important for mammals, flies, and presumably most other animals, is that they add a dense layer of posttranscriptional gene control that complements and extends the regulation occurring at other levels of the gene-expression program. The rates of mRNA transcription, processing, and transport determine the identities and amounts of mRNAs that enter the cytoplasm. Once in the cytoplasm, the mRNAs encounter the miRNA milieu of the cell and undergo different shades of repression, depending on the identities and activities of the miRNAs in the cell, the number of sites to these co-expressed miRNAs, and the efficacies of these sites. For each site, the repression is typically quite modest—usually less than 50% and often less than 20% (Baek et al., 2008, Selbach et al., 2008). However, multiple sites in the same UTR can add up to much more substantial repression, as each additional site tends to act additively, and some sites, particularly those within about 8–40 nt from each other, act cooperatively (Grimson et al., 2007, Saetrom et al., 2007). Because the miRNA milieu differs between cell types and over the course of development, the animal can achieve complex gene-expression patterns with relatively simple promoters for both the mRNAs and the miRNAs.
In this view, the metazoan miRNAs can be thought of as the sculptors of the transcriptome. Transcription and other nuclear events set up a column of gene expression, and then the miRNAs, like a stone sculptor, chip away at this column. Occasionally they chip away enough to either trigger or sharpen a developmental transition, but more generally they produce a much more complex topology of gene expression, with more optimal levels of many proteins in each cell of each tissue.
As participants in many selectively conserved regulatory interactions (e.g., >40,000 in mammals) (Friedman et al., 2009), miRNAs have been extensively integrated within gene-regulatory networks—of potential utility in any regulatory architecture with a repressive component. With this modular, combinatorial regulatory pathway at its disposal, evolution can more easily generate complex, highly connected regulatory networks required to robustly cope with environmental, genetic, or stochastic contingencies (Hornstein and Shomron, 2006). Indeed, as might be expected for transcription factors or any other component of these networks, deletion of miRNAs can reduce tolerance to environmental challenges, genetic perturbations, injury, or disease (Ebert and Sharp, 2012, Mendell and Olson, 2012, Cassidy et al., 2016, Kasper et al., 2017) (Tables 1, 2, and 3).
The regulatory and developmental robustness conferred by miRNAs stems from their integration within larger regulatory networks and not a supernatural ability to preferentially repress mRNAs that are stochastically too abundant in a cell. Nonetheless, as with other posttranscriptional processes that reduce the protein output from each mRNA, miRNA-mediated repression, when coupled with a compensatory increase in transcription, can in principle decrease the cell-to-cell variability (i.e., the noise) of gene expression (Bartel and Chen, 2004). On the other hand, adding a regulatory component, such as a miRNA, to a system also adds some intrinsic noise, due to variable expression of that component (Schmiedel et al., 2015). Thus, depending upon whether the decreased noise imparted by increased transcription can overcome the increased noise imparted by adding the miRNA component, the net effect of adding the miRNA component varies, with lowly expressed genes tending to undergo a net decrease in noise and highly expressed genes tending to undergo a net increase (Schmiedel et al., 2015).
Functions of Individual Interactions
The analyses of miRNA knockouts, which have greatly expanded the known functions of mammalian miRNAs (Tables 1, 2, and 3), often investigate the effects of the miRNAs on predicted targets and sometimes implicate the de-repression of just one or two targets as the primary cause of an abnormal phenotype. The most definitive approach for isolating and confirming the importance of a particular target is to disrupt the miRNA-binding site(s) within the endogenous target gene and examine the extent to which the resulting de-repression of that target phenocopies the miRNA knockout (Dorsett et al., 2008, Teng et al., 2008, Ecsedi et al., 2015, Lu et al., 2015a, Drexel et al., 2016, McJunkin and Ambros, 2017, Mildner et al., 2017). With the development of more convenient genome-editing methods, this approach is gaining popularity and yielding insights that would be difficult to obtain by previous approaches, showing, for instance, that a single miRNA target site can be essential for viability (McJunkin and Ambros, 2017).
As the compendium of phenotypic effects of individual target interactions grows, so will a more robust molecular understanding of miRNA biological functions. However, at least for the foreseeable future, these experiments can query only a small sampling of the biological targets of each miRNA. Thus, the focus will presumably remain on targets considered most likely to help explain the miRNA-knockout phenotype, as judged using site conservation, repression magnitude, and known or suspected functions of the targets.
In addition to its low throughput, the approach of disrupting individual interactions faces other limitations. A major challenge is to design phenotypic analyses with the sensitivity needed to uncover the functional roles of many of these individual interactions. This challenge stems from the modest repression conferred by most interactions, the potential buffering effect of interactions occurring within complex regulatory networks with bifurcating pathways and feedback control that enable robust response despite a defective edge in the network, and the permissiveness of standard lab conditions, which have been developed to preserve and propagate mutant lines rather than to simulate life in the wild (Bartel, 2009). In contrast, comparative sequence analysis, the approach that reveals the large scope of biological targets, discerns the results of the big experiment (i.e., biological evolution) that has been operating over many millions of years at the level of populations evolving in the wild. Because a very subtle fitness disadvantage—even one contingent on an intermittent environmental or genetic stress—can prevent alleles with mutated regulatory sites from becoming fixed in a population, purifying selection is an exquisitely sensitive indicator of biological function. Although comparable sensitivity will never be achieved in the lab, the innovative strategies that have revealed additional phenotypes for miRNA knockouts in the contexts of environmental/genetic stress or injury/disease models (Mendell and Olson, 2012, Cassidy et al., 2016, Kasper et al., 2017) (Tables 1, 2, and 3) should also be productive when examining the effects of disrupting individual targeting interactions.
Concluding Remarks
After the seminal discovery of the lin-4 miRNA 25 years ago, the next big surprise was the abundance of this class of regulatory RNAs in animals, a finding that was soon followed by the surprising abundance of their regulatory targets. The more recent surprises have included the molecular consequences of miRNA-mediated repression in post-embryonic cells (i.e., the dominance of mRNA destabilization as compared to translational repression), as well as preponderance of fly and mouse miRNA families with abnormal knockout phenotypes. The next surprise? Time will tell, but based on their past record, miRNAs will not disappoint.