N Nature Genetics · Dec 03, 2025 ADAR1 editing is necessary for only a small subset of cytosolic dsRNAs to evade MDA5-mediated autoimmunity Endogenous long double-stranded RNAs (dsRNAs), which are not edited by the RNA editing enzyme ADAR1, may activate the antiviral dsRNA receptor MDA5 to trigger interferon-mediated immune responses. Among the large number of endogenous long dsRNAs, the key substrates that activate MDA5—termed as immunogenic dsRNAs—remain largely unidentified. Here we reveal that human immunogenic dsRNAs constitute a surprisingly small fraction of all cellular dsRNAs. We found that these immunogenic dsRNAs were highly enriched in mRNAs and depleted of introns, consistent with their role as cytosolic MDA5 substrates. We validated the MDA5-dependent immunogenicity of these dsRNAs, which was dampened following ADAR1-mediated RNA editing. Notably, immunogenic dsRNAs were enriched at genetic susceptibility loci associated with common inflammatory diseases, implying their functional importance. We anticipate that a focused analysis of immunogenic dsRNAs will enhance our understanding and treatment of cancer and inflammatory diseases, where the roles of dsRNA editing and sensing are increasingly recognized. Immunogenetics Transcriptomics biology
N Nature Genetics · Nov 27, 2025 Passenger mutations link cellular origin and transcriptional identity in human lung adenocarcinomas DNA damage is preferentially repaired in expressed genes; thus, genome-wide correlations between somatic mutation patterns and normal cell transcription may reflect tumor cell origins. Accordingly, we found that aggregate lung adenocarcinoma (LUAD) and squamous cancer (LUSC) somatic mutation density associated most strongly with distal (alveolar) and proximal (basal) lung cell-type-specific gene expression, respectively, consistent with presumed LUAD and LUSC cell origins. Analyzing individual genomes, 21% of LUADs bore mutational footprints of proximal airway origins, with 38% classified as ambiguous. Distal origin LUADs, enriched forKRASandSTK11drivers, occurred mainly in smokers; proximal origin LUADs, enriched forEGFRdrivers, were more common in never-smokers. Ambiguous origin LUADs showed APOBEC signatures andSMARCA4alterations.TP53mutant LUADs with non-distal cell origins preferentially exhibited non-distal transcriptional identity. Our study reveals a complex interplay between lineage and identity in LUAD evolution and offers a scalable strategy to infer tumor origins in human cancers. Genomics Non-small-cell lung cancer Transcriptomics biology
N Nature Genetics · Nov 27, 2025 A genome-wide association study of mass spectrometry proteomics using a nanoparticle enrichment platform Most studies to date of protein quantitative trait loci (pQTLs) have relied on affinity proteomics platforms, which provide only limited information about the targeted protein isoforms and may be affected by genetic variation in their epitope binding. Here we show that mass spectrometry (MS)-based proteomics can complement these studies and provide insights into the role of specific protein isoform and epitope-altering variants. Using the Seer Proteograph nanoparticle enrichment MS platform, we identified and replicated new pQTLs in a genome-wide association study of proteins in blood plasma samples from two cohorts and evaluated previously reported pQTLs from affinity proteomics platforms. We found that >30% of the evaluated pQTLs were confirmed by MS proteomics to be consistent with the hypothesis that genetic variants induce changes in protein abundance, whereas another 30% could not be replicated and are possibly due to epitope effects, although alternative explanations for nonreplication need to be considered on a case-by-case basis. Genetics research Genome-wide association studies Proteomics biology
N Nature Genetics · Nov 24, 2025 Proteome-wide model for human disease genetics Missense variants remain a challenge in genetic interpretation owing to their subtle and context-dependent effects. Although current prediction models perform well in known disease genes, their scores are not calibrated across the proteome, limiting generalizability. To address this knowledge gap, we developed popEVE, a deep generative model combining evolutionary and human population data to estimate variant deleteriousness on a proteome-wide scale. popEVE achieves state-of-the-art performance without overestimating the burden of deleterious variants and identifies variants in 442 genes in a severe developmental disorder cohort, including 123 novel candidates. These genes are functionally similar to known disease genes, and their variants often localize to critical regions. Remarkably, popEVE can prioritize likely causal variants using only child exomes, enabling diagnosis even without parental sequencing. This work provides a generalizable framework for rare disease variant interpretation, especially in singleton cases, and demonstrates the utility of calibrated, evolution-informed scoring models for clinical genomics. Computational biology and bioinformatics Genetic association study Neurodevelopmental disorders biology
N Nature Genetics · Nov 24, 2025 Spatially resolved multi-omics of human metabolic dysfunction-associated steatotic liver disease Metabolic dysfunction-associated steatotic liver disease (MASLD) is a leading cause of chronic liver disease worldwide. We generated single-cell and spatial transcriptomic and metabolomic maps from 61 human livers, including controls (n= 10), metabolic dysfunction-associated steatotic liver (MASL) (n= 17) and metabolic dysfunction-associated steatohepatitis (MASH) (n= 34). We identified microphthalmia-associated transcription factor (MITF) as a key regulator of the lipid-handling capacity of lipid-associated macrophages (LAMs), and further revealed a hepato-protective role of LAMs mediated through hepatocyte growth factor secretion. Unbiased deconvolution of spatial transcriptomics delineated a fibrosis-associated gene program enriched in advanced MASH, suggesting profibrotic crosstalk between central vein endothelial and hepatic stellate cells within fibrotic regions. Mass spectrometry imaging-based spatial metabolomics demonstrated MASLD-specific accumulation of phospholipids, potentially linked to lipoprotein-associated phospholipase A2-mediated phospholipid metabolism in LAMs. This spatially resolved multi-omics atlas of human MASLD, which can be queried at theHuman Masld Spatial Multiomics Atlas, provides a valuable resource for mechanistic and therapeutic studies. Gene expression Liver diseases biology
N Nature Genetics · Nov 20, 2025 Scalable and accurate rare variant meta-analysis with Meta-SAIGE Meta-analysis enhances the power of rare variant association tests by combining summary statistics across several cohorts. However, existing methods often fail to control type I error for low-prevalence binary traits and are computationally intensive. Here we introduce Meta-SAIGE—a scalable method for rare variant meta-analysis that accurately estimates the null distribution to control type I error and reuses the linkage disequilibrium matrix across phenotypes to boost computational efficiency in phenome-wide analyses. Simulations using UK Biobank whole-exome sequencing data show that Meta-SAIGE effectively controls type I error and achieves power comparable to pooled individual-level analysis with SAIGE-GENE+. Applying Meta-SAIGE to 83 low-prevalence phenotypes in UK Biobank and All of Us whole-exome sequencing data identified 237 gene–trait associations. Notably, 80 of these associations were not significant in either dataset alone, underscoring the power of our meta-analysis. Bioinformatics Genetics research Genome-wide association studies Software Genetics Genomics Human Clinical
N Nature Genetics · Nov 18, 2025 Adenine DNA methylation associated with transcriptionally permissive chromatin is widespread across eukaryotes DNA methylation is a key regulator of eukaryotic genomes, most commonly through 5-methylcytosine (5mC). In contrast, the existence and function ofN6-methyladenine (6mA) in eukaryotes have been controversial, with conflicting reports resulting from methodological artifacts. Nevertheless, some unicellular lineages, including ciliates, early-branching fungi and the algaChlamydomonas, show robust 6mA signals, raising questions about their origin and evolutionary role. Here we apply Oxford Nanopore sequencing to profile 6mA at base-pair resolution across 18 unicellular eukaryotes representing all major supergroups. We find that robust 6mA patterns occur only in species that encode the adenine methyltransferase AMT1. Notably, 6mA consistently accumulates downstream of transcriptional start sites, positioned between H3K4me3-marked nucleosomes, indicating a conserved association with transcriptional activation. Our results support the idea that the last eukaryotic common ancestor had a dual methylation system, with transcription-linked 6mA and repressive 5mC, which has been repeatedly simplified in both multicellular and unicellular lineages through the loss of the AMT1 pathway. Epigenetics Epigenomics Gene regulation biology
N Nature Genetics · Nov 18, 2025 Disentangling the architectural and non-architectural functions of CTCF and cohesin in gene regulation Cohesin- and CTCF-mediated chromatin loops facilitate enhancer–promoter and promoter–promoter interactions, but their impact on global gene regulation remains debated. Here we show that acute removal of cohesin or CTCF in mouse cells dysregulates hundreds of genes. Cohesin depletion primarily downregulates CBP/p300-dependent putative enhancer targets, whereas CTCF loss both up- and downregulates enhancer targets. Beyond loop anchoring, CTCF directly modulates transcription, acting as an activator or repressor depending on its binding position and orientation at promoters. Mechanistically, when activating, CTCF increases DNA accessibility and promotes RNA polymerase II recruitment; when repressing, it prevents RNA polymerase II binding without altering chromatin accessibility. Promoter-bound CTCF activates housekeeping genes essential for cell proliferation. CTCF’s transcriptional activation function—but not its loop anchoring role—is shared with its vertebrate-specific paralog, CTCFL. These findings reconcile architectural and non-architectural roles of cohesin and CTCF, offering a unified model for their functions in enhancer-dependent and enhancer-independent transcription control. Epigenetics Gene regulation biology mouse experiments
N Nature Genetics · Nov 17, 2025 African-ancestry-specific variant IKKβ p.Glu502Lys confers high lupus risk Cutaneous lupus erythematosus (CLE) is an autoimmune disease of the skin, occurring with or without systemic lupus erythematosus (SLE). People with African ancestry have a higher risk than people with other ancestries of developing lupus1but have been underrepresented in genetic studies. We whole-genome-sequenced 27,820 Americans with genetically inferred African ancestry from the Diverse Ancestry Cohort, including people with CLE (n= 211) and/or SLE (n= 574). We discovered an association with a rare missense variant inIKBKB, rs115698972G>A, IKKβE502K, exclusive to people with African ancestry, conferring an odds ratio (OR) of 5.4 for CLE and 3.3 for SLE. These associations replicated in the All of Us and VA Million Veteran Research Programs for CLE (ORmeta= 3.8,Pmeta= 5.3 × 10−20,n= 1,243) and SLE (ORmeta= 3.2,Pmeta= 1.0 × 10−19,n= 1,697). In this cohort, IKKβE502Kaccounts for 10.4% of CLE cases and 6.4% of SLE cases, confers a high lupus risk, and contributes substantially to the disease prevalence among people with African ancestry. This highlights the value of including diverse ancestries in genetic association studies. Genome-wide association studies Systemic lupus erythematosus biology
N Nature Genetics · Nov 14, 2025 Uniform dynamics of cohesin-mediated loop extrusion in living human cells Most animal genomes are partitioned into topologically associating domains (TADs), created by cohesin-mediated loop extrusion and defined by convergently oriented CCCTC-binding factor (CTCF) sites. The dynamics of loop extrusion and its regulation remain poorly characterized in vivo. Here we tracked the motion of TAD anchors in living human cells to visualize and quantify cohesin-dependent loop extrusion across multiple endogenous genomic regions. We show that TADs are dynamic structures whose anchors are brought in proximity about once per hour and for 6–19 min (~16% of the time). Moreover, TADs are continuously extruded by multiple cohesin complexes. Remarkably, despite strong differences in Hi-C patterns across chromatin regions, their dynamics is consistent with the same density, residence time and speed of cohesin. Our results suggest that TAD dynamics is primarily governed by the location and affinity of CTCF sites, enabling genome-wide predictive models of cohesin-dependent chromatin interactions. Bioinformatics Biophysics Genetic engineering Genetics Microscopy
N Nature Genetics · Nov 14, 2025 Genome-wide association study and polygenic risk prediction of hypothyroidism We performed a genome-wide meta-analysis of hypothyroidism (113,393 cases and 1,065,268 controls), free thyroxine (191,449 individuals) and thyroid-stimulating hormone (482,873 individuals). We identified 350 loci associated with hypothyroidism, including 179 not previously reported, 29 of which were linked through thyroid-stimulating hormone. We found that many hypothyroidism risk loci regulate blood cell counts and the circulating inflammasome, and through multiple gene-mapping strategies, we prioritized 259 putative causal genes enriched in immune-related functions. We developed a polygenic risk score (PRS) based on more than 115,000 hypothyroidism cases to address diagnostic challenges in individuals with or at risk of thyroid hormone deficiency. We show that the highest predictive accuracy for hypothyroidism was achieved when combining the PRS with thyroid hormones and thyroid-peroxidase autoantibodies, and that the PRS was able to stratify risk of progression among individuals with subclinical hypothyroidism. These findings demonstrate the potential for a hypothyroidism PRS to support the prediction of disease progression and onset in thyroid hormone deficiency. Genome-wide association studies Thyroid diseases biology
N Nature Genetics · Nov 13, 2025 Genome-wide association analyses identify distinct genetic architectures for early-onset and late-onset depression Major depressive disorder (MDD) is a common and heterogeneous disorder of complex etiology. Studying more homogeneous groups stratified according to clinical characteristics, such as age of onset, can improve the identification of the underlying genetic causes and lead to more targeted treatment strategies. We leveraged Nordic biobanks with longitudinal health registries to investigate differences in the genetic architectures of early-onset (eoMDD;n= 46,708 cases) and late-onset (loMDD;n= 37,168 cases) MDD. We identified 12 genomic loci for eoMDD and two for loMDD. Overall, the two MDD subtypes correlated moderately (genetic correlation,rg= 0.58) and differed in their genetic correlations with related traits. These findings suggest that eoMDD and loMDD have partially distinct genetic signatures, with a specific developmental brain signature for eoMDD. Importantly, we demonstrate that polygenic risk scores (PRS) for eoMDD predict suicide attempts within the first 10 years after the initial diagnosis: the absolute risk for suicide attempt was 26% in the top PRS decile, compared to 12% and 20% in the bottom decile and the intermediate group, respectively. Taken together, our findings can inform precision psychiatry approaches for MDD. Depression Population genetics biology
N Nature Genetics · Nov 12, 2025 Computationally efficient meta-analysis of gene-based tests using summary statistics in large-scale genetic studies Meta-analysis of gene-based tests using single-variant summary statistics is a powerful strategy for genetic association studies. However, current approaches require sharing the covariance matrix between variants for each study and trait of interest. For large-scale studies with many phenotypes, these matrices can be cumbersome to calculate, store and share. Here, to address this challenge, we present REMETA—an efficient tool for meta-analysis of gene-based tests. REMETA uses a single sparse covariance reference file per study that is rescaled for each phenotype using single-variant summary statistics. We develop new methods for binary traits with case–control imbalance, and to estimate allele frequencies, genotype counts and effect sizes of burden tests. We demonstrate the performance and advantages of our approach through meta-analysis of five traits in 469,376 samples in UK Biobank. The open-source REMETA software will facilitate meta-analysis across large-scale exome sequencing studies from diverse studies that cannot easily be combined. Genome-wide association studies Software
N Nature Genetics · Nov 11, 2025 Stable clonal contribution of lineage-restricted stem cells to human hematopoiesis Dynamic steady-state lineage contribution of human hematopoietic stem cell (HSC) clones needs to be assessed over time. However, clonal contribution of HSCs has only been investigated at single time points and without assessing the critical erythroid and platelet lineages. Here we screened for somatic mutations in healthy aged individuals, identifying expanded HSC clones accessible for lineage tracing of all major blood cell lineages. In addition to HSC clones with balanced contribution to all lineages, we identified clones with all myeloid lineages but no or few B and T lymphocytes or all myeloid lineages and B cells but no T cells. No other lineage restriction patterns were reproducibly observed. Retrospective phylogenetic inferences uncovered a ‘hierarchical’ pattern of descendant subclones more lineage biased than their ancestral clone and a more common ‘stable’ pattern with descendant subclones showing highly concordant lineage contributions with their ancestral clone, despite decades of separation. Prospective lineage tracing confirmed remarkable stability over years of HSC clones with distinct lineage replenishment patterns. Ageing Stem-cell research biology
N Nature Genetics · Nov 07, 2025 Genetic basis of flavor complexity in sweet corn Sweet corn is an important vegetable crop consumed globally. However, the genetic differentiation between field corn and sweet corn, and the impact of breeding on the metabolite composition and flavor (other than sweetness) of sweet corn, remain poorly understood. Here we assembled a cultivated sweet-corn genome de novo and re-sequenced 295 diverse sweet-corn inbred lines. We examined the genetic architecture of sweet-corn kernel quality by combining genetic, metabolite and expression profiling methodologies. New genes (for example,ZmAPS1,ZmSK1andZmCRR5) and metabolites associated with flavor and consumer preference were identified, highlighting important target flavor metabolites, including sugars, acids and volatiles. These findings provide valuable knowledge and targets for future genetic breeding of sweet-corn flavor, and to balance grain yield and quality and contribute to our broader understanding of crop diversification. Genome-wide association studies Plant genetics Population genetics biology
N Nature Genetics · Nov 07, 2025 TGF-β builds a dual immune barrier in colorectal cancer by impairing T cell recruitment and instructing immunosuppressive SPP1+macrophages Transforming growth factor β (TGF-β) signaling in the tumor microenvironment predicts resistance to immune checkpoint blockade (ICB). While TGF-β inhibition enhances ICB efficacy in murine cancer models, clinical trials have yet to demonstrate benefit, underscoring the need to better understand its immunoregulatory roles across disease contexts. Using mouse models of advanced colorectal cancer and patient-derived data, we demonstrate that TGF-β impairs antitumor immunity at multiple levels in liver metastases. It acts directly on T cells to block recruitment of peripheral memory CD8+T cells, thereby limiting the effectiveness of ICB. Concurrently, TGF-β instructs tumor-associated macrophages to suppress clonal expansion of newly arrived T cells by inducing SPP1 expression. This extracellular matrix protein promotes collagen deposition and accumulation of tumor-associated macrophages and fibroblasts, ultimately driving ICB resistance. Our findings reveal how TGF-β coordinates immunosuppressive mechanisms across innate and adaptive immune compartments to promote metastasis, offering new avenues to improve immunotherapy in colorectal cancer. Colon cancer Immunosurveillance Metastasis biology mouse experiments