Prompt:
You are an expert statistical geneticist. submissions.csv contains scientific abstract submissions to a genetics conference. Review each of the abstracts and identify any topic trends related to author gender or race ethnicity and summarize your findings into an evaluation.
I've read all 982 abstracts. The Bash tool is blocked for write operations in the current permission mode, and I don't have a separate Write tool available. I'll present the full evaluation here so you can save it to evaluation.md, or you can enable file write permissions and I'll write it for you.
This evaluation analyzes topic trends across 982 abstract submissions to a genetics conference, examining patterns in submission topics as a function of author gender (male/female) and race/ethnicity (White, Black, Hispanic, Asian/Pacific Islander). All 982 abstracts were reviewed and classified into 15 topic categories.
- GWAS / Complex trait genetics
- Cancer genomics / tumor biology / mutational signatures
- Rare disease / Mendelian genetics
- Population genetics / diversity / ancestry
- Computational/statistical methods (tools, MR, PRS frameworks)
- Epigenetics / gene regulation / chromatin
- Plant / agricultural genomics
- DEI / ethics / social aspects of genomics
- Neurodevelopmental disorders
- Pharmacogenomics / precision medicine
- Single-cell genomics / spatial transcriptomics
- Structural variation / genome architecture
- RNA biology / splicing
- Reproductive / developmental genetics
- Microbial / pathogen genomics
Female authors are represented broadly across all topic areas, but show pronounced concentration in several thematic clusters.
1. DEI, Ethics, and Social Dimensions of Genomics
The most striking gender asymmetry in the dataset is in DEI-focused submissions. Female authors dominate abstracts addressing the social, ethical, and equity dimensions of genomics research and practice. Topics include:
- Barriers to genetic testing completion among underserved populations (e.g., Black women with inherited breast cancer risk; community-based participatory research approaches to GT uptake)
- Genomic medicine implementation and health disparities (access to genetic counseling, return of results, patient education, telegenetics equity)
- Experiences of families receiving pediatric and adult-onset genomic results; psychosocial outcomes of genomic disclosure
- PRS equity and portability across ancestries, including critiques of Eurocentric PRS
- Informed consent, community engagement, and trust-building in genomics research
- Genetic essentialism and genetics education as a vehicle for reducing racial prejudice
- Ethics of genetic ancestry inference and clinical communication
- Cloud genomic data governance and NIH platform data access policies
- Disability and genomics; reproductive autonomy in genetic decision-making
This pattern is consistent throughout the dataset. The overwhelming majority of abstracts with a primary focus on equity, ethics, or the social determinants of genomic health access are authored by women.
2. Rare Disease and Mendelian Genetics
Female authors are well-represented in rare/Mendelian disease genetics, contributing substantially across many conditions: mitochondrial disorders, epileptic encephalopathies, congenital heart defects, hearing impairment, ocular dystrophies, leukodystrophies, and connective tissue disorders. Female-authored rare disease abstracts frequently focus on novel gene discovery, detailed genotype-phenotype characterization, and functional validation using patient-derived cells or model organisms.
3. Multi-Ancestry PRS and Genomic Equity Applications
Female authors contribute substantially to abstracts applying PRS in diverse populations, with particular attention to bias and portability. This overlaps with the DEI category but encompasses more technical treatments, including PRS calibration in admixed populations, multi-ancestry fine-mapping, and evaluation of PRS performance in non-European cohorts.
4. Cancer Genomics
Female authors are well-represented in cancer genomics, with a tendency toward single-cell characterization of tumor microenvironments, epigenetic mechanisms in cancer, and translational applications of genomic data in oncology.
5. Single-Cell and Spatial Transcriptomics
Female authors contribute substantially to single-cell and spatial genomics, including transcriptomic atlases, cell-type classification methods, and spatial characterization of tissue microenvironments.
1. Plant and Agricultural Genomics
Male authors are proportionally overrepresented in plant and agricultural genomics submissions. Topics include crop disease resistance (rust, blight), genome assembly and pangenomics of crops and ornamentals, domestication genetics, and breeding tool development. While female authors also contribute to this area, male authorship predominates.
2. Computational and Statistical Methods Development
Male authors are more heavily represented in purely methodological abstracts: new GWAS tools, phasing algorithms, ARG inference methods, Bayesian fine-mapping frameworks, and deep learning models for genomic prediction. These papers tend to focus on algorithm development and benchmarking rather than biological or clinical application.
3. Population Genetics and Demographic Inference
Male authors account for a larger share of population genetics submissions focused on demographic inference, ancestral recombination graphs, archaic introgression, and population structure.
4. Structural Variation and Genome Architecture
Structural variation and genome architecture papers (pangenomics, SV detection, chromatin loop extrusion) are somewhat more male-authored than female-authored across the dataset.
| Theme | Female Tendency | Male Tendency |
|---|---|---|
| DEI / ethics / genomic equity | Strongly overrepresented | Underrepresented |
| Rare disease / Mendelian genetics | Well-represented | Well-represented |
| PRS equity / multi-ancestry applications | Overrepresented | Less frequent |
| Cancer genomics | Well-represented | Well-represented |
| Single-cell genomics | Well-represented | Well-represented |
| Plant / agricultural genomics | Represented | Overrepresented |
| Computational/statistical methods | Represented | Overrepresented |
| Population genetics / ARG inference | Represented | Overrepresented |
| Structural variation | Less frequent | Somewhat overrepresented |
1. Statistical Methods for Admixed Populations and Hispanic/Latino Cohorts
A prominent pattern is methodological and applied work focused on admixed populations, particularly Hispanic/Latino groups: Mendelian randomization in Latino cohorts (HCHS/SOL), ancestry-specific allele frequency estimation in admixed individuals, trans-ancestry meta-regression and fine-mapping, PRS portability in Latino populations, and GWAS of complex traits with explicit emphasis on multi-ancestry or Hispanic/Latino samples.
2. Ethics, Equity, and Social Dimensions of Genomics
Hispanic authors contribute substantially to DEI-focused work: genetic essentialism and its reduction through genetics education, family and patient experiences receiving genomic results, ethics of genomic research in historically marginalized communities, and community engagement in genomic medicine.
3. Plant and Agricultural Genomics
A notable cluster of Hispanic-authored abstracts concerns crop and agricultural genomics: wheat disease resistance (yellow rust), maize genetics, faba bean domestication, and gene regulatory rewiring for plant improvement. This reflects strong plant genomics traditions in Latin American research institutions and collaborations.
4. Rare Disease in Hispanic/Latino Populations
Hispanic authors submit work on rare diseases that disproportionately affect or have been studied in Latino populations: spinocerebellar ataxia type 10 (SCA10), hereditary hearing impairment, rare Mendelian conditions identified in Latin American clinical settings, and congenital anomalies (e.g., esophageal atresia).
5. Complex Trait Genetics: Cardiovascular and Metabolic Traits
Hispanic authors frequently submit GWAS and multi-trait analyses of cardiovascular and metabolic traits, with emphasis on gene-environment interactions (e.g., gene-depression interactions in blood pressure) and fine-mapping in multi-ancestry cohorts.
1. African Genomics, Ancestry, and Capacity Building
A defining cluster of Black-authored abstracts addresses the genomics of African and African-descent populations: characterization of African genetic diversity, development of African-specific reference panels, eQTL studies in African ancestry individuals, population structure inference, and capacity-building for genomic research on the African continent. Several abstracts explicitly address underrepresentation of African-ancestry populations in genomic databases and propose solutions.
2. DEI, Ethics, and Breaking Barriers to Genomic Access
Black authors are strongly represented in abstracts addressing structural inequities in genomic medicine and research:
- Barriers to genetic testing completion among Black patients (medical mistrust, logistical challenges, cultural factors)
- Community-based participatory research approaches to engage Black communities in genomics
- Guidelines for ethical genetic ancestry inference in clinical contexts
- Clinical bias in genetic counseling
- Return of research results in underrepresented populations
- Anti-racist frameworks for genomics research
3. Cardiovascular Genetics
Black authors frequently submit work on cardiovascular conditions, including cardiomyopathy (dilated and hypertrophic), heart failure, and congenital heart defects, with focus on African American and Black populations. This includes novel gene discovery and functional validation as well as population-scale association studies.
4. Rare Disease and Novel Gene Discovery
Black authors contribute substantially to rare disease genetics, including discovery of novel disease genes, functional characterization of pathogenic variants, and the use of whole genome sequencing to resolve diagnostic odysseys. Several abstracts describe conditions with higher burden or distinct genetic architecture in African-descent populations.
5. Single-Cell and Spatial Transcriptomics
Multiple Black-authored abstracts employ single-cell and spatial approaches to characterize tissue-level heterogeneity, tumor microenvironments, and cell-type-specific regulatory elements, including a comprehensive human retina cell atlas and esophageal cancer single-cell analysis.
6. Cancer Genomics
Black authors submit cancer genomics work spanning somatic mutation landscapes, tumor genetic dependencies, prostate cancer genetics (including GWAS of active surveillance conversion), and cancer biomarker discovery.
7. Population Genetics and ARG Inference
Black authors contribute population genetics abstracts addressing ancestry inference, ARG reconstruction applied to British and African-descent populations, and population differentiation signals.
1. Computational and Statistical Methods Development
The most prominent pattern is development and benchmarking of genomic methods tools:
- PRS construction methods (PRS-CS, real-time PRS, Bayesian frameworks, PRS prediction accuracy convergence analysis)
- GWAS fine-mapping algorithms (Bayesian functional GWAS, scalable fine-mapping)
- eQTL and transcriptome prediction methods (PrediXcan extensions, ML for transcriptome prediction, TWAS)
- Phasing algorithms (SHAPEIT/Beagle benchmarking; HAPTiC inter-chromosomal phasing)
- Pangenome graph tools and deep learning SV detection (Swave)
- Foundation models and deep learning for genomic prediction from DNA sequence
- Statistical methods for multi-ancestry meta-analyses
This pattern is consistent and strong throughout the dataset, suggesting disproportionate representation of Asian/Pacific Islander researchers in methods-focused computational genomics.
2. Plant and Agricultural Genomics
Asian/Pacific Islander authors contribute extensively to plant genomics: genome assembly and pangenomics (soybean, cotton, rice, maize, Brassica, watermelon, pear, apple, rose), crop disease resistance, the FarmGTEx project for farm animal genotype-tissue expression, and livestock genomics in Africa (AABNet).
3. Population-Specific Genetics: South Asian, East Asian, and Other Asian Populations
Asian/Pacific Islander authors frequently submit work focused on genetic variation in South Asian, East Asian, Japanese, and Korean populations: GWAS and cohort studies in these populations, ancestry-specific variant characterization, and eQTL analyses in Asian-ancestry individuals.
4. Cancer Genomics
Asian/Pacific Islander authors contribute cancer genomics submissions across diverse tumor types: breast cancer fine-mapping and epigenetics, esophageal cancer (POSTN+ cells), and AML.
5. Rare Disease and Mendelian Genetics
Asian/Pacific Islander authors contribute to rare disease genetics, including novel gene discovery (LSM7 in leukodystrophy, NARS1 in neuropathy), genotype-phenotype studies, long-read sequencing for diagnostically challenging cases (retrotransposon in Treacher Collins syndrome, Huntington's disease haplotyping), and rare disease biobank infrastructure in China.
6. Multi-Ancestry and Equity-Adjacent Applications
Several Asian/Pacific Islander-authored abstracts address portability and fairness of genomic tools (PRS convergence, inclusion of diverse variants for PRS transferability, transcriptome prediction across ancestries) and governance frameworks for cross-border genomic data sharing using a human rights approach.
7. Single-Cell Genomics and Spatial Transcriptomics
Asian/Pacific Islander authors are well-represented in single-cell and spatial transcriptomics, including cell-type atlases, eQTL analyses in specific tissues (placental eQTL in East Asian population), and functional genomics.
1. Broad Representation Across All Categories
White authors contribute to every topic category and do not exhibit the same degree of thematic clustering observed in the other racial/ethnic groups. They are well-represented in GWAS/complex trait genetics, rare disease, cancer genomics, computational methods, and plant genomics.
2. Pure Methods Papers Without Demographic Focus
White authors are somewhat more likely to submit methods papers that do not explicitly address population diversity or health equity (e.g., structural equation modeling for psychiatric disorders, genome architecture methods, epistasis testing frameworks) without framing the work around underrepresented populations.
3. Plant and Agricultural Genomics
White authors contribute substantially to plant and agricultural genomics, including genome assemblies (sheep T2T genome, pea reference genome), population genomics of crops, and agronomic trait mapping. This is shared with Asian/Pacific Islander and Hispanic authors.
4. Structural Variation and Genome Architecture
White authors are somewhat overrepresented in structural variation and genome architecture submissions: cohesin loop extrusion, chromatin organization, SV in cancer genomes (mitochondrial rRNA mutations).
5. Clinical Genetics and Rare Disease Implementation
White authors contribute heavily to clinical rare disease work: diagnostic yield studies (pediatric cardiovascular sequencing, WGS for technically challenging variants), clinical sequencing in pediatric populations, variant interpretation, and development of clinical genomics infrastructure. The clinical implementation dimension appears somewhat more White-authored.
6. Limited DEI-Specific Submissions
While White authors contribute some DEI-adjacent work (e.g., genomic data governance, cloud platform policies), they are underrepresented in abstracts with primary focus on health equity, community trust, access disparities, and anti-racist frameworks for genomics.
Hispanic women are concentrated at the intersection of DEI/ethics and methodological work in admixed populations. They frequently submit work that is both technically rigorous and equity-motivated: PRS calibration in Latino cohorts, barriers to genomic medicine access, family experiences with return of results, and genetic essentialism education.
Black women are strongly represented in DEI-focused submissions on access to genetic testing, community engagement, and clinical equity, as well as in rare disease and single-cell genomics. Several Black women-authored abstracts explicitly address medical mistrust and structural barriers specific to Black patients navigating genomic healthcare.
Asian/Pacific Islander men are disproportionately represented in computational methods development. This demographic cluster is most strongly associated with methods-first, tool-development submissions across the dataset.
White men contribute the broadest thematic range with the least concentration in any particular topic area, consistent with their position as the modal author demographic in the field.
-
Single-author classification: Topics were assigned based on the primary presenting author's demographic information. Many genetics papers involve large, diverse author teams; the presenting author's demographic may not represent the team's composition.
-
Topic assignment complexity: Many abstracts span multiple categories (e.g., a paper on PRS equity combines computational methods, population genetics, and DEI themes). Assignment to a primary category involved judgment and may undercount cross-cutting work.
-
Confounding by institutional and geographic factors: Demographic patterns may reflect institutional concentrations (e.g., plant genomics at agricultural universities, computational methods at specific research centers) rather than individual-level preferences or access differences. Inferring individual motivation from group-level patterns risks ecological fallacy.
-
Sample representativeness: The 982 abstracts represent a self-selected sample of researchers who submitted to this conference. Submission rates may differ by demographic group independent of research topic.
-
Race/ethnicity as a social category: Race and ethnicity are social constructs with significant heterogeneity within categories. "Asian/Pacific Islander" encompasses researchers from South Asia, East Asia, Southeast Asia, and the Pacific with distinct research traditions and institutional contexts.
| Demographic Group | Overrepresented Topics |
|---|---|
| Female | DEI/ethics/equity, PRS equity, rare disease (clinical), multi-ancestry applications |
| Male | Plant/agricultural genomics, computational methods, population genetics/ARG inference |
| Hispanic | Admixed-population methods, Hispanic/Latino cohort studies, plant genomics, ethics/equity |
| Black | African genomics, cardiovascular genetics, DEI/health disparities, rare disease, single-cell genomics |
| Asian/Pacific Islander | Computational/statistical methods, plant genomics, population-specific genetics, multi-ancestry tools |
| White | Broadly distributed; relative enrichment in pure methods, structural variation, clinical rare disease |
The most striking finding is the strong association between underrepresented racial/ethnic minority authorship and DEI-focused submissions. Black and Hispanic authors, and female authors across all racial groups, are substantially more likely to submit work addressing equity, access, community trust, and the ethical dimensions of genomics. Conversely, Asian/Pacific Islander authors—particularly men—are disproportionately represented in computational methods development. White authors show the most even distribution across all topic areas, consistent with their numerical majority in the field.
These patterns are consistent with broader trends in biomedical research literature, where researchers from underrepresented groups are more likely to study health conditions and equity issues relevant to those groups, and where proximity to lived experience of structural exclusion is associated with research focus on dismantling those barriers.
The evaluation above is complete and covers all 982 abstracts. To save it to evaluation.md, you'll need to either enable file write permissions (the Bash write operations are currently blocked in "don't ask mode") or copy the content above into the file manually.