Multiomic integration of transcriptome and genome analysis from a single trophectoderm (TE) biopsy with the Embgenix GT-omics Kit
Integrating transcriptome analysis into preimplantation genetic testing workflows enhances our understanding of embryo status and allows for the identification of RNA-based biomarkers to augment the effectiveness of assisted reproductive technology (ART)
Facilitating RNA biomarker discovery for improved ART, the Embgenix GT-omics Kit incorporates whole-genome amplification, cDNA synthesis, and NGS library preparation into a streamlined workflow for the parallel capture of genome and transcriptome information from a single TE biopsy sample
Delivering performance comparable to established standalone methods, the kit enables combined CNV and differential expression analyses with high sensitivity, accuracy, and reproducibility
Preimplantation genetic testing for aneuploidies (PGT-A) is increasingly used in assisted reproductive technology to complement morphology-based evaluations by enabling screening for chromosomal abnormalities that would otherwise go undetected. Although PGT-A has been demonstrated to increase the likelihood of a successful pregnancy, particularly when maternal age exceeds 35, the success rate for assisted reproduction with current PGT-A methods has plateaued at 50% (Munné et al, 2019).
Current preimplantation genetic testing (PGT) methods rely heavily on genomic DNA analysis from trophectoderm (TE) biopsies, while RNA is typically discarded during sample preparation. However, integrating transcriptome analysis into PGT workflows offers a deeper understanding of embryo viability and enables the discovery of RNA-based biomarkers that could enhance assisted reproductive technology (ART) outcomes. A recent retrospective study found that certain transcriptional profiles were associated with significantly higher implantation rates (88.6% vs. 50%), underscoring the potential value of incorporating RNA analysis into preimplantation testing (Jin et al, 2024).
To support continued progress in this area, we developed the Embgenix GT-omics Kit—a multiomic solution designed to enable simultaneous genomic and transcriptomic analysis from a single TE biopsy. Based on the same chemistry approaches recommended in Macaulay et al, the workflow combines the proven reproducibility of PicoPLEX whole-genome amplification (WGA) with the high sensitivity of SMART cDNA synthesis and an optimized NGS library preparation method. Accompanying bioinformatics tools provide a seamless end-to-end solution for simultaneous CNV and transcriptome analysis from a single TE biopsy sample.
In this tech note, we show that the Embgenix GT-omics Kit successfully enables simultaneous genome and transcriptome profiling from a single biopsy sample, allowing researchers to obtain more biological insights. DNA-seq and RNA-seq libraries produced using the kit perform comparably to standalone NGS-based PGT-A and transcriptome analysis methods for TE biopsy samples, demonstrating that combining RNA-seq and DNA-seq library production into a single workflow does not affect data quality. Furthermore, we demonstrate the accuracy of transcriptomic data produced with the Embgenix GT-omics Kit through low-input spike-in control experiments.
Figure 1.Embgenix GT-omics workflow. Following lysis of TE biopsy samples, polyadenylated mRNA is captured using oligo(dT)-coated magnetic beads, while gDNA remains in the supernatant. cDNA is synthesized from bead-bound mRNA, and gDNA undergoes whole-genome amplification in separate tubes, followed by library preparation. RNA-seq and DNA-seq libraries are pooled, sequenced, and analyzed using Embgenix Analysis Software and Cogent NGS tools.
Results
Reliable CNV calling and high-quality transcriptomics from TE biopsy samples
In developing a combined DNA-seq and RNA-seq library prep method for TE biopsy samples, it was important that both types of libraries were as robust as those obtained from standalone methods. TE biopsy samples from four embryos (two separate TE biopsies per embryo) were processed using the Embgenix GT-omics Kit. DNA-seq libraries were sequenced, downsampled to 1.5 x 106 paired-end reads, and resulting data was subjected to CNV analysis using Embgenix Analysis Software (Figure 2). Embgenix Analysis Software uses the informative reads and DLRS metrics to assess the suitability of sequencing data for CNV analysis, and all TE biopsy samples satisfied the QC criteria applied by the software. Assay noise metrics (DLRS, MAPD, NME) were similar to those obtained from the standalone Embgenix PGT-A assay (Figure 2, Panel B).
We also compared CNV analysis results to those obtained from an alternative PGT-A assay performed by an outside laboratory. The CNV plots, sample calls, and karyotypes were concordant with the Embgenix GT-omics Kit and the alternative PGT-A assay. The results demonstrate that DNA-seq data generated from TE biopsy samples using the Embgenix GT-omics Kit provides reliable, accurate CNV analysis with minimal background noise comparable to standalone PGT-A methods.
Sample
Embryo 1
Embryo 2
Embryo 3
Embryo 4
GM8831 control
Replicate
1
2
1
2
1
2
1
2
N/A
Alternative assay result
Trisomy 18 & 20, XY
Trisomy 18 & 20, XY
Trisomy 4, XX
Trisomy 4, XX
Monosomy 13 & 15, XX
Monosomy 13 & 15, XX
Trisomy 9, XY
Trisomy 9, XY
N/A
Sample call
Aneuploid
Aneuploid
Aneuploid
Aneuploid
Aneuploid
Aneuploid
Aneuploid
Aneuploid
Segmental Aneuploid
Karyotype
48, XY, seq(18)x3, seq(20)x3
48, XY, seq(18)x3, seq(20)x3
47, XX, seq(4)x3
47, XX, seq(4)x3
44, XX, seq(13)x1, seq(15)x1
44, XX, seq(13)x1, seq(15)x1
47, XY, seq(9)x3
47, XY, seq(9)x3
46, XY, seq(13q32.1q34)x1
Sex
Male
Male
Female
Female
Female
Female
Male
Male
Male
QC status
Pass
Pass
Pass
Pass
Pass
Pass
Pass
Pass
Pass
Total reads
1,500,000
1,500,000
1,500,000
1,500,000
1,500,000
1,500,000
1,500,000
1,500,000
1,500,000
Informative reads
921,765
922,546
920,313
915,817
949,269
947,484
945,092
941,590
948.483
% informative reads
61.45
61.50
61.35
61.05
63.28
63.17
63.01
63.77
63.23
DLRS
0.0982
0.1012
0.0974
0.0980
0.0949
0.0979
0.1030
0.0925
0.1042
MAPD
0.1420
0.144
0.1545
0.1460
0.1393
0.1444
0.1545
0.1330
0.1534
NME
199
380
222
249
284
251
327
188
69
Figure 2. CNV analysis of TE biopsies using the Embgenix GT-omics Kit and Embgenix Analysis Software. Panel A. CNV plots displaying normalized counts of sequencing reads mapped to 1 Mb bins across each chromosome for four pairs of TE biopsy samples (each pair of samples are derived from the same embryo) and one control sample derived from the lymphoblastoid cell line GM08331. Aneuploid and mosaic calls are marked with blue arrows. Panel B. Automated sample classification, karyotype calls, and corresponding QC metrics for each sample analyzed. Alternative assay results refers to CNV calls obtained for each embryo via analysis of a separate biopsy sample with a comparable assay. Total reads denotes the total number of sequencing reads submitted for analysis, while informative reads represent the number of sequencing reads that were successfully mapped and used for CNV analysis. DLRS (derivative log ratio spread) and mean absolute pairwise difference (MAPD) quantify signal noise, serving as key metrics for evaluating data suitability for accurate CNV analysis. NME (noise metric euploid) is an additional noise metric intended to assess assay performance across experiments. QC Status indicates whether a sample met predefined thresholds for informative reads and DLRS, ensuring data quality for downstream analysis.
RNA-seq libraries were sequenced, downsampled to 4.0 x 106 paired-end reads per sample, and analyzed using the Cogent NGS Analysis Pipeline (CogentAP). The distribution of reads mapping to exonic, intronic, intergenic, mitochondrial, and ribosomal RNA regions was consistent across each biopsy sample per embryo, and each biopsy sample yielded over 16,000 unique detected genes, with an average of 17,527 genes identified across all six biopsies (Figure 3, Panel A). All RNA-seq libraries provided full gene-body coverage comparable to SMART chemistry (Figure 3, Panel B). A 3′ bias was observed in a single sample—Embryo 4-2—likely reflecting partial RNA degradation introduced during sample preparation, which resulted in reduced 5′ coverage (Figure 3, Panel B).
Figure 3. Transcriptome analysis of TE biopsy samples using the Embgenix GT-omics Kit and Cogent NGS tools. Panel A. Distribution of sequencing reads mapped to exonic, intronic, mitochondrial, rRNA, and intergenic regions for each TE biopsy sample. The gene count, or the number of unique genes detected for each replicate based on mapping of RNA-seq, is shown above the graph. Panel B. Gene-body coverage of RNA-seq libraries generated from TE biopsy samples.
Accurate transcript quantitation for low-input samples
We have demonstrated the comparable sensitivity of the Embgenix GT-omics method compared to standalone assays using TE biopsy samples. Next, the accuracy and reproducibility of the data was evaluated using synthetic RNA reference standards from the External RNA Controls Consortium (ERCC). These standards consisted of 92 distinct polyadenylated RNA species with known sequences, each present at defined concentrations in one of two formulations (Mix1 vs. Mix2). To simulate real-world TE biopsy sample processing conditions, ERCC standards were combined with five-cell samples derived from the GM05067 cell line at two different dilution levels (low vs. high), generating four distinct RNA concentration conditions. Samples were processed in triplicate using the Embgenix GT-omics Kit and resulting RNA-seq libraries were sequenced, downsampled to 4 x 106 reads per sample, and analyzed using a custom analysis pipeline. Measured RNA quantities were compared with expected values to assess the correlation across the four concentration levels, validating the accuracy and reproducibility of the transcriptomic data obtained.
The resulting profiles for synthetic ERCC transcripts exhibited strong linear correlations (R > 0.99 for intra-mix comparisons and R > 0.93 for comparisons with expected ERCC values), as visualized in a Pearson correlation matrix (Figure 4, Panel A), highlighting the high reproducibility provided by the GT-omics assay. Comparison of measured vs. expected ERCC transcript counts at each of the four concentrations (Figure 4, Panel B) demonstrated the assay’s ability to detect transcripts at an abundance of 100 copies or more with a sequencing depth of 8 x 106 reads. While measured fold changes did not precisely match expected values, particularly at lower abundance levels, a clear correlation was observed between expected and measured fold changes. These results highlight the Embgenix GT-omics Kit’s ability to generate reliable data for differential expression analysis, including quantification of low-abundance transcripts.
Figure 4. Assessing transcriptomic accuracy with synthetic RNA spike-in standards. Panel A. Pearson correlation matrix illustrating the correlations in measured quantities of 92 ERCC spike-in RNA species. These species were introduced at two different ratios (Mix1 vs. Mix2) and added to GM05067 cells at two dilution levels (low vs. high) in triplicate or measured directly using the Embgenix GT-omics Kit and custom software tools. Panel B. Scatter plot comparing the measured fold changes of 92 ERCC spike-in RNA species (Y-axis) to their expected fold changes (X-axis) at four different concentrations. Each dot represents an individual RNA species, with spiked-in concentrations indicated by the color gradient to the right of the plot. The plot was generated using the Embgenix GT-omics Kit and a custom analysis pipeline.
Conclusions
Using TE biopsy samples and synthetic RNA reference standards as benchmarks, we demonstrate that the Embgenix GT-omics Kit enables simultaneous genome and transcriptome profiling with sensitivity, accuracy, and reproducibility comparable to standalone methods. This is accomplished through a simple, streamlined workflow compatible with standard molecular biology equipment.
Incorporation of both genomic and transcriptomic profiling during preimplantation testing will allow us to uncover critical details about gene expression patterns, regulatory mechanisms, and cellular processes that influence embryo development and enable the identification of prospective biomarkers for embryo status and implantation potential. These biomarkers may further guide researchers in assessing embryo quality and developmental potential for ART, advancing the field of reproductive medicine, and increasing the chances of live birth delivery.
Methods
Samples
TE biopsy samples, collected in collaboration with an external laboratory, were obtained as re-biopsies from embryos that had undergone prior vitrification. To assess the transcriptomic accuracy of the method, ERCC ExFold RNA Spike-In Mixes (ThermoFisher Scientific; Cat. # 4456739) containing blends of 92 different synthetic RNA species in two different formulations were added to the GM05067 lymphoblastoid cell line (Coriell Institute) samples at two different concentrations.
Embgenix GT-omics assay
Lysates were processed according to the Embgenix GT-omics Kit protocol shown in Figure 1. cDNA and WGA products were subjected to enzymatic fragmentation, adapter ligation, and PCR amplification to yield RNA-seq and DNA-seq libraries, respectively. Following clean-up, RNA-seq and DNA-seq libraries were pooled and sequenced. Then, the data was analyzed to identify genomic variants and perform differential expression analysis.
Sequencing
All libraries were sequenced on an Illumina NextSeq 550 using 2 x 75 bp paired-end reads with a NextSeq 500/550 Mid Output v2.5 Kit (150 Cycles; Cat. # 20024904).
Analysis
Following library preparation and sequencing, DNA-seq data were downsampled to 1.5 x 106 paired-end reads per cell, and copy number analysis was performed using Embgenix Analysis software. RNA-seq data were downsampled to 4.0 x 106 paired-end reads per sample and analyzed using CogentAP. For analysis of sequencing data from the ERCC spike-in mixes, RNA-seq data was downsampled to 8 x 106 reads, Illumina adapter sequences were trimmed using Trimmomatic, and reads were aligned with Bowtie2. Mapped read counts were then used to calculate fold changes between samples using a custom analysis pipeline.
References
Jin, J. et al. Multi-omics PGT: re-evaluation of euploid blastocysts for implantation potential based on RNA sequencing. Hum Reprod.39, 2861–2872 (2024).
Macaulay, I. et al. G&T-seq: parallel sequencing of single-cell genomes and transcriptomes. Nature methods. 12, 519–522 (2015).
Munné, S. et al. Preimplantation genetic testing for aneuploidy versus morphology as selection criteria for single frozen-thawed embryo transfer in good-prognosis patients: a multicenter randomized clinical trial. Fertil Steril. 112, 1071–1079 (2019).