We use cookies to improve your browsing experience and provide meaningful content. Read our cookie policy. Accept
  •  Customer Login
  • Register
  •  View Cart (0)
  •  Customer Login
  • Register
  •  View Cart (0)

Takara Bio
  • Products
  • Services & Support
  • Learning centers
  • APPLICATIONS
  • About
  • Contact Us

Clontech Takara Cellartis

Close

  • ‹ Back to Bioinformatics resources
  • Automation systems
  • Next-generation sequencing
    • Product line overview
    • RNA-seq
    • Technical notes
    • Technology and application overviews
    • FAQs and tips
    • DNA-seq protocols
    • Bioinformatics resources
    • Webinars
    • Posters
  • Spatial biology
  • mRNA and cDNA synthesis
  • Gene function
  • Stem cell research
  • Protein research
  • PCR
  • Cloning
  • Nucleic acid purification
  • Antibodies and ELISA
  • Real-time PCR
CogentAP resource portal CogentAP product docs & additional resources
CogentAP software page Cogent NGS Analysis Pipeline software
Home › Learning centers › Next-generation sequencing › Bioinformatics resources › Cogent NGS Analysis Pipeline v1.0 User Manual

Learning centers

  • Automation systems
    • Shasta Single Cell System introduction
      • Biomarker and drug discovery
      • Shasta educational library
    • SmartChip Real-Time PCR System introduction
      • SmartChip Real-Time PCR System applications
        • Pathogen detection in human samples and food
          • Pathogen detection from UTI, STI and wound infections
          • High-throughput detection of SARS-CoV-2
        • Antibiotic resistance genes
          • Screening for antibiotic resistance genes in manure and sewage
          • Uncovering antibiotic resistance genes in soil, sediment, and sludge
          • Tracking down antibiotic resistance genes in hospitals
          • Identifying antibiotic resistance genes in water
        • mRNA, miRNA, and lncRNA as disease biomarkers
        • Genotyping using animal and blood samples
      • SmartChip Real-Time PCR System video resources
        • Webinar: Monitoring ARGs in environmental samples
        • Webinar: antibiotic resistance screening
        • SmartChip tutorials
    • ICELL8 introduction
      • ICELL8 cx applications
        • Archival nanowell sequencing
      • ICELL8 technology overview
      • ICELL8 cx technical specifications
      • ICELL8 technical specifications (original system)
      • ICELL8 system vs plate-seq
      • Webinars
        • Webinar: Leveraging single-cell transcriptomics and epigenomics for biomarker discovery
        • Advances in single-cell indexing registration
        • Single-Cell Workshop at 2020 NextGen Omics Series UK
        • The power of full-length scRNA-seq
        • Sign up: cardiomyocyte webinar
      • Technical notes
        • Enhancing biomarker discovery with SMART-Seq Pro kit and ICELL8 cx system
        • ICELL8 cx system target enrichment for fusions
        • ICELL8 cx system reagent formulation and dispense guidelines
        • Improved detection of gene fusions, SNPs, and alternative splicing
        • Full-length transcriptome analysis
        • High-throughput single-cell ATAC-seq
        • Protocol: High-throughput single-cell ATAC-Seq
        • Single-cell identification with CellSelect Software
        • Single-cell analysis elucidates cardiomyocyte differentiation from iPSCs
        • Combined TCR profiling and 5’ DE in single cells
        • Automated, high-throughput TCR profiling
      • Sample preparation protocols
        • Basic cell preparation for the ICELL8 cx system
        • Protocol: Nuclei isolation from mammalian cells
        • Protocol: Mouse cardiomyocyte preparation
        • Isolate cells of any size
      • Video resources
      • Citations
      • Posters
      • System & software notices
    • Apollo library prep system introduction
      • Automated VeriSeq library preparation for PGS
      • Apollo library prep system citations
      • SMART-Seq v4 chemistry for the Apollo system
      • Apollo library prep system overview
      • Apollo system technical specifications
      • In-tip bead separation on the Apollo system
  • Next-generation sequencing
    • Product line overview
    • RNA-seq
      • Automated library prep
      • Technologies and applications
        • SMART technology
        • Single-cell mRNA-seq
        • Total RNA-seq
        • SMART-Seq PLUS solutions
      • Technotes
        • Enabling long-read RNA sequencing from low-input samples
        • Singular for low input total RNA seq
        • All-in-one cDNA synthesis and library prep from single cells
        • Automation-friendly, all-in-one cDNA synthesis and library prep
        • All-in-one cDNA synthesis and library prep from ultra-low RNA inputs
        • 3' mRNA libraries from single cells (SMART-Seq v4 3' DE Kit)
        • Full-length mRNA-seq for target capture
        • Stranded libraries from single cells
        • Stranded libraries from picogram-input total RNA (v3)
        • Stranded libraries from 100 pg-100 ng total RNA
        • Stranded libraries from 100 ng - 1 ug total RNA
        • Stranded libraries from FFPE inputs (v2)
        • Nonstranded libraries from FFPE inputs
        • Singular and Takara Bio library prep
        • Full-length, single-cell, and ultra-low-input RNA-seq with UMIs
      • Webinars
        • Pushing the limits of sensitivity for single-cell applications
        • Capturing biological complexity by high-resolution single-cell genomics
        • Taking single-cell RNA-seq by STORM
        • STORM-seq Q&A
        • Neural multiomics Q&A
        • Liver metabolic function, dissecting one cell at a time
        • Pushing the limits Q&A
        • Total RNA sequencing of liquid biopsies
        • Liver metabolic function Q&A
        • Automating full-length single-cell RNA-seq libraries
        • Single-cell whole transcriptome analysis
        • Sensitivity and scale for neuron multiomics
      • RNA-seq tips
      • RNA-seq FAQs
    • Technical notes
      • DNA-seq
        • Next-gen WGA method for CNV and SNV detection from single cells
        • Low-input whole-exome sequencing
        • DNA-seq from FFPE samples
        • Low cell number ChIP-seq using ThruPLEX DNA-Seq
        • Detection of low-frequency variants using ThruPLEX Tag-Seq FLEX
        • ThruPLEX FLEX outperforms NEBNext Ultra II
        • Streamlined DNA-seq from challenging samples
        • High-resolution CNV detection using PicoPLEX Gold DNA-Seq
        • ThruPLEX FLEX data sheet
        • Low-volume DNA shearing for ThruPLEX library prep
        • NGS library prep with enzymatic fragmentation
        • Comparing ThruPLEX FLEX EF to Kapa and NEBNext
      • Immune Profiling
        • Track B-cell changes in your mouse model
        • Efficient and sensitive profiling of human B-cell receptor repertoire
        • TCRv2 kit validated for rhesus macaque samples
        • Improved TCR repertoire profiling from mouse samples (bulk)
        • TCR repertoire profiling from mouse samples (bulk)
        • BCR repertoire profiling from mouse samples (bulk)
        • Improved TCR repertoire profiling from human samples (bulk)
        • TCR repertoire profiling from human samples (single cells)
        • BCR repertoire profiling from human samples (bulk)
      • Epigenetic sequencing
        • ChIP-seq libraries for transcription factor analysis
        • ChIP-seq libraries from ssDNA
        • Full-length small RNA libraries
        • Methylated DNA-seq with MBD2
      • Reproductive health technologies
        • Embgenix ESM Screen
        • Embgenix PGT-A
    • Technology and application overviews
      • Embgenix GT-omics Oncology Tech Note
      • Sequencing depth for ThruPLEX Tag-seq
      • Whole genome amplification from single cells
    • FAQs and tips
      • Positive and negative controls in scRNA-seq
      • DNA-seq FAQs
      • ChIP-seq FAQs
      • Indexing FAQs
      • TCR-seq methods: Q&A
    • DNA-seq protocols
      • Using UMIs with ThruPLEX Tag-Seq FLEX
      • Targeted capture with Agilent SureSelectQXT
      • Exome capture with Illumina Nextera Rapid Capture
      • Targeted capture with Roche NimbleGen SeqCap EZ
      • Targeted capture with IDT xGen panels
      • Targeted capture with Agilent SureSelectXT
      • Targeted capture with Agilent SureSelectXT2
    • Bioinformatics resources
      • Cogent NGS Analysis Pipeline
        • Cogent NGS Analysis Pipeline notices
      • Cogent NGS Discovery Software
        • Cogent NGS Discovery Software notices
      • Cogent NGS Immune Profiler
        • Cogent NGS Immune Profiler Software notices
      • Cogent NGS Immune Viewer
      • Embgenix Analysis Software
      • SMART-Seq DE3 Demultiplexer
    • Webinars
      • Harnessing the power of full-length transcriptome analysis for biomarker discoveries
      • SMART-Seq Pro kits for biomarker detection
      • Takara Bio Single-Cell Workshop, Spring 2021
      • Single-Cell Workshop at 2020 NextGen Omics Series UK
      • Immunogenomics to accelerate immunotherapy
      • MeD-Seq, a novel method to detect DNA methylation
      • Single-cell DNA-seq
    • Posters
      • Long-read mRNA-seq poster
  • Spatial biology
  • mRNA and cDNA synthesis
    • mRNA synthesis
      • mRNA synthesis selection guide
      • mRNA synthesis FAQs
      • Takara IVTpro mRNA Synthesis System
        • Cloning Kit for mRNA Template
        • Takara IVTpro T7 mRNA Synthesis Kit
      • 5-prime capping of mRNA
        • Post-transcriptional capping
        • Co-transcriptional capping
      • Download resources
        • Sign up to download our infographic
      • Webinar: Streamlining your IVT workflow to maximize mRNA yields
    • cDNA synthesis
      • Premium total and poly A+ RNA
      • SMARTer RACE 5'/3' Kit
      • Cloning antibody variable regions
  • Gene function
    • Gene editing
      • Gene editing product finder
      • Gene editing tools and information
        • sgRNA design tools
        • Tools for successful CRISPR/Cas9 genome editing
        • Gene editing posters
        • Customer data for Guide-it products
        • How to design sgRNA sequences
        • Introduction to the CRISPR/Cas9 system
        • Gene editing of CD3+ T cells and CD34+ HSCs
      • CRISPR/Cas9 knockouts
        • Mutation detection kit comparison
        • Screening for effective guide RNAs
        • Monoallelic versus biallelic mutants
        • Indel identification kit for mutation characterization
      • CRISPR/Cas9 knockins
        • Choosing an HDR template format
        • Homology-directed repair FAQs
        • Mouse CRISPR knockin protocol
        • Site-specific gene knockins using long ssDNA
        • Efficient CRISPR/Cas9-mediated knockins in iPS cells
        • Oligo design tool for detecting precise insertions
          • Oligo design tool user guide (insertions)
      • Genome-wide screening
        • CRISPR library screening
        • CRISPR library screening webinar
        • Phenotypic screen using sgRNA library system
      • Creating and screening for SNPs
        • SNP detection with knockin screening kit
        • Oligo design tool for SNP screening
          • Oligo design tool user guide (SNPs)
        • Sign up: SNP engineering webinar
        • Guide-it SNP Screening Kit FAQs
      • CRISPR/Cas9 delivery methods
        • Electroporation-grade Cas9 for editing in diverse cell types
        • CRISPR/Cas9 gene editing with AAV
        • CRISPR/Cas9 gesicles overview
        • Cas9 Gesicles—reduced off-target effects
        • sgRNA-Cas9 delivery to many cell types
        • Tet-inducible Cas9 for gene editing
      • Cre recombinase
        • Control your Cre recombinase experiments
        • Fast Cre delivery with gesicle technology
    • Viral transduction
      • Recombinant virus comparison
      • Product finder
      • Transduction posters
      • Lentivirus
        • Lenti-X Transduction Sponge overview
          • Lenti-X Transduction Sponge tech note
          • Lenti-X Transduction Sponge compatible cell types
        • Webinars
          • Webinar: Cellular reprogramming of cancer cells for immunotherapy
          • Lentiviral particles webinar
        • Customizable SARS-CoV-2 pseudovirus
        • Lenti-X FAQs
        • Lentiviral workflow
        • Lentiviral products guide
        • Lentivirus biosafety
        • Lentiviral tips
        • Lentiviral vectors
        • Lenti-X packaging
        • High-throughput lentivirus production
        • Lenti-X Concentrator
        • Lentiviral titration
        • Lenti-X GoStix Plus video protocols
        • Lenti-X GoStix Plus FAQs
        • Rapid lentivirus titration by p24 ELISA
        • Lentiviral particles
        • Lentiviral particles—fluorescent
        • Lentiviral particles FAQs
      • Retrovirus
        • Retroviral products
        • Retro-X FAQs
        • Retro-X packaging
        • Retro-X Concentrator
      • Adeno-associated virus
        • AAV workflow
        • AAV products
        • AAV FAQs
        • AAV tech notes
          • Customer data: Purified AAV9 delivery (mouse brain)
          • Customer data: Purified AAV9 delivery (songbird brain)
          • Serotype-independent AAV vector purification
          • Customer data: Purified AAV2 delivery (mouse brain)
        • AAV videos
          • AAV2 purification video
          • AAVpro "All Serotypes" protocols
        • AAVpro Concentrator overview
      • Adenovirus
        • Adenoviral FAQs
        • Adenoviral products
        • Fastest, easiest adenoviral system ever
        • Tet-inducible adenovirus
        • Adenovirus purification kits
        • Adenovirus purification mega-scale
        • Adenovirus rapid titer
        • Adenoviral titration
    • T-cell transduction and culture
      • Technology overview
      • Adoptive T-cell therapy (ACT)
      • RetroNectin FAQs
      • Hematopoietic cell transduction
      • T-cell expansion
      • Serum-free T-cell culture
      • CultiLife culture bags protocol
      • Cytokine analysis
    • Inducible systems
      • iDimerize systems
        • Inducible protein-protein interactions—iDimerize systems
        • iDimerize systems journal club
        • iDimerize in vivo protocol
        • iDimerize systems citations
        • ARGENT cell signaling regulation kits from ARIAD
      • ProteoTuner systems
        • ProteoTuner technology overview
        • ProteoTuner citations
        • Inducible protein stabilization systems product selection guide
      • Tet-inducible systems
        • Tet systems product selection guide
        • Tet systems overview
        • Tet-One technology overview
        • Tet-On 3G plasmid system kit components
        • Tet-On 3G lentiviral system kit components
        • Tet system webinars
    • Transfection reagents
      • Xfect Transfection Reagent
      • Transfection tips
      • Cell lines transfected with the Xfect reagent
      • Protein transfection FAQs
      • Xfect RNA Transfection Reagent
      • Transfection reagent products
      • Protocol: transfection of cerebellar slice cultures with Xfect reagent
      • Calcium phosphate transfection of neurons
    • Fluorescent proteins
      • Fluorescent protein vector finder
      • Fluorescent protein excitation and emission maxima
      • Fluorescent protein antibodies selection guide
      • Fluorescent protein antibody citations
        • GFP antibody citations
        • RFP antibody citations
      • Fluorescent protein quick guide
      • Fluorescent retroviral expression vectors
      • Verify miRNA expression
      • Find EGFP vector alternatives
    • Cell biology assays
      • Technical notes
        • Cell viability and proliferation measurement
      • Citations
  • Stem cell research
    • Overview
      • Stem cell research products
      • Stem cell media products
      • Hepatocyte products
      • iPS cell to hepatocyte differentiation overview
    • Protocols
      • Hepatocytes
        • Video protocols for hiPS-HEP v2 cells
        • Getting started with hepatocyte differentiation
      • Pluripotent stem cells
        • Single-cell cloning with DEF-CS 500 Culture System
        • Transferring iPSCs on MEFs to DEF-CS
        • Transferring iPSCs from other media to DEF-CS
        • Spin embryoid body formation
        • Reprogramming PBMCs
        • Reprogramming fibroblasts
      • Cardiomyocytes
        • Cardiomyocytes in FLIPR 384-well plate format
        • Cardiomyocytes on the Patchliner system
        • Cardiomyocytes on the Maestro MEA system
        • Cardiomyocytes on the MED64 MEA system
        • Cardiomyocytes on the CardioExcyte 96 system
        • Cardiomyocytes on the xCELLigence RTCA CardioECR system
    • Applications
    • Technical notes
      • Pluripotent stem cells
        • Using the DEF-CS system to culture human iPS cells
        • Comparison of the Cellartis DEF-CS system with other vendors' human iPS cell culture systems
        • Reprogramming PBMCs
        • Reprogramming fibroblasts
      • Gene editing in hiPS cells
        • Tagging an endogenous gene with AcGFP1 in hiPS cells
        • Tagging an endogenous gene with a myc tag in hiPS cells
        • Generating clonal hiPS cell lines deficient in CD81
        • Introducing a tyrosinemia-related SNP in hiPS cells
        • Inserting an expression cassette into the AAVS1 locus in hiPS cells
        • Editing hiPS cells using electroporation
        • Editing hiPS cells using gesicle technology
        • Single-cell cloning of hiPS cells
      • Organoids
        • Retinal organoid differentiation from iPSCs cultured in the Cellartis DEF-CS 500 Culture System
        • Liver organoid differentiation from iPSCs for prediction of drug-induced liver injury
        • Generation of embryonic organoids using NDiff 227 neural differentiation medium
      • Beta cells
        • Beta cells for disease modeling
      • Hepatocytes
        • hiPS-HEP cells for disease modeling
        • hiPS-HEP cells for drug metabolism studies
        • Power medium for long-term human primary hepatocyte culture
        • iPS cell to hepatocyte differentiation system
      • Cardiomyocytes
        • Making engineered heart tissue with cardiomyocytes
      • Neural stem cells
        • RHB-A neural stem cell medium
    • Posters
    • Webinars
      • Using hiPS gene editing to create a tyrosinemia disease model
    • Videos
      • Hepatocyte offerings
    • FAQs
      • Cellartis DEF-CS 500 Culture System FAQs
      • Cellartis enhanced hiPS-HEP FAQs
      • Cellartis iPS Cell to Hepatocyte Differentiation System FAQs
    • Citations
      • Cellartis MSC Xeno-Free Culture Medium
      • Cellartis Power Primary HEP Medium
      • Cellartis DEF-CS 500 Culture System
      • Cellartis Enhanced hiPS-HEP cells
      • Cellartis hPS cell-derived cardiomyocytes
      • Cellartis iPS Cell to Hepatocyte Differentiation System
      • GS1-R
      • 2i mES/iPSC medium
      • iMatrix-511
      • 3i mES/iPSC medium
      • NDiff 227
      • NDiff N2
      • RHB-A
      • STEM101
      • STEM121
      • STEM123
    • Selection guides
      • Stem cell antibody selection guide
      • Stem cell media product finder
      • Stem cell tools product finder
      • Hepatocyte product finder
  • Protein research
    • Capturem technology
      • Capturem protocols
      • Capturem tech notes and applications
      • FAQs about Capturem technology
      • Capturem technology citations
      • Capturem posters
    • Antibody immunoprecipitation
      • IP and Co-IP of cardiac voltage-gated ion channel proteins
      • Tech note: thiophilic antibody purification resins
      • Tech note: IP and Co-IP
    • His-tag purification
      • Purification methods overview
      • TALON resin selection guide
      • Selection guide: His60 resin
      • xTractor Buffer is optimized for superior protein yield
      • Why tag a protein?
      • Tech note: cobalt resin
      • Simplified purification of active, secreted his-tagged proteins
      • Overview: His60
      • Tech note: Capturem technology
      • Tech note: Capturem large volume
      • Magnetic beads
      • FAQs: TALON
      • Protocols
        • Video: Capturem his maxiprep
        • Video: Capturem his miniprep
        • Visual protocol: Capturem his maxiprep
        • Visual protocol: Capturem his miniprep
        • Capturem nickel column reagent compatibility
        • TALON reagent compatibility
        • His60 reagent compatibility
        • TALON: Native vs denaturing purification
        • Protocol: denaturing purification with TALON resin, imidazole elution
        • Protocol: native purification with TALON resin, imidazole elution
        • Protocol: native purification with TALON resin, pH elution
    • Other tag purification
      • Streptavidin-based enrichment using Capturem technology
      • Selection guide: peptide tags
      • Myc-tagged protein purification overview
      • GST-tagged protein purification overview
    • Phosphoprotein and glycoprotein purification
      • Non-tagged protein purification overview
      • Phosphoprotein purification overview
    • Matchmaker Gold yeast two-hybrid systems
      • Matchmaker Gold Yeast Two-Hybrid System
      • Make your own library for yeast two-hybrid screening
      • Mate and Plate yeast two-hybrid cDNA libraries
      • Aureobasidin A for improved selectable drug resistance in yeast
    • Expression systems
      • Protein expression overview
      • Insect expression overview
      • Mammalian expression overview
      • pHEK293 Ultra expression overview
      • OKT3 expression in mammalian cells
      • Bacterial expression overview
  • PCR
    • Citations
      • PrimeSTAR HS
      • EmeraldAmp MAX
      • Terra PCR Direct
      • EmeraldAmp GT
      • Takara Ex Taq
      • PrimeSTAR Max
      • PrimeSTAR GXL
      • Takara LA Taq
      • SpeedSTAR HS
      • Takara Taq and Taq HS
      • Titanium Taq
    • PCR selection guide
    • PCR enzyme brochure
    • Technical notes
      • Rapid, high-performance multiplex PCR
      • EmeraldAmp outperforms MyTaq Red mix
      • Fast and accurate PCR
      • Methylation studies
      • Hot-start PCR
      • Long-range PCR with LA Taq
      • Direct PCR from human nail
      • Direct PCR from meat samples
      • Megaprimer PCR with PrimeSTAR GXL
      • Amplifying GC-rich templates
      • Titanium Taq for high-throughput genotyping
      • Colony PCR in under an hour
      • High-throughput endpoint PCR
      • Direct PCR from blood
      • PrimeSTAR GXL for targeted sequencing
      • Detecting somatic mosaicism using massively parallel sequencing
    • FAQ
      • Primer design
      • Optimization
      • Troubleshooting
      • Applications and conditions
      • Shipping, storage, and handling
    • Go green with lyophilized enzymes
  • Cloning
    • Automated In-Fusion Cloning
    • In-Fusion Cloning general information
      • In-Fusion Cloning overview
      • In-Fusion Cloning guide
      • In-Fusion Cloning and competition
        • In-Fusion Snap Assembly vs. GeneArt Gibson Assembly HiFi
        • In-Fusion Snap Assembly vs. NEBuilder HiFi
        • Sequence accuracy in seamless cloning
        • Choosing a seamless cloning method
        • Improving background over Gibson Assembly
        • A successful alternative to ligation cloning
        • Single- and multiple-insert cloning
        • Easy cloning into lentiviral vectors
        • Outperforming TOPO cloning
      • In-Fusion Cloning citations
      • Stellar Competent Cells product overview and performance data
      • EcoDry reagents and sustainability
    • Primer design and other tools
      • Seamless cloning primer design
      • In-Fusion Cloning tutorials
        • Cloning one or more fragments
        • Deleting a sequence
        • Inserting a sequence
        • Deleting and replacing a sequence
      • In-Fusion molar ratio calculator
      • Simulate your construct
    • In‑Fusion Cloning tips and FAQs
    • Applications and technical notes
      • In-Fusion Cloning applications collection
      • Efficient multiple-fragment cloning
      • Mutagenesis with In-Fusion Cloning
      • Rapid, high-throughput cloning for antibody development
      • Solve a synthesis challenge with easy multiple-insert cloning
      • Direct cloning into large vectors
      • Simplified insertion of a GFP-encoding cassette into a 100-kb plasmid
      • Efficient cloning for sgRNA/Cas9 plasmids
      • In-Fusion Cloning of sgRNAs
      • De novo insertion of small fusion protein tags
    • Sign up to stay updated
    • Traditional molecular cloning
      • Restriction enzyme overview
        • General information about restriction enzymes
        • Star activity of restriction enzymes
        • Inactivation of restriction enzymes
        • Buffer activity with restriction enzymes
        • Universal buffers for double digestion with restriction enzymes
        • Restriction enzymes affected by methylation
        • Methylation-sensitive restriction enzymes
        • QC of restriction enzymes
      • Ligation cloning overview
      • Ligation product guide
    • In-Fusion Cloning webinars
  • Nucleic acid purification
    • Nucleic acid extraction webinars
    • Product demonstration videos
    • Product finder
    • Plasmid kit selection guide
    • Plasmid purification
      • NucleoSpin Plasmid Transfection-grade
      • NucleoSnap Plasmid Midi
      • NucleoBond Xtra Midi/Maxi
      • EF plasmid purification overview
    • Genomic DNA purification
      • Tissue
        • NucleoSpin Tissue
        • NucleoSpin DNA RapidLyse
        • NucleoSpin Tissue XS
        • NucleoMag DNA Swab
        • NucleoBond HMW DNA
        • NucleoSpin DNA Lipid Tissue
      • FFPE
        • NucleoMag DNA FFPE
        • NucleoSpin 96 DNA FFPE
      • Blood and plasma
        • NucleoMag Blood
        • NucleoSnap cfDNA
        • NucleoSpin cfDNA XS
        • NucleoMag cfDNA
        • NucleoSpin cfDNA Midi
      • Plant
        • NucleoMag 384 Plant
        • NucleoSpin Plant II
      • Other organisms and samples
        • NucleoMag DNA Bacteria
        • NucleoMag DNA Food
        • NucleoSpin DNA Insect
        • NucleoSpin Microbial DNA
    • DNA/RNA cleanup and extraction
      • NucleoSpin Gel and PCR XS product overview
      • NucleoSpin Gel and PCR Clean-up
      • NucleoMag NGS Clean-up and Size Select
      • NucleoSpin RNA Clean-up
    • RNA purification
      • RNA purification overview
      • NucleoSpin RNA Plus
      • NucleoSpin RNA Plus XS
      • NucleoSpin RNA
      • NucleoProtect RNA
      • NucleoSpin miRNA
      • NucleoSpin RNA Blood
      • NucleoSpin miRNA Plasma
      • NucleoZOL for RNA isolation
      • High-throughput RNA isolation from FFPE samples
      • NucleoSpin RNA Plant and Fungi
    • RNA purification kit finder
    • Viral DNA and RNA purification
      • Nucleic acid detection from wastewater
      • NucleoSpin VET
      • NucleoMag VET
      • NucleoMag Virus
      • NucleoMag Pathogen
    • Parallel DNA, RNA & protein
      • NucleoSpin RNA/Protein
      • NucleoSpin TriPrep
    • Automated DNA and RNA purification
      • Application notes, by sample type
        • Plasmid purification
        • DNA cleanup
        • DNA from cells and tissues
        • RNA from cells and tissues
        • DNA from blood and bodily fluids
        • DNA and RNA from plants, fungi, and food
        • Viral RNA and DNA purification
      • Application notes, by platform
        • Eppendorf
        • Tecan
        • Hamilton
        • Thermo Fisher Scientific
        • MASMEC Biomed
    • Accessory selection guides
      • Sample homogenization beads
      • NucleoBond racks selection guide
    • Microbiome
      • NucleoMag DNA Microbiome
      • NucleoSpin Soil
      • NucleoSpin 8/96 Soil
      • NucleoSpin DNA Stool
      • NucleoSpin 96 DNA Stool
      • NucleoMag DNA/RNA Water
      • NucleoSpin eDNA Water
      • NucleoBond RNA Soil
      • NucleoSpin RNA Stool
  • Antibodies and ELISA
    • Osteocalcin focus
  • Real-time PCR
    • Download qPCR resources
      • Download our mycoplasma white paper
      • Download our mycoplasma infographic
    • Overview
      • One-step RT-qPCR kits
      • Two-step RT-qPCR kits
      • Probe-based qPCR kits
      • TB Green-based qPCR kits
      • Videos: qPCR analysis for challenging inputs
      • qRT-PCR to detect RNA present at low levels
    • Product finder
    • Reaction size guidelines
    • Real-time PCR products brochure
    • Real-time PCR tutorial videos
    • Guest webinar: extraction-free SARS-CoV-2 detection
    • Technical notes
      • Unbiased preamplification of limited samples
      • Accurate gene expression analysis with TB Green Premix Ex Taq II
      • Efficient quantification of human gene expression with PrimeScript Reverse Transcriptase
      • Rapid qPCR analysis from blood samples using PrimeScript Reverse Transcriptase
      • Monitoring siRNA knockdown
      • qPCR without optimization using TB Green Premix Ex Taq
      • Fast synthesis of cDNA templates for real-time RT-PCR
      • Specific, consistent real-time PCR with TB Green Premix Ex Taq II
      • Mir-X microRNA quantification
    • FAQs
New products
Need help?
Contact Sales
CogentAP resource portal CogentAP product docs & additional resources
CogentAP software page Cogent NGS Analysis Pipeline software

Cogent NGS Analysis Pipeline v1.0 User Manual

(Last updated: 01-Oct-2020)

Cogent NGS Analysis Pipeline (CogentAP) is bioinformatic software for analyzing RNA-seq NGS data generated using the following systems or kits:

  • ICELL8 cx Single-Cell System or the ICELL8 Single-Cell System on the single-cell full-length transcriptome (SMART-Seq ICELL8 workflow)
  • ICELL8 cx Single-Cell System or the ICELL8 Single-Cell System on the single-cell differential expression (3′ DE or 5′ DE) workflows (ICELL8 3′ DE or ICELL8 TCR)
  • SMARTer Stranded Total RNA-Seq Kit v3 - Pico Input Mammalian

The program takes input data from sequencing and outputs an HTML report, with results typical to single-cell analysis, plus other files, such as a gene matrix, to continue further analysis. R data object with pre-computed results based on recommended parameters are also output. Either the standard output files or the R data object can serve as input for Cogent NGS Discovery Software (CogentDS), another bioinformatic software package provided by Takara Bio.

CogentAP software is written in Python and can be run either via a GUI or command-line interface.

I. Before you begin

A. Supported operating systems

CogentAP software is designed to be installed on a server running Linux. The following versions of Linux have been tested and are supported for use with CogentAP software:

  • CentOS 6.9 & 6.10
  • RedHat 7.5
  • Ubuntu 17

B. Hardware requirements

For analyzing the output of Illumina NextSeq® High-Output sequencing data analysis, the following server requirements (or better) are recommended:

  • CPU: 24-cores
  • RAM: 64 GB
  • Free hard drive space: 500 GB

Testing was also done on MiniSeq™, MiSeq®, HiSeq®, and NovaSeq™ datasets.

  • MiniSeq or MiSeq—less computational power may be needed than the specifications described above for NextSeq output
  • HiSeq or NovaSeq—requires more computational power than described above for NextSeq output

Precise hardware requirements were not determined for output from these datasets. Support for performance issues of the servers in conjunction with these dataset types may be limited.

C. User account requirements

CogentAP software can be installed in two different access scenarios; the user account requirement depends on which scenario you wish to implement in your environment.

  • For use by a single user (single Linux username/account), CogentAP software can be installed and run by any account type with install and executable privileges (regular or root access).
  • For use by multiple users (accounts), CogentAP software can either be installed singly across the multiple accounts (regular or root access) or installed for system-wide access by an administrator with root access.

D. Additional hardware and software dependencies and recommendations

  • ICELL8 cx Single-Cell System or ICELL8 Single-Cell System
  • Bash UNIX shell
  • Internet connectivity on the server
    The installation process requires Internet connectivity as it sources scripts from GitHub, Bioconda, and CRAN and downloads genome information from Ensembl. Please ensure that internet connectivity is available on the UNIX server while installing.
  • Conda
    CogentAP software leverages the open-source package manager Conda for installation of the pipeline and its dependencies. Any tools and applications required by the pipeline are installed through Conda inside a local environment created specifically for CogentAP software.

    If Conda is not currently installed on the server, instructions to do so can be found at https://conda.io/miniconda.html. We recommend installation of the lightweight version for Python 3.7+ (typically the 64-bit bash installer), also called Miniconda3 (version 4.8.2 or later).
  • bcl2fastq
    CogentAP takes as input raw-fastq files that must be converted from the sequencer output FASTQ files using the conversion software bcl2fastq.

    If you're not sure where to find bcl2fastq in your environment, it can be downloaded from https://support.illumina.com/sequencing/sequencing_software/bcl2fastq-conversion-software.html.
  • Keyboard, monitor, and mouse directly into the server, or a remote access program
    CogentAP software must be run on the Linux server. If users do not have direct console access, a remote access program that enables a Virtual Network Computing (VNC) connection is required through a program such as RealVNC (realvnc.com), TightVNC (tightvnc.com), TigerVNC (tigervnc.org), or similar.

    For more information on VNC, along with other VNC clients that can be used, please see the Wikipedia entry at https://en.wikipedia.org/wiki/Virtual_Network_Computing.

E. Required input files

  • Paired-end read FASTQ files
  • Sample description file

    • Well-list file—a well-selection text file derived from CellSelect software that contains well-level sample information. A typical CellSelect well-list file should have no more than 1,728 wells.
    • Well-list-like format file—a text file that contains sample information, including columns "Barcode" and "Sample". Each column name is case sensitive. The "Barcode" column contains i7 and i5 indexes concatenated with a plus-sign ("+") (e.g., TAGCGAGT+CCGTTGCG).

II. Software overview

Figure 1

Figure 1. High-level analysis workflow of CogentAP and how its output can be carried over to CogentDS.

CogentAP software consists of two main parts, the demultiplexer (demuxer) and the analyzer.

  • The demultiplexer extracts the barcode from the sequencing data (based on the protocol) and writes it into FASTQ files at the end of the read name. There are two options:

    • The default leaves the barcode-assigned reads in combined FASTQ files (which saves time during subsequent analysis)
    • The second option splits the data up into barcode-level FASTQ files which can be carried over to other analysis pipelines
  • The analyzer takes the data sent to it by the demultiplexer and performs the following functions:

    • Read trimming (using Cutadapt)
    • Genome alignment (using the STAR aligner)
    • Calculating Unique Start Stop (USS) positions (using the samtools and bedtools, only for UMI-enabled kits)
    • Counting (using Subread)
    • Summarization (using custom scripts)
    • Generating an HTML report (using CogentDS)

The demultiplexer and analyzer can be invoked using a graphical user interface (GUI), called the CogentAP launcher, or can be run on the command line.

III. Installation & configuration options

The CogentAP software is available for download on the ICELL8 software portal at takarabio.com/ICELL8-software. An email will be sent to you with information on downloading the install script and a unique password required to run the installation on your server.

A. Verifying Conda installation

The following steps can be used to verify that Conda is installed properly on the server.

  1. Type the following command in at the prompt in any directory location on the server:

    conda -V

    If Conda is successfully installed, it should return text with the version number.

    e.g.,
    conda 4.8.2

  2. Check to see if the base Conda environment can be activated.

    1. Type the following command into the command-line Linux prompt on the server:

      source activate

      Or, if that returns an error, try:

      conda activate

      A successful Conda install will result in a change in the prompt, as shown in Figure 2.

      figure 2

      Figure 2. High-level screenshot of the Linux command line showing a successful check of the base Conda environment.

    2. If Conda is successfully installed and the prompt changed as displayed in Figure 2, type the following command to return to the default Linux prompt:

      source deactivate

      or

      conda deactivate

      This command will take the user back to the Linux prompt and out of the Conda environment.

  3. Installation of Miniconda3 typically adds the location of its installation to the user's system environment. This is also required for the successful installation of CogentAP software.

    The following steps can be used to verify that the Conda $PATH is configured correctly.

    1. Open the file .bash_profile, which will be located in the home directory (for an individual user account):

      more ~/.bash_profile

    2. Verify a line similar to the following is showing in the file:

      export PATH="/home/<USERNAME>/miniconda3/bin:$PATH"

      where <USERNAME> is replaced by the username of the account that installed Conda.

      e.g., username is ‘myacct':

      export PATH="/home/myacct/miniconda3/bin:$PATH"

      If the line isn't displaying or the .bash_profile file does not exist, it will need to be manually created and populated. For more information on setting an environment variable, see a UNIX user manual or a forum post like https://stackoverflow.com/a/7502128.

B. Installation

  1. Download the installation script (takarabio_Linux64_installer.sh), following the directions on the thank you page or the email sent after submitting the sign-up form on the CogentAP product page.
  2. Move or copy the installation script onto the Linux server into the directory location where you want to install the CogentAP software.

    NOTE: The account logged into while doing the installation must have read/write privileges to the install directory chosen.

  3. From the same directory location in Step 2, run the following command:

    bash takarabio_Linux64_installer.sh CogentAP <PASSWORD>

    <PASSWORD> will be replaced by the unique password included in the sign-up confirmation email.

    NOTE: The installation process at this point will take anywhere from 90 minutes to 3 hours to complete, depending on the computational capacity of the server and the download speed of the internet connection. No further user input is required until the install is complete. The installation procedure may be left to run overnight, with the user not present at the console. If desired, the user can work in another terminal window simultaneous to the install.

    Once the installation is complete, a message will display saying CogentAP software is ready to use.

    figure 3

    Figure 3. Text to console illustrating a successful CogentAP software install on the Linux server.

    There should be a folder named CogentAP in the directory into which the software was installed (Step 2) which contains files and directories required by the pipeline’s scripts.

    figure 4

    Figure 4. The sub-directory and files list of the CogentAP/ folder post-install.

C. (Optional) Set up a $COGENT_AP_HOME environmental variable

For ease of use, it's recommended that the CogentAP software install directory location be added to the .bash_profile as a permanent environmental variable.

Example:

If your account name is 'myacct', the absolute pathname for myacct's home directory is /home/myacct, and CogentAP software was installed in the ~/bin directory, edit .bash_profile to add the following line:

export COGENT_AP_HOME=/home/myacct/bin/CogentAP

Once added to the profile, you will either need to log out and back in to the account, or load the file in with the following command:

source ~/.bash_profile

The phrase $COGENT_AP_HOME can then be used as an alias shortcut to reference /home/myacct/bin/CogentAP.

Example:

Running the following command logged in as 'myacct':

cd $COGENT_AP_HOME

will change directories to ~/bin/CogentAP.

NOTE: Subsequent references to $COGENT_AP_HOME in this document refer to the full path where CogentAP software is installed.

D. Updating the pipeline scripts

To update CogentAP, run the following two commands in sequence:

cd $COGENT_AP_HOME

sh CogentAP_setup.sh update

E. Add a genome build

The human genome (Ensembl hg38 release 94) is built as default with the pipeline, but the genomes of other species can be loaded into the software post-install. To do so, download the following two files for the genome of interest:

    1. The FASTA file containing all the sequences (chromosomes and contigs)
    2. The GTF file containing the annotation, importantly the gene information for analysis
    3. Run the script:

      $COGENT_AP_HOME/cogent add_genome \
      -g <common_species_name> \
      -f <FASTA_FILENAME> \
      -a <GTF_FILENAME>

      Where <common_species_name> is the name of the genome being added (e.g., fruitfly) and the <FASTA_FILENAME> and <GTF_FILENAME> are the exact file names and path(s) for the FASTA and GTF files, respectively, on the server. For additional help with this script, type:

      $COGENT_AP_HOME/cogent add_genome -h

Example, using the fruit fly genome from Ensembl.org (https://uswest.ensembl.org/info/data/ftp/index.html):

      1. Download the FASTA file (Drosophila_melanogaster.BDGP6.dna.chromosome.2L.fa) using FTP:

        wget ftp://ftp.ensembl.org/pub/release-94/fasta/drosophila_melanogaster/dna/Drosophila_melanogaster.BDGP6.dna.toplevel.fa.gz

        or copy and paste the URI:

        ftp://ftp.ensembl.org/pub/release-94/fasta/drosophila_melanogaster/dna/Drosophila_melanogaster.BDGP6.dna.toplevel.fa.gz

        into the address bar of a web browser.
      2. Download the GTF file (Drosophila_melanogaster.BDGP6.94.gtf):

        wget ftp://ftp.ensembl.org/pub/release-94/gtf/drosophila_melanogaster/Drosophila_melanogaster.BDGP6.94.gtf.gz

        or copy and paste:

        ftp://ftp.ensembl.org/pub/release-94/gtf/drosophila_melanogaster/Drosophila_melanogaster.BDGP6.94.gtf.gz

        into the address bar of a web browser.
      3. If the downloaded files are stored in ~/ensembl directory in "myacct", run the following script:

        $COGENT_AP_HOME/cogent add_genome \
        -g fruitfly \
        -f ~/ensembl/Drosophila_melanogaster.BDGP6.dna.chromosome.2L.fa \
        -a ~/ensembl/Drosophila_melanogaster.BDGP6.94.gtf

NOTE: As FASTA and GTF files are a standard file format, files from any source, should work with this script. However, this script has only been tested on genomes downloaded from Ensembl. If a problem is encountered using files from another source it is recommended to try the import using the Ensembl file versions of the genome.

F. Uninstall

CogentAP software can be uninstalled by deleting the CogentAP/ directory defined as $COGENT_AP_HOME from the server.

If $COGENT_AP_HOME has been defined in .bash_profile, edit the file to remove the reference to $COGENT_AP_HOME as well.

IV. Running the pipeline

Before running an analysis, raw-fastq files need to be generated from the sequencer-output FASTQ files. Once those are created, CogentAP can be run in either of two ways:

  1. Using a graphical user interface (GUI) (Section IV.B)
  2. On the command line (CLI) (Section IV.C)

A. Generating raw-fastq files

CogentAP demultiplexer takes two, one-paired raw-fastq files as input (not split by barcode).

  1. Log in to the server that stores the sequencing run output folder. This server will typically have the program bcl2fastq installed (see Section I.D for more information about bcl2fastq).
  2. Change to the directory where you want to generate the raw-fastq files.
  3. Run bcl2fastq with the following syntax:

    bcl2fastq -R <RUN_FOLDER> \
    -o <RUN_ID> \
    --no-lane-splitting \
    --sample-sheet $COGENT_AP_HOME/config/SampleSheet_dummy.csv > <RUN_ID>.stdout \
    2> <RUN_ID>.stderr

    where

    • <RUN_FOLDER> is the path to the sequencing run folder
    • <RUN_ID> is the ID automatically generated by Illumina sequencer

    The file SampleSheet_dummy.csv is stored in the CogentAP config folder, under the main install folder.

    NOTE: Recent versions of bcl2fastq (2.17 and higher) have a bug where the indexes required for demultiplexing will not be inserted into the raw-fastq if a samplesheet is not specified in the command syntax. We recommend using the SampleSheet_dummy.csv file provided for every run of the bcl2fastq command to generate the raw-fastq files in order to prevent encountering that issue.

  4. Retrieve the raw-fastq files. These are typically named Undetermined_*.fastq.gz and are generated in the <RUN_ID> folder in the working directory from Step 2.

    NOTE: It is recommended that the raw-fastq files are moved to a directory on the server on which CogentAP is installed in order to reduce processing time.

B. GUI

The CogentAP GUI is a program called CogentAP_launcher. Since it's graphical, it should be accessed by connecting to the Linux server using a remote access tool like VNC (see Section I.D, above).

  1. Connect to the Linux console where CogentAP software is installed, either remotely or via a graphical interface on the server.
  2. Once connected, navigate using the file browser functionality to the folder in which CogentAP software is installed. In Figure 5, the CogentAP install folder is located on the user's desktop.

     

    figure 5

    Figure 5. The CogentAP/ folder on the desktop, opening it to highlight the CogentAP_launcher executable file.

  3. In the install folder, locate the file CogentAP_launcher and double-click on it to run. This will open the CogentAP GUI (Figure 6).

     

    figure 6

    Figure 6. High-level view of the CogentAP GUI.

    The launcher has four tabs: Run Information, Advanced Options, Demuxer Options, and Analyzer Options.

    1. The Run Information tab takes as input:

       

      • The workflow (default being 'Demultiplexer & Analyzer')
      • R1 and R2 raw-fastq files (generated in Section IV.A)
      • The well-list file (see Section I.E for more information about this file)
      • The protocol: full-length, 3′ DE, or 5′ DE
      • The genome to use for alignment (default: human hg38)
      • The directory location in which to output the results
      • The name for a new analysis folder, which will be created in the directory location specified above and will contain the analysis output files

      NOTE: Use the scrollbar on the right of the window to view all the fields.

    The other three tabs are intended for advanced users only, but their purposes are listed below:

    1. Advanced Options can be used to modify parameters such as the number of processor cores available for analysis, the number of mismatches to consider during barcode assignment, etc.
    2. Demuxer Options can be used to modify parameters on the demultiplexer process. The UMI length can be changed in this tab. Please input corresponding length to the kit used, such as SMARTer Stranded Total RNA-Seq Kit v3 - Pico Input Mammalian.
    3. Analyzer Options tab can be used to modify parameters of the analyzer's process such as normalizing method, skip trimming, etc.
  4. Once all the fields have been appropriately filled, click on [Start] to launch the pipeline. The process typically takes a few hours to run.
  5. Upon successful completion, the UI displays a "Finished" message as shown in the right panel of Figure 6 (above).

C. Command-line interface

NOTE: See Section V.B for an example of the full syntax for the command-line scripts.

CogentAP software starts from the main script, cogent, and has three subcommands: demux, analyze, and add_genome. The demux and analyze commands will be covered in this section.

In general, you would first run the demuxer (cogent demux) to obtain the demultiplexed FASTQ files. These files are then used as input to run the analyzer (cogent analyze) to obtain the output files (described in detail in Section VI). These scripts can be launched from any location (working directory) on the Linux server where CogentAP software is installed.

The full list of arguments can be accessed using the syntax:

cogent <COMMAND> -h

See examples for <COMMAND> values 'demux' and 'analyze' in Figures 7 and 8, below.

%COGENT_AP_HOME%/cogent demux -h

figure 7
Figure 7. The output of cogent demux -h at the command line.

%COGENT_AP_HOME%/cogent analyze -h

figure 8

Figure 8. The output of cogent analyze -h at the command line.

D. Processing time

The time taken by the pipeline will vary based on the hardware specifications of the server on which it is run and the size of the raw-fastq input files. During testing, a MiSeq run (~25M read pairs) typically took about 1–1.5 hr, and a NextSeq High Ouput run (~400M read pairs) typically took ~20–25 hr to complete.

NOTE: These baselines might be exceeded if the raw-fastq files are stored on a network device instead of locally on the server where CogentAP is installed.

V. Test dataset

A mini dataset file (test dataset) is included in the CogentAP distribution package; it can be found under the main installation folder in a sub-folder called $COGENT_AP_HOME/test/ (Figure 9, below). This dataset can be used to test the running of the pipeline end-to-end and will provide a sample of the output files. The output (report and stats only) from the test dataset are also included and can be found in $COGENT_AP_HOME/test/out_test/. This included output can be used to compare to the output of your test run to verify everything is working correctly.

NOTE: The mini dataset should not be used for inference purposes. Output statistics and plots will only be meaningful with a real dataset.

figure 9

Figure 9. The test/ folder under $COGENT_AP_HOME. The sample *.fastq.gz files and example output directory out_test/ can be found there.

A run using the test data can be started using either the UI launcher (Section A) or the command line (Section B).

The test run using either method should take ~5–10 min to complete successfully.

A. Test data through CogentAP launcher UI

figure 10

Figure 10. Configuration of the fields in the UI launcher for the test dataset files

The parameters defined in the CogentAP launcher fields in Figure 10 are listed below. The paths are written as $COGENT_AP_HOME being /usr/local/bin/.

Field name Input value
Workflow Demultiplexer & Analyzer
R1 Fastq /usr/local/bin/CogentAP/test/test_FL_R1.fastq.gz
R2 Fastq /usr/local/bin/CogentAP/test/test_FL_R2.fastq.gz
Well List File /usr/local/bin/CogentAP/test/99999_CogentAP
Protocol Full Length
Genome hg38
Output Directory /tmp/CogentAP_testing/
Output Folder install_test

Table 1. Parameter information for example CogentAP launcher run of the provided test dataset in Figure 10.

Once completed, the output folder (install_test/) will be located under the directory specified as 'Output Directory' (/tmp/CogentAP_testing/).

B. Test data through the command-line interface

  1. Run the demultiplexer script.

     

    $COGENT_AP_HOME/cogent demux \
    -i $COGENT_AP_HOME/test/test_FL_R1.fastq.gz \
    -p $COGENT_AP_HOME/test/test_FL_R2.fastq.gz \
    -b $COGENT_AP_HOME/test/99999_CogentAP_test_selected_WellList.TXT \
    -t Full_length \
    -o Desktop/install_test \
    -n 8

    figure 11

    Figure 11. Example syntax for running the demux script on the provided test data via the command line.

  2. Once the demuxer is finished, run the data through the analysis script.

     

    $COGENT_AP_HOME/cogent analyze \
    -i $COGENT_AP_HOME/install_test/install_test_demuxed_R1.fastq \
    -p $COGENT_AP_HOME/install_test/install_test_demuxed_R2.fastq \
    -g hg38 \
    -o Desktop/install_test/analysis \
    -n 8
    -d Desktop/install_test/install_test_counts_all.csv
    -t Full_length

    figure 12

    Figure 12. Example syntax for running the demux script on the provided test data via the command line.

Once completed, the output folder (install_test/) will be located under the directory specified as 'Output Directory' (Desktop/ as part of the parameter -o Desktop/install_test).

VI. Output files

The pipeline produces output files that serve two purposes:

  1. Summarization of results using typical statistics and plots
  2. Facilitating further analyses using our interactive R kit, Cogent NGS Discovery Software (CogentDS), or any other tertiary analysis tool

A. Output folder structure

Both the CogentAP GUI launcher and CLI are designated to output to a folder specified in the run parameters, but the file structure within that directory is slightly different between the two methods.

  • The output folder from CogentAP launcher UI will resemble Figure 13:

     

    Figure 13

    Figure 13. Folders and files of the output directory by launcher UI.

  • The folder structure from the CLI will resemble Figure 14:

     

    Figure 14. Folders and files of the output directory by command-line interface.

B. HTML report

The HTML report is generated by the same report generator process as CogentDS using standard parameters and contains the example statistics and plots listed below. For complete details, please see the Cogent NGS Discovery Software User Manual.

  1. Experiment overview

     

    Figure 15

    Figure 15. Sample experiment overview section of the HTML report.

  2. Correlation analyses

     

    Figure 16

    Figure 16. Example correlation analyses section of the HTML report.

  3. Various counts like reads, genes, Mito, Ribo.

     

    Figure 17

    Figure 17. Example gene counts chart from the HTML report.

  4. Clustering tables

     

    Figure 18

    Figure 18. Example clustering tables from the HTML report.

C. Stats file

The Stats file is provided in CSV format and contains barcode-level statistics across the analysis pipeline. Starting from barcoded reads, it summarizes the number of reads after every step in the pipeline: for example, trimmed reads, mapped reads, exon/intron/intergenic reads, mitochondrial reads, ribosomal reads, etc. It also lists the number of genes detected per barcode. The columns shown in this file depends on the reagent kit used to generate the input data. As examples, you can see columns related to UMI when you use ICELL8_3DE_UMI or Strnd_UMI. The full list of possible columns in this file is described in the Appendix.

Figure 19

Figure 19. Example Stats file output.

D. Gene matrix file

The gene matrix file (sometimes called the gene table or counts matrix) is also in CSV format and contains gene counts for each barcode, with the genes in the rows and barcodes/cells in the columns. The file contains raw counts that can then be normalized and transformed using CogentDS. An example is shown below.

figure 20

Figure 20. Example of a gene matrix file.

E. Gene info file

The gene info file is a CSV-formatted file that contains the main annotation for the genes as described in the GTF file that is part of the genome build. It includes gene ID (used in CogentDS), Ensembl gene ID, gene symbol, and gene length (used for some normalizations).

figure 21

Figure 21. Example of a gene info file.

F. CogentDS.analysis.rda

During the generation of the HTML report, an R Data object, CogentDS.analysis.rda, is created with the results of various modules. This file can be used directly as input into CogentDS to perform further analysis, which saves on processing time in that tool.

G. Extra folder

The extras/ folder contains similar result files as B–F above, in the same format, but the data is calculated by also including intron regions of genes. Additionally, when used with a UMI enabled kit, results without UMI calculation are also stored in this folder. You can also use these files for tertiary analysis.

  • From the GUI, extras/ can be found in <output folder>/analysis
  • From the CLI, it will be in the folder specified by the value of the cogent analyze command output_dir (-o) argument

Appendix

Columns in Stats file

The tables below document all potential columns that might appear in the Stats output file (Section VI.C).

As mentioned in that section, not all stats files will include every column listed.

Columns that will be present in all Stats files output by CogentAP (input workflow agnostic).

Column nameDescription
Barcode Detected barcodes. This value will usually be the sample name from the well-list or well-list-like file, but there are three exceptions, documented in the table below.
Sample Sample names described in sample description file.
Barcoded_Reads Number of reads after demultiplexing.
Trimmed_Reads Number of remained reads after trimming.
Unmapped_Reads Number of reads not mapped to genome.
Mapped_Reads Number of reads mapped to genome.
Multimapped_Reads Number of reads mapped to multiple genomic locations.
Uniquely_Mapped_Reads Number of reads mapped to one genomic location. These reads are used for counting.
Exon_Reads Number of reads assigned to an exonic region.
Ambiguous_Exon_Reads Number of reads assigned to exonic regions of multiple genes.
Intron_Reads Number of reads assigned to an intronic region.
Ambiguous_Intron_Reads Number of reads assigned to intronic regions of multiple genes.
Gene_Reads Number of reads assigned to a gene region (exon + intron).
Intergenic_Reads Number of reads assigned to an intergenic region.
No_of_Genes Number of detected genes.
Mitochondrial_Reads Number of reads assigned to mitochondrial chromosome.
Ribosomal_Reads Number of reads assigned to a ribosomal gene.

The "Barcode" column, in addition to the samples named in the well-list or well-list-like file, will also have three additional rows, which are described in the following table.

Barcode field valueDescription
Short Number of reads containing N in barcode or having shorter length than barcode.
Unselected Number of reads having a barcode included in Chip’s description, but not included in sample description file.
Undetermined Number of reads having undetermined barcode.

Additional columns in the Stats file for 3′ DE analysis with UMIs on ICELL8 system.

The table below lists additional columns that will be present in the Stats file when the input FASTQ files result from the ICELL8 3′ DE for UMI Reagent Kit (Cat # 640005) workflow on the ICELL8 Single-Cell System (Cat # 640000).

Column name Description
No_of_UMIs Number of UMI variations detected after demultiplexing.
Exon_nUMIs Number of deduplicated reads assigned to an exonic region. Deduplication is done by UMI.
Intron_nUMIs Number of deduplicated reads assigned to an intronic region. Deduplication is done by UMI.
Gene_nUMIs Number of deduplicated reads assigned to a gene region (exon + intron). Deduplication is done by UMI.

Additional columns in Stats file for the SMARTer Stranded Total RNA-Seq Kit v3- Pico Input Mammalian protocol.

The table below lists additional columns that will be present in the Stats file when the input FASTQ files result from the SMARTer Stranded Total RNA-Seq Kit v3- Pico Input Mammalian protocol.

Column name Description
No_of_UMIs Number of UMI variations detected after demultiplexing.
Exon_nUMIs Number of deduplicated reads assigned to an exonic region. Deduplication is done by UMI.
Exon_nUSSs Number of deduplicated reads assigned to an exonic region. Deduplication is done by Unique Start&Stop Site (USS).
Exon_nUMIs_USSs Number of deduplicated reads assigned to an exonic region. Deduplication is done by both UMI and USS.
Intron_nUMIs Number of deduplicated reads assigned to an intronic region. Deduplication is done by UMI.
Intron_nUSSs Number of deduplicated reads assigned to an intronic region. Deduplication is done by Unique Start&Stop Site (USS).
Intron_nUMIs_USSs Number of deduplicated reads assigned to an intronic region. Deduplication is done by both UMI and USS.
Gene_nUMIs Number of deduplicated reads assigned to a gene region (exon + intron). Deduplication is done by UMI.
Gene_nUSSs Number of deduplicated reads assigned to a gene region (exon + intron). Deduplication is done by Unique Start&Stop Site (USS).
Gene_nUMIs_USSs Number of deduplicated reads assigned to a gene region (exon + intron). Deduplication is done by both UMI and USS.
Strand_Specificity Ratio of reads detected as correct strand after mapping to genome.

RNA-seq

Cogent NGS Analysis Pipeline

Analyze sequencing data generated by select Takara Bio applications.

Cogent NGS Discovery Software

Visualize sequencing data using the output from the Cogent NGS Analysis Pipeline.

SMART-Seq DE3 Demultiplexer

Demultiplex sequencing data from SMART-Seq mRNA 3′ DE into sorted read data files.

Takara Bio USA, Inc.
United States/Canada: +1.800.662.2566 • Asia Pacific: +1.650.919.7300 • Europe: +33.(0)1.3904.6880 • Japan: +81.(0)77.565.6999
FOR RESEARCH USE ONLY. NOT FOR USE IN DIAGNOSTIC PROCEDURES. © 2025 Takara Bio Inc. All Rights Reserved. All trademarks are the property of Takara Bio Inc. or its affiliate(s) in the U.S. and/or other countries or their respective owners. Certain trademarks may not be registered in all jurisdictions. Additional product, intellectual property, and restricted use information is available at takarabio.com.

Takara Bio

Takara Bio USA, Inc. provides kits, reagents, instruments, and services that help researchers explore questions about gene discovery, regulation, and function. As a member of the Takara Bio Group, Takara Bio USA is part of a company that holds a leadership position in the global market and is committed to improving the human condition through biotechnology. Our mission is to develop high-quality innovative tools and services to accelerate discovery.

FOR RESEARCH USE ONLY. NOT FOR USE IN DIAGNOSTIC PROCEDURES (EXCEPT AS SPECIFICALLY NOTED).

Support
  • Contact us
  • Technical support
  • Customer service
  • Shipping & delivery
  • Sales
  • Feedback
Products
  • New products
  • Special offers
  • Instrument & reagent services
Learning centers
  • NGS
  • Gene function
  • Stem cell research
  • Protein research
  • PCR
  • Cloning
  • Nucleic acid purification
About
  • Our brands
  • Careers
  • Events
  • Blog
  • Need help?
  • Announcements
  • Quality and compliance
  • That's Good Science!
Facebook Twitter  LinkedIn

logo strip white

©2025 Takara Bio Inc. All Rights Reserved.

Region - North America Privacy Policy Terms and Conditions Terms of Use

Top



  • COVID-19 research
  • Viral detection with qPCR
  • SARS-CoV-2 pseudovirus
  • Human ACE2 stable cell line
  • Viral RNA isolation
  • Viral and host sequencing
  • Vaccine development
  • CRISPR screening
  • Drug discovery
  • Immune profiling
  • Publications
  • Next-generation sequencing
  • Spatial omics
  • RNA-seq
  • DNA-seq
  • Single-cell NGS automation
  • Reproductive health
  • Bioinformatics tools
  • Immune profiling
  • Real-time PCR
  • Great value master mixes
  • Signature enzymes
  • High-throughput real-time PCR solutions
  • Detection assays
  • References, standards, and buffers
  • Stem cell research
  • Media, differentiation kits, and matrices
  • Stem cells and stem cell-derived cells
  • mRNA and cDNA synthesis
  • In vitro transcription
  • cDNA synthesis kits
  • Reverse transcriptases
  • RACE kits
  • Purified cDNA & genomic DNA
  • Purified total RNA and mRNA
  • PCR
  • Most popular polymerases
  • High-yield PCR
  • High-fidelity PCR
  • GC rich PCR
  • PCR master mixes
  • Cloning
  • In-Fusion seamless cloning
  • Competent cells
  • Ligation kits
  • Restriction enzymes
  • Nucleic acid purification
  • Automated platforms
  • Plasmid purification kits
  • Genomic DNA purification kits
  • DNA cleanup kits
  • RNA purification kits
  • Gene function
  • Gene editing
  • Viral transduction
  • Fluorescent proteins
  • T-cell transduction and culture
  • Tet-inducible expression systems
  • Transfection reagents
  • Cell biology assays
  • Protein research
  • Purification products
  • Two-hybrid and one-hybrid systems
  • Mass spectrometry reagents
  • Antibodies and ELISAs
  • Primary antibodies and ELISAs by research area
  • Fluorescent protein antibodies
  • New products
  • Special offers
  • OEM
  • Portfolio
  • Process
  • Facilities
  • Request samples
  • FAQs
  • Instrument services
  • Apollo services
  • ICELL8 services
  • SmartChip ND system services
  • Gene and cell therapy manufacturing services
  • Services
  • Facilities
  • Our process
  • Resources
  • Customer service
  • Sales
  • Make an appointment with your sales rep
  • Shipping & delivery
  • Technical support
  • Feedback
  • Online tools
  • GoStix Plus FAQs
  • Partnering & Licensing
  • Vector information
  • Vector document overview
  • Vector document finder
Takara Bio's award-winning GMP-compliant manufacturing facility in Kusatsu, Shiga, Japan.

Partner with Takara Bio!

Takara Bio is proud to offer GMP-grade manufacturing capabilities at our award-winning facility in Kusatsu, Shiga, Japan.

  • Automation systems
  • Shasta Single Cell System introduction
  • SmartChip Real-Time PCR System introduction
  • ICELL8 introduction
  • Next-generation sequencing
  • RNA-seq
  • Technical notes
  • Technology and application overviews
  • FAQs and tips
  • DNA-seq protocols
  • Bioinformatics resources
  • Webinars
  • Spatial biology
  • Real-time PCR
  • Download qPCR resources
  • Overview
  • Reaction size guidelines
  • Guest webinar: extraction-free SARS-CoV-2 detection
  • Technical notes
  • Nucleic acid purification
  • Nucleic acid extraction webinars
  • Product demonstration videos
  • Product finder
  • Plasmid kit selection guide
  • RNA purification kit finder
  • mRNA and cDNA synthesis
  • mRNA synthesis
  • cDNA synthesis
  • PCR
  • Citations
  • PCR selection guide
  • Technical notes
  • FAQ
  • Cloning
  • Automated In-Fusion Cloning
  • In-Fusion Cloning general information
  • Primer design and other tools
  • In‑Fusion Cloning tips and FAQs
  • Applications and technical notes
  • Stem cell research
  • Overview
  • Protocols
  • Technical notes
  • Gene function
  • Gene editing
  • Viral transduction
  • T-cell transduction and culture
  • Inducible systems
  • Cell biology assays
  • Protein research
  • Capturem technology
  • Antibody immunoprecipitation
  • His-tag purification
  • Other tag purification
  • Expression systems
  • Antibodies and ELISA
  • Molecular diagnostics
  • Interview: adapting to change with Takara Bio
  • Applications
  • Solutions
  • Partnering
  • Contact us
  • mRNA and protein therapeutics
  • Characterizing the viral genome and host response
  • Identifying and cloning protein targets
  • Expressing and purifying protein targets
  • Immunizing mice and optimizing vaccines
  • Pathogen detection
  • Sample prep
  • Detection methods
  • Identification and characterization
  • SARS-CoV-2
  • Antibiotic-resistant bacteria
  • Food crop pathogens
  • Waterborne disease outbreaks
  • Viral-induced cancer
  • Immunotherapy research
  • T-cell therapy
  • Antibody therapeutics
  • T-cell receptor profiling
  • TBI initiatives in cancer therapy
  • Cancer research
  • Kickstart your cancer research with long-read sequencing
  • Sample prep from FFPE tissue
  • Sample prep from plasma
  • Cancer biomarker quantification
  • Single cancer cell analysis
  • Cancer transcriptome analysis
  • Cancer genomics and epigenomics
  • HLA typing in cancer
  • Gene editing for cancer therapy/drug discovery
  • Alzheimer's disease research
  • Antibody engineering
  • Sample prep from FFPE tissue
  • Single-cell sequencing
  • Reproductive health technologies
  • Embgenix FAQs
  • Preimplantation genetic testing
  • ESM partnership program
  • ESM Collection Kit forms
  • Infectious diseases
  • Develop vaccines for HIV
Create a web account with us

Log in to enjoy additional benefits

Want to save this information?

An account with takarabio.com entitles you to extra features such as:

•  Creating and saving shopping carts
•  Keeping a list of your products of interest
•  Saving all of your favorite pages on the site*
•  Accessing restricted content

*Save favorites by clicking the star () in the top right corner of each page while you're logged in.

Create an account to get started

  • BioView blog
  • Automation
  • Cancer research
  • Career spotlights
  • Current events
  • Customer stories
  • Gene editing
  • Research news
  • Single-cell analysis
  • Stem cell research
  • Tips and troubleshooting
  • Women in STEM
  • That's Good Support!
  • About our blog
  • That's Good Science!
  • SMART-Seq Pro Biomarker Discovery Contest
  • DNA extraction educational activity
  • That's Good Science Podcast
  • Season one
  • Season two
  • Season three
  • Our brands
  • Our history
  • In the news
  • Events
  • Biomarker discovery events
  • Calendar
  • Conferences
  • Speak with us
  • Careers
  • Company benefits
  • Trademarks
  • License statements
  • Quality statement
  • HQ-grade reagents
  • International Contacts by Region
  • United States and Canada
  • China
  • Japan
  • Korea
  • Europe
  • India
  • Affiliates & distributors
  • Need help?
  • Privacy request
  • Website FAQs

That's GOOD Science!

What does it take to generate good science? Careful planning, dedicated researchers, and the right tools. At Takara Bio, we thoughtfully develop exceptional products to tackle your most challenging research problems, and have an expert team of technical support professionals to help you along the way, all at superior value.

Explore what makes good science possible

 Customer Login
 View Cart (0)
Takara Bio
  • Home
  • Products
  • Services & Support
  • Learning centers
  • APPLICATIONS
  • About
  • Contact Us
  •  Customer Login
  • Register
  •  View Cart (0)

Takara Bio USA, Inc. provides kits, reagents, instruments, and services that help researchers explore questions about gene discovery, regulation, and function. As a member of the Takara Bio Group, Takara Bio USA is part of a company that holds a leadership position in the global market and is committed to improving the human condition through biotechnology. Our mission is to develop high-quality innovative tools and services to accelerate discovery.

FOR RESEARCH USE ONLY. NOT FOR USE IN DIAGNOSTIC PROCEDURES (EXCEPT AS SPECIFICALLY NOTED).

Clontech, TaKaRa, cellartis

  • Products
  • COVID-19 research
  • Next-generation sequencing
  • Real-time PCR
  • Stem cell research
  • mRNA and cDNA synthesis
  • PCR
  • Cloning
  • Nucleic acid purification
  • Gene function
  • Protein research
  • Antibodies and ELISA
  • New products
  • Special offers
  • COVID-19 research
  • Viral detection with qPCR
  • SARS-CoV-2 pseudovirus
  • Human ACE2 stable cell line
  • Viral RNA isolation
  • Viral and host sequencing
  • Vaccine development
  • CRISPR screening
  • Drug discovery
  • Immune profiling
  • Publications
  • Next-generation sequencing
  • Spatial omics
  • RNA-seq
  • DNA-seq
  • Single-cell NGS automation
  • Reproductive health
  • Bioinformatics tools
  • Immune profiling
  • Real-time PCR
  • Great value master mixes
  • Signature enzymes
  • High-throughput real-time PCR solutions
  • Detection assays
  • References, standards, and buffers
  • Stem cell research
  • Media, differentiation kits, and matrices
  • Stem cells and stem cell-derived cells
  • mRNA and cDNA synthesis
  • In vitro transcription
  • cDNA synthesis kits
  • Reverse transcriptases
  • RACE kits
  • Purified cDNA & genomic DNA
  • Purified total RNA and mRNA
  • PCR
  • Most popular polymerases
  • High-yield PCR
  • High-fidelity PCR
  • GC rich PCR
  • PCR master mixes
  • Cloning
  • In-Fusion seamless cloning
  • Competent cells
  • Ligation kits
  • Restriction enzymes
  • Nucleic acid purification
  • Automated platforms
  • Plasmid purification kits
  • Genomic DNA purification kits
  • DNA cleanup kits
  • RNA purification kits
  • Gene function
  • Gene editing
  • Viral transduction
  • Fluorescent proteins
  • T-cell transduction and culture
  • Tet-inducible expression systems
  • Transfection reagents
  • Cell biology assays
  • Protein research
  • Purification products
  • Two-hybrid and one-hybrid systems
  • Mass spectrometry reagents
  • Antibodies and ELISA
  • Primary antibodies and ELISAs by research area
  • Fluorescent protein antibodies
  • Services & Support
  • OEM
  • Instrument services
  • Gene and cell therapy manufacturing
  • Customer service
  • Sales
  • Shipping & delivery
  • Technical support
  • Feedback
  • Online tools
  • Partnering & Licensing
  • Vector information
  • OEM
  • Portfolio
  • Process
  • Facilities
  • Request samples
  • FAQs
  • Instrument services
  • Apollo services
  • ICELL8 services
  • SmartChip ND system services
  • Gene and cell therapy manufacturing
  • Services
  • Facilities
  • Our process
  • Resources
  • Sales
  • Make an appointment with your sales rep
  • Online tools
  • GoStix Plus FAQs
  • Vector information
  • Vector document overview
  • Vector document finder
  • Learning centers
  • Automation systems
  • Next-generation sequencing
  • Spatial biology
  • Real-time PCR
  • Nucleic acid purification
  • mRNA and cDNA synthesis
  • PCR
  • Cloning
  • Stem cell research
  • Gene function
  • Protein research
  • Antibodies and ELISA
  • Automation systems
  • Shasta Single Cell System introduction
  • SmartChip Real-Time PCR System introduction
  • ICELL8 introduction
  • Next-generation sequencing
  • RNA-seq
  • Technical notes
  • Technology and application overviews
  • FAQs and tips
  • DNA-seq protocols
  • Bioinformatics resources
  • Webinars
  • Real-time PCR
  • Download qPCR resources
  • Overview
  • Reaction size guidelines
  • Guest webinar: extraction-free SARS-CoV-2 detection
  • Technical notes
  • Nucleic acid purification
  • Nucleic acid extraction webinars
  • Product demonstration videos
  • Product finder
  • Plasmid kit selection guide
  • RNA purification kit finder
  • mRNA and cDNA synthesis
  • mRNA synthesis
  • cDNA synthesis
  • PCR
  • Citations
  • PCR selection guide
  • Technical notes
  • FAQ
  • Cloning
  • Automated In-Fusion Cloning
  • In-Fusion Cloning general information
  • Primer design and other tools
  • In‑Fusion Cloning tips and FAQs
  • Applications and technical notes
  • Stem cell research
  • Overview
  • Protocols
  • Technical notes
  • Gene function
  • Gene editing
  • Viral transduction
  • T-cell transduction and culture
  • Inducible systems
  • Cell biology assays
  • Protein research
  • Capturem technology
  • Antibody immunoprecipitation
  • His-tag purification
  • Other tag purification
  • Expression systems
  • APPLICATIONS
  • Molecular diagnostics
  • mRNA and protein therapeutics
  • Pathogen detection
  • Immunotherapy research
  • Cancer research
  • Alzheimer's disease research
  • Reproductive health technologies
  • Infectious diseases
  • Molecular diagnostics
  • Interview: adapting to change with Takara Bio
  • Applications
  • Solutions
  • Partnering
  • Contact us
  • mRNA and protein therapeutics
  • Characterizing the viral genome and host response
  • Identifying and cloning protein targets
  • Expressing and purifying protein targets
  • Immunizing mice and optimizing vaccines
  • Pathogen detection
  • Sample prep
  • Detection methods
  • Identification and characterization
  • SARS-CoV-2
  • Antibiotic-resistant bacteria
  • Food crop pathogens
  • Waterborne disease outbreaks
  • Viral-induced cancer
  • Immunotherapy research
  • T-cell therapy
  • Antibody therapeutics
  • T-cell receptor profiling
  • TBI initiatives in cancer therapy
  • Cancer research
  • Kickstart your cancer research with long-read sequencing
  • Sample prep from FFPE tissue
  • Sample prep from plasma
  • Cancer biomarker quantification
  • Single cancer cell analysis
  • Cancer transcriptome analysis
  • Cancer genomics and epigenomics
  • HLA typing in cancer
  • Gene editing for cancer therapy/drug discovery
  • Alzheimer's disease research
  • Antibody engineering
  • Sample prep from FFPE tissue
  • Single-cell sequencing
  • Reproductive health technologies
  • Embgenix FAQs
  • Preimplantation genetic testing
  • ESM partnership program
  • ESM Collection Kit forms
  • Infectious diseases
  • Develop vaccines for HIV
  • About
  • BioView blog
  • That's Good Science!
  • Our brands
  • Our history
  • In the news
  • Events
  • Careers
  • Trademarks
  • License statements
  • Quality and compliance
  • HQ-grade reagents
  • International Contacts by Region
  • Need help?
  • Website FAQs
  • BioView blog
  • Automation
  • Cancer research
  • Career spotlights
  • Current events
  • Customer stories
  • Gene editing
  • Research news
  • Single-cell analysis
  • Stem cell research
  • Tips and troubleshooting
  • Women in STEM
  • That's Good Support!
  • About our blog
  • That's Good Science!
  • SMART-Seq Pro Biomarker Discovery Contest
  • DNA extraction educational activity
  • That's Good Science Podcast
  • Season one
  • Season two
  • Season three
  • Events
  • Biomarker discovery events
  • Calendar
  • Conferences
  • Speak with us
  • Careers
  • Company benefits
  • International Contacts by Region
  • United States and Canada
  • China
  • Japan
  • Korea
  • Europe
  • India
  • Affiliates & distributors
  • Need help?
  • Privacy request
Takara Bio
  • Products
  • Services & Support
  • Learning centers
  • APPLICATIONS
  • About
  • Contact Us