FASTQ files explained

浏览所有产品

查看所有学习选项

研究

临床

药物

生物制药与制药应用

同行评审文献

查看所有公司信息

查看所有支持

所有信息学产品

客户访谈

特色新闻

TruSight Oncology 500可支持液体活检研究

更多新闻

客户访谈

特色新闻

客户访谈

特色新闻

客户访谈

特色新闻

回顾过去

TBD

仪器服务和咨询

更多工具

客户访谈

特色新闻

Illumina TruSight检测获突破性器械认定

更多新闻

非侵入性产前检查

医学遗传学教育

所有生殖健康内容

所有生殖健康产品

更多工具

客户访谈

特色新闻

教育是非侵入性产前检查的关键

所有遗传健康产品

客户访谈

特色新闻

造福罕见病和未确诊遗传病患者的研究进展

更多新闻

探索解决方案

FASTQ files explained

10/26/21

Illumina sequencing technology uses cluster generation and sequencing by synthesis (SBS) chemistry to sequence millions or billions of clusters on a flow cell, depending on the sequencing platform. During SBS chemistry, for each cluster, base calls are made and stored for every cycle of sequencing by the Real-Time Analysis (RTA) software on the instrument. RTA stores the base call data in the form of individual base call (or BCL) files. When sequencing completes, the base calls in the BCL files must be converted into sequence data. This process is called BCL to FASTQ conversion.

A FASTQ file is a text file that contains the sequence data from the clusters that pass filter on a flow cell (for more information on clusters passing filter, see the “additional information” section of this bulletin). If samples were multiplexed, the first step in FASTQ file generation is demultiplexing. Demultiplexing assigns clusters to a sample, based on the cluster’s index sequence(s). After demultiplexing, the assembled sequences are written to FASTQ files per sample. If samples were not multiplexed, the demultiplexing step does not occur, and, for each flow cell lane, all clusters are assigned to a single sample.

For a single-read run, one Read 1 (R1) FASTQ file is created for each sample per flow cell lane. For a paired-end run, one R1 and one Read 2 (R2) FASTQ file is created for each sample for each lane. FASTQ files are compressed and created with the extension *.fastq.gz.

What does a FASTQ file look like?

For each cluster that passes filter, a single sequence is written to the corresponding sample’s R1 FASTQ file, and, for a paired-end run, a single sequence is also written to the sample’s R2 FASTQ file. Each entry in a FASTQ files consists of 4 lines:

A sequence identifier with information about the sequencing run and the cluster. The exact contents of this line vary by based on the BCL to FASTQ conversion software used.
The sequence (the base calls; A, C, T, G and N).
A separator, which is simply a plus (+) sign.
The base call quality scores. These are Phred +33 encoded, using ASCII characters to represent the numerical quality scores.

Here is an example of a single entry in a R1 FASTQ file:

More detailed information on the FASTQ sequence file format can be found here.

How to view a FASTQ file

FASTQ files can contain up to millions of entries and can be several megabytes or gigabytes in size, which often makes them too large to open in a normal text editor. Generally, it is not necessary to view FASTQ files, because they are intermediate output files used as input for tools that perform downstream analysis, such as alignment to a reference or de novo assembly.

If you need to view a FASTQ file for troubleshooting purposes or out of curiosity, you will need either a text editor that can handle very large files, or access to a Unix or Linux system where large files can be viewed via the command line.

How to generate FASTQ files

FASTQ file generation is the first step for all analysis workflows used by MiSeq Reporter on the MiSeq and Local Run Manager on the MiniSeq. When analysis completes, the FASTQ files are located in <run folder>\Data\Intensities\BaseCalls on the MiSeq and <output folder>\Alignment_#\<subfolder>\Fastq on the MiniSeq.

For all runs uploaded to BaseSpace Sequence Hub, FASTQ file generation automatically occurs after the run is completely uploaded, and the FASTQ files are used as input for the various analysis apps on BaseSpace Sequence Hub. On BaseSpace Sequence Hub, you can find your FASTQ files in the project(s) associated with your run.

The bcl2fastq conversion software can be used to generate FASTQ files from data generated on all current Illumina sequencing systems.

For information on the different settings that can be applied during FASTQ file generation, see the software user guides below.

Additional information

A description and requirements for clusters to pass filter can be found in section 1.5.8 of the MiSeq: Imaging and Base Calling online training course.
See 2-Channel SBS Technology for more information about base calling on NovaSeq, NextSeq 500/550, and MiniSeq systems.
See Illumina Sequencing Technology for more information about base calling on MiSeq and HiSeq systems.

FASTQ files explained

Contact Us

Technical Support

Share With Tech Support

Other Support

Technical Support

techsupport@illumina.com

任何地方，任何实验室

Illumina Single Cell 3' RNA Prep

NGS 流程助手——现已支持肿瘤工作流

Illumina Connected Multiomics

NovaSeq X梦无垠，创无限。

比以往处理的更多、更快

借助人工智能推动基因组学研究发展

借助人工智能推动基因组学研究发展

借助人工智能推动基因组学研究发展

借助人工智能推动基因组学研究发展

借助人工智能推动基因组学研究发展

借助人工智能推动基因组学研究发展

借助人工智能推动基因组学研究发展

Illumina与SomaLogic 携手合作

Illumina与SomaLogic 携手合作

Illumina与SomaLogic 携手合作

Illumina与SomaLogic 携手合作

Illumina与SomaLogic 携手合作

Illumina与SomaLogic 携手合作

Illumina与SomaLogic 携手合作

MiSeq i100 系列

MiSeq i100 系列

MiSeq i100 系列

MiSeq i100 系列

MiSeq i100 系列

MiSeq i100 系列

深入探索癌症，助力精准测试

深入探索癌症，助力精准测试

深入探索癌症，助力精准测试

深入探索癌症，助力精准测试

深入探索癌症，助力精准测试

Illumina COVIDSeq Test

Illumina COVIDSeq Test

Illumina COVIDSeq Test

Illumina COVIDSeq Test

基因组与甲基化组一次检测同时获得

基因组与甲基化组一次检测同时获得

基因组与甲基化组一次检测同时获得

基因组与甲基化组一次检测同时获得

基因组与甲基化组一次检测同时获得

分秒必争。无PCR的新制备方法可加快全基因组测序

分秒必争。无PCR的新制备方法可加快全基因组测序

分秒必争。无PCR的新制备方法可加快全基因组测序

分秒必争。无PCR的新制备方法可加快全基因组测序

Illumina COVIDSeq Test

Illumina COVIDSeq Test

Illumina COVIDSeq Test

Illumina COVIDSeq Test

Illumina COVIDSeq Test

基因组与甲基化组一次检测同时获得

基因组与甲基化组一次检测同时获得

基因组与甲基化组一次检测同时获得

基因组与甲基化组一次检测同时获得

基因组与甲基化组一次检测同时获得

Hear about VeriSeq NIPT from Our Customers

Hear about VeriSeq NIPT from Our Customers

Hear about VeriSeq NIPT from Our Customers

Hear about VeriSeq NIPT from Our Customers

时间就是生命—全新PCR-Free Prep建库试剂加速全基因组测序

时间就是生命—全新PCR-Free Prep建库试剂加速全基因组测序

时间就是生命—全新PCR-Free Prep建库试剂加速全基因组测序

时间就是生命—全新PCR-Free Prep建库试剂加速全基因组测序

因美纳实验流程解决方案

FASTQ files explained

Contact Us

Technical Support

Share With Tech Support

Other Support

Contact Us

Technical Support

techsupport@illumina.com

Other Support