Struct noodles_vcf::AsyncReader
source · pub struct AsyncReader<R> { /* private fields */ }
Expand description
An async VCF reader.
The VCF format has two main parts: 1) a header and 2) a list of VCF records.
Each header line is prefixed with a #
(number sign) and is terminated by the header header
(#CHROM
…; inclusive).
VCF records are line-based and follow directly after the header until EOF.
Examples
use futures::TryStreamExt;
use noodles_vcf as vcf;
use tokio::{fs::File, io::BufReader};
let mut reader = File::open("sample.vcf")
.await
.map(BufReader::new)
.map(vcf::AsyncReader::new)?;
let header = reader.read_header().await?;
let mut records = reader.records(&header);
while let Some(record) = records.try_next().await? {
// ...
}
Implementations§
source§impl<R> Reader<R>where
R: AsyncBufRead + Unpin,
impl<R> Reader<R>where R: AsyncBufRead + Unpin,
sourcepub fn new(inner: R) -> Self
pub fn new(inner: R) -> Self
Creates an async VCF reader.
Examples
use noodles_vcf as vcf;
let data = [];
let reader = vcf::AsyncReader::new(&data[..]);
sourcepub async fn read_header(&mut self) -> Result<Header>
pub async fn read_header(&mut self) -> Result<Header>
Reads the VCF header.
This reads all header lines prefixed with a #
(number sign), which includes the header
header (#CHROM
…), and parses it as a crate::Header
.
The position of the stream is expected to be at the start.
Examples
use noodles_vcf as vcf;
let data = b"##fileformat=VCFv4.3
#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO
sq0\t1\t.\tA\t.\t.\tPASS\t.
";
let mut reader = vcf::AsyncReader::new(&data[..]);
let header = reader.read_header().await?;
sourcepub async fn read_record(
&mut self,
header: &Header,
record: &mut Record
) -> Result<usize>
pub async fn read_record( &mut self, header: &Header, record: &mut Record ) -> Result<usize>
Reads a single VCF record.
This reads a line from the underlying stream until a newline is reached and parses that line into the given record.
The stream is expected to be directly after the header or at the start of another record.
It is more ergonomic to read records using a stream (see Self::records
and
Self::query
), but using this method allows control of the record buffer.
If successful, the number of bytes read is returned. If the number of bytes read is 0, the stream reached EOF.
Examples
use noodles_vcf as vcf;
let data = b"##fileformat=VCFv4.3
#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO
sq0\t1\t.\tA\t.\t.\tPASS\t.
";
let mut reader = vcf::AsyncReader::new(&data[..]);
let header = reader.read_header().await?;
let mut record = vcf::Record::default();
reader.read_record(&header, &mut record).await?;
sourcepub async fn read_lazy_record(&mut self, record: &mut Record) -> Result<usize>
pub async fn read_lazy_record(&mut self, record: &mut Record) -> Result<usize>
Reads a single record without eagerly parsing its fields.
The reads VCF record fields from the underlying stream into the given record’s buffer until a newline is reached. No fields are parsed, meaning the record is not necessarily valid. However, the structure of the line is guaranteed to be record-like.
The stream is expected to be directly after the header or at the start of another record.
If successful, the number of bytes read is returned. If the number of bytes read is 0, the stream reached EOF.
Examples
use noodles_vcf as vcf;
let data = b"##fileformat=VCFv4.3
#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO
sq0\t1\t.\tA\t.\t.\tPASS\t.
";
let mut reader = vcf::AsyncReader::new(&data[..]);
reader.read_header().await?;
let mut record = vcf::lazy::Record::default();
reader.read_lazy_record(&mut record).await?;
sourcepub fn records<'r, 'h: 'r>(
&'r mut self,
header: &'h Header
) -> impl Stream<Item = Result<Record>> + 'r
pub fn records<'r, 'h: 'r>( &'r mut self, header: &'h Header ) -> impl Stream<Item = Result<Record>> + 'r
Returns an (async) stream over records starting from the current (input) stream position.
The (input) stream is expected to be directly after the header or at the start of another record.
Unlike Self::read_record
, each record is parsed as a Record
.
Examples
use futures::TryStreamExt;
use noodles_vcf as vcf;
let data = b"##fileformat=VCFv4.3
#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO
sq0\t1\t.\tA\t.\t.\tPASS\t.
";
let mut reader = vcf::AsyncReader::new(&data[..]);
let header = reader.read_header().await?;
let mut records = reader.records(&header);
while let Some(record) = records.try_next().await? {
// ...
}
source§impl<R> Reader<AsyncReader<R>>where
R: AsyncRead,
impl<R> Reader<AsyncReader<R>>where R: AsyncRead,
sourcepub fn virtual_position(&self) -> VirtualPosition
pub fn virtual_position(&self) -> VirtualPosition
Returns the current virtual position of the underlying BGZF reader.
Examples
use noodles_bgzf as bgzf;
use noodles_vcf as vcf;
let data = [];
let reader = vcf::AsyncReader::new(bgzf::AsyncReader::new(&data[..]));
assert_eq!(reader.virtual_position(), bgzf::VirtualPosition::default());
source§impl<R> Reader<AsyncReader<R>>where
R: AsyncRead + AsyncSeek + Unpin,
impl<R> Reader<AsyncReader<R>>where R: AsyncRead + AsyncSeek + Unpin,
sourcepub async fn seek(&mut self, pos: VirtualPosition) -> Result<VirtualPosition>
pub async fn seek(&mut self, pos: VirtualPosition) -> Result<VirtualPosition>
Seeks the underlying BGZF stream to the given virtual position.
Virtual positions typically come from an associated index.
Examples
use noodles_bgzf as bgzf;
use noodles_vcf as vcf;
let data = Cursor::new([]);
let mut reader = vcf::AsyncReader::new(bgzf::AsyncReader::new(data));
let virtual_position = bgzf::VirtualPosition::default();
reader.seek(virtual_position).await?;
sourcepub fn query<'r>(
&'r mut self,
header: &'r Header,
index: &Index,
region: &Region
) -> Result<impl Stream<Item = Result<Record>> + 'r>
pub fn query<'r>( &'r mut self, header: &'r Header, index: &Index, region: &Region ) -> Result<impl Stream<Item = Result<Record>> + 'r>
Returns a stream over records that intersects the given region.
The position of the (input) stream is expected to after the header or at the start of another record.
Examples
use futures::TryStreamExt;
use noodles_bgzf as bgzf;
use noodles_core::Region;
use noodles_tabix as tabix;
use noodles_vcf as vcf;
use tokio::fs::File;
let mut reader = File::open("sample.vcf.gz")
.await
.map(bgzf::AsyncReader::new)
.map(vcf::AsyncReader::new)?;
let header = reader.read_header().await?;
let index = tabix::read("sample.vcf.gz.tbi")?;
let region = "sq0:8-13".parse()?;
let mut query = reader.query(&header, &index, ®ion)?;
while let Some(record) = query.try_next().await? {
// ...
}