Trait BamRecordExtensions

pub trait BamRecordExtensions {
    // Required methods
    fn aligned_blocks(&self) -> IterAlignedBlocks;
    fn aligned_block_pairs(&self) -> IterAlignedBlockPairs;
    fn introns(&self) -> IterIntrons;
    fn aligned_pairs(&self) -> IterAlignedPairs;
    fn aligned_pairs_full(&self) -> IterAlignedPairsFull;
    fn cigar_stats_nucleotides(&self) -> HashMap<Cigar, i32>;
    fn cigar_stats_blocks(&self) -> HashMap<Cigar, i32>;
    fn reference_positions(&self) -> Box<dyn Iterator<Item = i64>>;
    fn reference_positions_full(&self) -> Box<dyn Iterator<Item = Option<i64>>>;
    fn reference_start(&self) -> i64;
    fn reference_end(&self) -> i64;
    fn seq_len_from_cigar(&self, include_hard_clip: bool) -> usize;
}

Extra functionality for BAM records

Inspired by pysam

Required Methods

fn aligned_blocks(&self) -> IterAlignedBlocks

Iterator over the start and end positions of aligned gapless blocks.

The start and end positions are in genomic coordinates. Consecutive blocks are not necessarily separated by a gap on the genome: at an insertion, one block ends at the position where the next begins.

pysam: blocks

See also: aligned_block_pairs if you need the read coordinates as well.
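To illustrate how gapless blocks fall out of a CIGAR string, here is a minimal, self-contained sketch. It is not the crate's implementation: `Op` and `aligned_blocks_sketch` are hypothetical names introduced for illustration, and the half-open `[start, end)` convention is an assumption.

```rust
/// Simplified CIGAR operation. A stand-in for illustration only,
/// not the crate's own `Cigar` type.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Op {
    Match(u32),    // M, = or X: consumes read and reference
    Ins(u32),      // I: consumes read only
    Del(u32),      // D: consumes reference only
    RefSkip(u32),  // N: consumes reference only
    SoftClip(u32), // S: consumes read only
    HardClip(u32), // H: consumes neither
}

/// Returns one [start, end) reference interval per aligned operation.
/// Assumption: intervals are half-open and adjacent blocks are not merged.
pub fn aligned_blocks_sketch(ref_start: i64, cigar: &[Op]) -> Vec<[i64; 2]> {
    let mut pos = ref_start;
    let mut blocks = Vec::new();
    for op in cigar {
        match *op {
            Op::Match(len) => {
                let len = i64::from(len);
                blocks.push([pos, pos + len]);
                pos += len;
            }
            // Deletions and skips advance the reference without emitting a block.
            Op::Del(len) | Op::RefSkip(len) => pos += i64::from(len),
            // These operations do not move along the reference.
            Op::Ins(_) | Op::SoftClip(_) | Op::HardClip(_) => {}
        }
    }
    blocks
}
```

In this sketch, a CIGAR of 5M2I3M starting at reference position 100 gives the blocks [100, 105] and [105, 108]: the insertion separates them in the read, but they touch on the genome.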

fn aligned_block_pairs(&self) -> IterAlignedBlockPairs

Iterator over ([read_start, read_stop], [genome_start, genome_stop]) blocks of continuously aligned bases.

In contrast to aligned_blocks, this returns read and genome coordinates. In contrast to aligned_pairs, this returns just the start-stop coordinates of each block.

Consecutive blocks are not necessarily separated by a gap in both coordinate spaces: an insertion leaves no gap on the genome, and a deletion leaves no gap in the read.
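The relationship between the two coordinate spaces can be sketched as follows. This is a self-contained illustration under stated assumptions, not the crate's code: `Op` and `aligned_block_pairs_sketch` are hypothetical names, intervals are assumed half-open, and read coordinates are assumed to start at 0 and to count soft-clipped bases.

```rust
/// Simplified CIGAR operation. A stand-in for illustration only,
/// not the crate's own `Cigar` type.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Op {
    Match(u32),    // M, = or X: consumes read and reference
    Ins(u32),      // I: consumes read only
    Del(u32),      // D: consumes reference only
    RefSkip(u32),  // N: consumes reference only
    SoftClip(u32), // S: consumes read only
    HardClip(u32), // H: consumes neither
}

/// Returns ([read_start, read_stop], [genome_start, genome_stop]) per aligned operation.
pub fn aligned_block_pairs_sketch(
    ref_start: i64,
    cigar: &[Op],
) -> Vec<([i64; 2], [i64; 2])> {
    let mut read_pos: i64 = 0;
    let mut ref_pos = ref_start;
    let mut pairs = Vec::new();
    for op in cigar {
        match *op {
            Op::Match(len) => {
                let len = i64::from(len);
                pairs.push(([read_pos, read_pos + len], [ref_pos, ref_pos + len]));
                read_pos += len;
                ref_pos += len;
            }
            // Read-only operations open a gap in read coordinates.
            Op::Ins(len) | Op::SoftClip(len) => read_pos += i64::from(len),
            // Reference-only operations open a gap in genome coordinates.
            Op::Del(len) | Op::RefSkip(len) => ref_pos += i64::from(len),
            Op::HardClip(_) => {}
        }
    }
    pairs
}
```

With 5M2I3M4D6M at reference position 100, the second and third blocks are adjacent in the read ([9, 12] then [12, 18]) but separated on the genome ([105, 108] then [112, 118]), because the deletion consumes only reference bases.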

fn introns(&self) -> IterIntrons

This scans the CIGAR for reference skips and reports their positions. It does not inspect the reported regions for actual splice sites.

pysam: get_introns
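The scan described above can be sketched in a few lines. This is an illustration only, not the crate's implementation: `Op` and `introns_sketch` are hypothetical names, and the half-open `[start, end)` output is an assumption.

```rust
/// Simplified CIGAR operation. A stand-in for illustration only,
/// not the crate's own `Cigar` type.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Op {
    Match(u32),    // M, = or X: consumes read and reference
    Ins(u32),      // I: consumes read only
    Del(u32),      // D: consumes reference only
    RefSkip(u32),  // N: consumes reference only
    SoftClip(u32), // S: consumes read only
    HardClip(u32), // H: consumes neither
}

/// Returns one [start, end) reference interval per reference skip (N).
/// No splice-site check is performed on the reported regions.
pub fn introns_sketch(ref_start: i64, cigar: &[Op]) -> Vec<[i64; 2]> {
    let mut pos = ref_start;
    let mut introns = Vec::new();
    for op in cigar {
        match *op {
            Op::RefSkip(len) => {
                let len = i64::from(len);
                introns.push([pos, pos + len]);
                pos += len;
            }
            // Matches and deletions advance the reference without being reported.
            Op::Match(len) | Op::Del(len) => pos += i64::from(len),
            Op::Ins(_) | Op::SoftClip(_) | Op::HardClip(_) => {}
        }
    }
    introns
}
```

A deletion also consumes reference bases but is not reported here; only N operations count as skips.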

fn aligned_pairs(&self) -> IterAlignedPairs

Iterator over aligned read and reference positions at base-pair resolution.

Insertions, deletions, and skipped regions produce no entry.

pysam: get_aligned_pairs(matches_only = True)

See also aligned_block_pairs if you only need the start and end coordinates of each block. It carries the same information while allocating less memory.
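As a rough illustration of the per-base output, the sketch below expands each aligned operation into individual [read_pos, genome_pos] pairs. It is not the crate's implementation: `Op` and `aligned_pairs_sketch` are hypothetical names, and 0-based read positions that count soft-clipped bases are an assumption.

```rust
/// Simplified CIGAR operation. A stand-in for illustration only,
/// not the crate's own `Cigar` type.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Op {
    Match(u32),    // M, = or X: consumes read and reference
    Ins(u32),      // I: consumes read only
    Del(u32),      // D: consumes reference only
    RefSkip(u32),  // N: consumes reference only
    SoftClip(u32), // S: consumes read only
    HardClip(u32), // H: consumes neither
}

/// Returns one [read_pos, genome_pos] pair per aligned base.
pub fn aligned_pairs_sketch(ref_start: i64, cigar: &[Op]) -> Vec<[i64; 2]> {
    let mut read_pos: i64 = 0;
    let mut ref_pos = ref_start;
    let mut pairs = Vec::new();
    for op in cigar {
        match *op {
            Op::Match(len) => {
                // Emit every aligned base individually.
                for offset in 0..i64::from(len) {
                    pairs.push([read_pos + offset, ref_pos + offset]);
                }
                read_pos += i64::from(len);
                ref_pos += i64::from(len);
            }
            // No entry: only the read position advances.
            Op::Ins(len) | Op::SoftClip(len) => read_pos += i64::from(len),
            // No entry: only the reference position advances.
            Op::Del(len) | Op::RefSkip(len) => ref_pos += i64::from(len),
            Op::HardClip(_) => {}
        }
    }
    pairs
}
```

The output grows with the number of aligned bases, whereas a block-level representation grows only with the number of CIGAR operations, which is why the block-level iterator is cheaper when per-base pairs are not needed.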
