Allele-specific expressed gene
Allele-Specific Expression Quantification
To quantify allele-specific gene expression based on the LongcallR-phased BAM, run the following command to get output.ase.tsv:
python longcallR-ase.py
-b <phased_bam>
-a <annotation>
-o <output_prefix>
-t <threads>
--gene_types <gene_types>
--min_support <min_support>
- -b, --bam: longcallR phased bam file
- -a, --annotation: Annotation file GTF for GFF3 format
- -o, --output: Prefix of output files
- -t, --threads: Number of threads
- --gene_types: Gene types to be analyzed. Default: ["protein_coding", "lncRNA"]
- --min_support: Minimum support reads for counting event. Default: 10
High-Confidence ASE Using Known Genomic SNPs
To increase confidence by requiring allele-specific expression to be supported by both LongcallR SNPs and known genomic SNPs, use the following command to get output.filter_ase.tsv:
python longcallR-ase.py
-b <phased_bam>
--vcf1 <longcallR_phased_vcf>
--vcf3 <genomic_vcf>
-a <annotation>
-o <output_prefix>
-t <threads>
--gene_types <gene_types>
--min_support <min_support>
- --vcf1: LongcallR phased vcf file
- --vcf3: DNA vcf file
Using Phased Genomic VCF for Parental Allele Resolution
To assign allele-specific expression to paternal and maternal haplotypes, provide both the LongcallR-phased VCF and a whole-genome haplotype-phased DNA VCF, get output.patmat_ase.tsv:
python longcallR-ase.py
-b <phased_bam>
--vcf1 <longcallR_phased_vcf>
--vcf2 <phased_genomic_vcf>
-a <annotation>
-o <output_prefix>
-t <threads>
--gene_types <gene_types>
--min_support <min_support>
- --vcf1: LongcallR phased vcf file
- --vcf2: Whole genome haplotype phased DNA vcf file
Output format
1.output.ase.tsv or output.filter_ase.tsv
| Column | Description |
|---|---|
#Gene_name |
Gene symbol |
Chr |
Chromosome |
PS |
Phase set ID |
H1 |
Read count assigned to haplotype 1 |
H2 |
Read count assigned to haplotype 2 |
P_value |
Binomial test p-value for allelic imbalance |
2.output.patmat_ase.tsv
| Column | Description |
|---|---|
#Gene_name |
Gene symbol |
Chr |
Chromosome |
PS |
Phase set ID |
H1 |
Read count assigned to haplotype 1 |
H2 |
Read count assigned to haplotype 2 |
P_value |
Binomial test p-value for allelic imbalance |
H1_Paternal |
Read count in H1 supporting the paternal allele |
H1_Maternal |
Read count in H1 supporting the maternal allele |
H2_Paternal |
Read count in H2 supporting the paternal allele |
H2_Maternal |
Read count in H2 supporting the maternal allele |