Add new attachment

Only authorized users are allowed to upload new attachments.

List of attachments

Kind Attachment Name Size Version Date Modified Author Change note
zip
vPECA.zip 85,728.8 kB 1 06-Jun-2019 09:15 YongWang

This page (revision-4) was last changed on 06-Jun-2019 09:20 by YongWang

This page was created on 06-Jun-2019 09:09 by YongWang

Only authorized users are allowed to rename pages.

Only authorized users are allowed to delete pages.

Difference between version and

At line 13 changed one line
We develop a variants interpretation method by Paired Expression and Chromatin Accessibility data to model genome-wide chromatin accessibility profiles for high-altitude hypoxia adaptation in HUVEC, to reveal causal SNPs, active and active selected regulatory elements for a certain gene.
We develop a new method called vPECA (Variants interpretation model by Paired Expression and Chromatin Accessibility data) to model genome-wide chromatin accessibility profiles for high-altitude hypoxia adaptation in HUVEC, to reveal causal SNPs, active and active selected regulatory elements for a certain gene.
vPECA can integrate our measured paired expression and chromatin accessibility data with the available public data, including population genetics data, functional genomics data in ENCODE, and Hi-C data for HUVEC. Our previous work PECA integrates paired expression and chromatin accessibility data across diverse cellular contexts and model the localization to REs of chromatin regulators (CR), the activation of REs due to CRs that are localized to them, and the effect of TFs bound to activated REs on the transcription of target genes (TG) 18. Our innovation here is to extend PECA to interpret genetic variants from population genetics and matched WGS data. vPECA models how positively selected noncoding SNPs affects the RE’s selection status, chromatin accessibility, and activity and further determine the target gene expression. The statistical modelling allows us to systematically identify active REs, active selected REs, and gene regulatory network to interpret variants.
At line 15 removed one line
!!!Procedure
At line 17 changed one line
!!Useful Resource:
!!!Processing data
vPECA model requires input as sample matched time-series RNA-seq, and ATAC-seq, and individual matched DNA-seq data together with selection scores calculated for each SNP from public data. For RNA-seq and ATAC-seq data, first we processed raw reads into an expression matrix with row genes and column samples. And chromatin accessibility data as a matrix with element by sample dimensions. The candidate RE and TG pairs based on distance are collected into a Element_gene in Data_prior.mat. Then the SNPs locate on REs and their corresponding selection scores are in a text file named element_SNP_use.txt. The prior (TF-TG, TG-RE) learned from public data could be set to certain number if it is not available. TF binding strength are calculated from motif scan algorithm.
At line 19 changed one line
Download 25 bio-sample matched RNA-seq and DNase-seq data from mouse ENCODE project.
!!!Running vPECA
All the main programs are in main_PECA.m file. Please run the script and get the result from the folder called Output. All the selection status of each RE, and TF-RE-TG triplets are listed
At line 21 removed one line
Download regulatoryElement-targetGene association.
At line 23 changed one line
Download PECA predicted whole network and context specific network for 25 tissues or cell type.
!!!Requirements
MATLAB 2018a
At line 27 added 2 lines
!!!Time
It usually takes several seconds on each gene. In total, about 24 hours are required to processing all the genes, and write all text output files.
At line 27 removed one line
!!Using PECA Tools:
At line 29 removed 22 lines
PECA-TNet: Matlab tool for context specific network
Input: paired expression of genes (Exp) and chromatin accessibility of regulatory elements (Opn) on your interested cellular context.
Output: context specific network (TissueSpecificNet).
Usage: TissueSpecificNet=PECATool1(Exp,Opn,GeneName,ElementName);
Where GeneName and ElementName are name of gene and regulatory elements.
PECA-Anno: Matlab tool for genomic region annotation by network
Input: Bed file of the genomic region you interested (region.bed’).
Output: annotation network of your region (RegionNet).
Usage: RegionNet=PECATool2(‘region.bed’);
At line 55 changed one line
* [PECA|http://web.stanford.edu/~zduren/PECA/]
* [vPECA.zip|vPECA.zip]
At line 59 changed one line
Category: [Software]
Category: [Supplementary] [Software]
Version Date Modified Size Author Changes ... Change note
4 06-Jun-2019 09:20 3.191 kB YongWang to previous
3 06-Jun-2019 09:18 3.2 kB YongWang to previous | to last
2 06-Jun-2019 09:13 2.941 kB YongWang to previous | to last
1 06-Jun-2019 09:09 1.931 kB YongWang to last
« This page (revision-4) was last changed on 06-Jun-2019 09:20 by YongWang