Data Format

FastQC is a Java-based software, usually in a linux environment line run with the command, it can quickly multithreading to assess the quality of sequencing data (at Quality Control)
zcat tmp.fq.gz | head -10000> tmp. fq: view the compressed file to take the first 10,000 lines into tmp.fq file.
cat tmp.fq | paste ---- | cut -f 2 | cut -c 1 | sort | uniq -c: View tmp.fq file, four lines become one line, whichever is longer cutting whichever of the second column a character, sort, remove duplicates and count the number of repetitions.

Use bwa comparison:
tmp.sam bwa mem prefix of the index tmp.fq>: to generate more than .sam file
bwa mem prefix of the index tmp.fq> tmp.sam: compare files generated .bwa.log

.Sam to convert files into .bam file:
samtools View -bs tmp.sam> tmp.bam

View .bam file:
samtools View -h tmp.bam (-h: View of the head)

.Bam sort of file:
samtools the Sort tmp.bam> tmp.sorted.bam

Find variation ():
samtools mpileup -ugf hg38.fa .bam | bcftools Call -vmo z -a rmdup.bcftools.vcf.gz


2018-12-26_223302.png

2018-12-26_223657.png

2018-12-27_224408.png

2018-12-27_224526.png

2018-12-27_225748.png

2018-12-27_225948.png

2018-12-27_230309.png

2018-12-27_230733.png

2018-12-27_230756.png

2018-12-27_230915.png

2018-12-27_231005.png

2018-12-27_231031.png

2018-12-27_231107.png

2018-12-27_231135.png

2018-12-27_231154.png

2018-12-27_231921.png

发布了515 篇原创文章 · 获赞 79 · 访问量 17万+

Guess you like

Origin blog.csdn.net/u010608296/article/details/103537246