Analysis flow

  1. bwa mem alignment
  2. samtools: convert SAM to BAM
  3. sambamba sort
  4. sambamba mark duplicates (markdup)
  5. GATK HaplotypeCaller
  6. GATK GVCFs -> final VCF file
  7. ANNOVAR: convert VCF to ANNOVAR input
  8. ANNOVAR: table_annovar
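
The eight steps above can be sketched as a list of command lines. This is only an illustration under several assumptions that are not in the original: GATK4-style invocation (`gatk HaplotypeCaller`, `gatk GenotypeGVCFs`), a `bwa` release recent enough to accept `-o` (older releases write SAM to stdout), a hypothetical sample name and file names, a made-up read group, and `hg19` / `refGene` / `humandb/` as placeholder ANNOVAR settings. The function only builds the commands; it does not run anything.

```python
import shlex
from typing import List


def build_pipeline(sample: str, ref: str, fq1: str, fq2: str) -> List[List[str]]:
    """Return one argv list per step of the analysis flow (nothing is executed)."""
    # Hypothetical read group; GATK expects reads to carry one.
    read_group = f"@RG\\tID:{sample}\\tSM:{sample}\\tPL:ILLUMINA"
    sam = f"{sample}.sam"
    bam = f"{sample}.bam"
    sorted_bam = f"{sample}.sorted.bam"
    markdup_bam = f"{sample}.markdup.bam"
    gvcf = f"{sample}.g.vcf"
    vcf = f"{sample}.vcf"
    avinput = f"{sample}.avinput"
    return [
        # 1. bwa mem alignment (assumes a bwa version that supports -o)
        ["bwa", "mem", "-R", read_group, "-o", sam, ref, fq1, fq2],
        # 2. samtools: convert SAM to BAM
        ["samtools", "view", "-b", "-o", bam, sam],
        # 3. sambamba sort
        ["sambamba", "sort", "-o", sorted_bam, bam],
        # 4. sambamba markdup
        ["sambamba", "markdup", sorted_bam, markdup_bam],
        # 5. GATK HaplotypeCaller in GVCF mode (assumed GATK4 syntax)
        ["gatk", "HaplotypeCaller", "-R", ref, "-I", markdup_bam,
         "-O", gvcf, "-ERC", "GVCF"],
        # 6. GVCF -> final VCF (assumed to be GenotypeGVCFs)
        ["gatk", "GenotypeGVCFs", "-R", ref, "-V", gvcf, "-O", vcf],
        # 7. ANNOVAR: VCF to ANNOVAR input
        ["convert2annovar.pl", "-format", "vcf4", vcf, "-outfile", avinput],
        # 8. ANNOVAR: table_annovar (placeholder database settings)
        ["table_annovar.pl", avinput, "humandb/", "-buildver", "hg19",
         "-out", sample, "-protocol", "refGene", "-operation", "g"],
    ]


if __name__ == "__main__":
    for argv in build_pipeline("sample1", "ref.fa", "r1.fastq", "r2.fastq"):
        print(shlex.join(argv))
```

Printing the commands first makes it easy to review each one before passing the argv lists to something like `subprocess.run(argv, check=True)`.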
Keycatowo / 词性标记.md
Created October 29, 2019 13:20 — forked from luw2007/词性标记.md
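Part-of-speech (POS) tags: covers the ICTPOS3.0 POS tag set, the ICTCLAS Chinese POS tagging set, the POS tags that appear in the jieba dictionary, and some POS tags that can be ignored in simhash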

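Word classes

  • Content words: nouns, verbs, adjectives, status words, distinguishing words, numerals, measure words, pronouns
  • Function words: adverbs, prepositions, conjunctions, particles, onomatopoeia, interjections.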

ICTPOS3.0 POS tag set

n noun

nr personal name