基因组注释之软件使用 - 代码天地

基因组注释之软件使用

其他 2020-02-21 12:17:32 阅读次数: 0

1、RepeatMasker

1.1、输入

输入格式为fasta序列，不接受其它 GenBank, Staden,等格式。它既可以处理一个批文件(一个文件包含许多条序列)，也可以批处理许多文件(每个文件含有一条序列)。

RepeatMasker *.fasta

该命令将mask当前目录下所有的以.fasta文件结尾，并为每个文件提供单独的报告。虽然处理批文件更快，但是处理单个文件更精准。

This command will mask all files that end with .fasta in the current directory and give separate reports for each file. Note that if you have
multiple small sequences it is considerably faster to run RepeatMasker on one batch file than on many single sequence files. The summary file 
will be more informative as well. However, analysis on single files (when larger than 2 kb each) can be slightly more accurate, since GC levels
 for each sequence will be calculated and used to choose appropriate parameters.

1.2、输出

RepeatMasker返回3个文件：

.mask文件：其中包含所有已标识的重复和低复杂度序列，即mask后得基因组。

.out文件：列出被mask的序列，及其注释文件。序列按提交文件中的顺序打印，而序列在注释表中按字母顺序表示。

tbl文件是所分析序列的重复程度得摘要统计。

RepeatMasker returns a .masked file containing the query sequence(s) with all identified repeats and low complexity sequences masked. 
These masked sequences are listed and annotated in the .out file. The masked sequences are printed in the same order as they are in the
 submitted file, whereas the sequences are presented alphabetically in the annotation table. The .tbl file is a summary of the repeat 
content of the analyzed sequence.

猜你喜欢

转载自www.cnblogs.com/djx571/p/12340799.html

基因组注释之软件使用

基因组注释软件安装

基因组注释之基因功能注释

使用BRAKER2进行基因组注释

如何对基因组序列进行注释

植物基因组|注释版本问题|重测序vs泛基因组

hg19基因组功能区域注释

宏基因组测序中短序列的注释

基因组拼接

基因组处理

基因组版本

使用SPAdes测序数据拼接软件拼装基因组

NGS基础 - 参考基因组和基因注释文件

植物基因组|动物基因组|

基因组浏览器使用 (EPGG)

基因组从头组装

基因组测序模拟

人基因组（一）

参考基因组下载

BUSCO评估基因组

【转录组入门】04：参考基因组和注释文件

2018-6-25转录组学习3 参考基因组和基因注释

宏基因组序列无参考基因组装工具idba-ud的介绍及详细使用方法

基因数据处理114之BWA建立全基因组索引成功

三代转录组系列：使用Cogent重建基因组编码区

Nature Methods：基于人工重组菌群数据的宏基因组的软件评估金标准

conda 安装宏基因组软件megahit出现CondaHTTPError: HTTP 000 CONNECTION FAILED for url问题

NCBI批量下载基因组

绦虫基因组研究方法

google与基因组那点事儿

今日推荐

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

GCC 14.1 发布

面壁智能发布 Eurux-8x22B 开源大模型 —— 堪称「理科状元」

开源日报 | 谷歌扶持鸿蒙上位；开源Rabbit R1；Docker加持的安卓手机；微软的焦虑和野心；海尔电器把开放平台关了

中国码农的“35岁魔咒”

蘭雅 CorelDRAW 插件 2024.5.1 国际劳动节版，免费下载

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

周排行

Java自定义时间格式

同步整形电路

在开发中最最最常用的字符串的属性大集合

Linux 查看端口占用并杀掉

Java基础四：ArrayList

多线程之死锁就是这么简单

mysql 基础命令集

awk 命令详解

Centos6.3编译安装nginx+php步骤

OCR （Optical Character Recognition，光学字符识别）

每日归档

更多

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)