FastProtein – An automated software for in silico proteomic analysis
biorxiv(2023)
摘要
Background Although various tools provide proteomic information, each has its limitations regarding execution platforms, libraries, versions, and data output format. Therefore, integrating data analyses generated using different software programs is a manual process that can prolong the analysis time.
Results This paper presents FastProtein, a protein analysis pipeline tool developed in Java. This tool is user-friendly, easily installable, and provides important information regarding the subcellular location, transmembrane domains, signal peptide, molecular weight, isoelectric point, hydropathy, aromaticity, gene ontology, endoplasmic reticulum retention domains, and N- glycosylation domains of a protein. Furthermore, it helps determine the presence of glycosylphosphatidylinositol and obtain annotation information using InterProScan, PANTHER, PFam, and alignment-based annotation searches. Additionally, the software outputs a protein dataset with evidence of membrane localization.
Conclusions The proposed tool provides the scientific community with an easy and user-friendly computational tool for proteomics data analysis. The tool is applicable to both small datasets and proteome-wide studies. It can be used in either the command line interface mode or through a web interface installed on a local server or via the BioLib web interface (). FastProtein also accelerates proteomics analysis routines by generating multiple results in a one-step run. The software is open-source and freely available. Installation and execution instructions, as well as the source code and test files generated for tool validation, are provided at .
### Competing Interest Statement
The authors have declared no competing interest.
* CSV
: Comma-separated values
DPI
: Dots per inch
GB
: Gigabytes
GO
: Gene ontology
GPI
: Glycosylphosphatidylinositol
GUI
: Graphical user interface
IPR
: InterProScan
MB
: Megabytes
ODF
: Open document format
SEP
: Custom format where each line represents a protein information
SL
: Subcellular localization
TM
: Transmembrane domain
TSV
: Tab-separated values
TXT
: text-formatted
UUID
: Universally unique identifier
XLS
: Microsoft Excel
更多查看译文
AI 理解论文
溯源树
样例
![](https://originalfileserver.aminer.cn/sys/aminer/pubs/mrt_preview.jpeg)
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要