![]() |
OpenMS
|
Digests a protein database in-silico.
| pot. predecessor tools | → Digestor → | pot. successor tools |
|---|---|---|
| none (FASTA input) | IDFilter (peptide blacklist) |
This application is used to digest a protein database to get all peptides given a cleavage enzyme.
The output can be used e.g. as a blacklist filter input to IDFilter, to remove certain peptides.
The command line parameters of this tool are:
Digestor -- Digests a protein database in-silico.
Full documentation: http://www.openms.de/doxygen/release/3.4.1/html/TOPP_Digestor.html
Version: 3.4.1 May 19 2025, 14:24:34, Revision: 8aec8ec
To cite OpenMS:
+ Pfeuffer, J., Bielow, C., Wein, S. et al.. OpenMS 3 enables reproducible analysis of large-scale mass spec
trometry data. Nat Methods (2024). doi:10.1038/s41592-024-02197-7.
Usage:
Digestor <options>
Options (mandatory options marked with '*'):
-in <file>* Input file (valid formats: 'fasta')
-out <file>* Output file (peptides) (valid formats: 'idXML', 'fasta')
-out_type <type> Set this if you cannot control the filename of 'out', e.g., in TOPPAS. (valid:
'idXML', 'fasta')
-missed_cleavages <number> The number of allowed missed cleavages (default: '1') (min: '0')
-min_length <number> Minimum length of peptide (default: '6')
-max_length <number> Maximum length of peptide (default: '40')
-enzyme <string> The type of digestion enzyme (default: 'Trypsin') (valid: 'glutamyl endopeptid
ase', 'Alpha-lytic protease', 'no cleavage', 'unspecific cleavage', 'Trypsin',
'Arg-C', 'Arg-C/P', 'Asp-N', 'Asp-N/B', 'Glu-C+P', 'PepsinA + P', 'cyanogen-b
romide', 'Clostripain/P', 'elastase-trypsin-chymotrypsin', 'Asp-N_ambic', 'Chy
motrypsin', 'Chymotrypsin/P', 'CNBr', 'Formic_acid', 'Lys-C', 'Lys-N', 'Lys-C/
P', 'PepsinA', 'TrypChymo', 'Trypsin/P', 'V8-DE', 'V8-E', 'leukocyte elastase'
, 'proline endopeptidase', '2-iodobenzoate', 'iodosobenzoate', 'staphylococcal
protease/D', 'proline-endopeptidase/HKR')
Options for FASTA output files:
-FASTA:ID <option> Identifier to use for each peptide: copy from parent protein (parent); a conse
cutive number (number); parent ID + consecutive number (both) (default: 'paren
t') (valid: 'parent', 'number', 'both')
-FASTA:description <option> Keep or remove the (possibly lengthy) FASTA header description. Keeping it
can increase resulting FASTA file significantly. (default: 'remove') (valid:
'remove', 'keep')
Common TOPP options:
-ini <file> Use the given TOPP INI file
-threads <n> Sets the number of threads allowed to be used by the TOPP tool (default: '1')
-write_ini <file> Writes the default configuration file
--help Shows options
--helphelp Shows all options (including advanced)
INI file documentation of this tool: