For a guide to the syntax of regular expressions, see the Regular Expression Language - Quick Reference page on the Microsoft Developer Network (MSDN) at http://msdn.microsoft.com/en-us/library/az24scfc.aspx.
Procedure
- Select Administration > Maintain FASTA Parsing Rules.
- The FASTA Parsing Rules view opens.
- Select Add.
- Enter the new parsing rule as a regular expression in the parsing rule text area, and change the preset rule name, if needed.
- The title line rules and the accession rules use named capture groups to capture the accession and description of the protein.
- Title line rules (Title Line Rules area) must specify two named capture groups: AC1 for the protein accession and Desc1 for the protein description.
- Accession rules (Accession Rules area) require a named capture group, AC, for the accession.
- Taxonomy rules (Taxonomy Rules area) require a matched taxonomy string.
- Avoid expression rules (Avoid Expression Rules area) require a matched string.
- To change or rename a regular expression, select the parsing rule in the list, and select Edit.
- You can now edit the rule in the parsing rule text area.
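
The named capture groups that title line rules and accession rules require can be tried out before a rule is entered. The following is a minimal sketch, not the application's own parser: the two rules, the sample title line, and the helper functions are all hypothetical, and how the software applies an accession rule to its input is an assumption here. The rules are written in .NET syntax, (?<name>...), and converted for testing because Python's re module spells named groups as (?P<name>...).

```python
import re

# Hypothetical title line rule in .NET syntax. AC1 and Desc1 are the
# group names that title line rules must define.
DOTNET_TITLE_RULE = r"^>(?<AC1>\S+)\s+(?<Desc1>.*)$"

# Hypothetical accession rule in .NET syntax. AC is the group name that
# accession rules must define.
DOTNET_ACCESSION_RULE = r"^\w+\|(?<AC>[^|]+)\|"


def to_python_syntax(dotnet_pattern: str) -> str:
    """Rewrite .NET named groups (?<name>...) as Python's (?P<name>...).

    Lookbehind assertions, (?<=...) and (?<!...), are left untouched.
    This is only meant for simple patterns like the ones above.
    """
    return re.sub(r"\(\?<(?![=!])", "(?P<", dotnet_pattern)


def apply_rule(dotnet_pattern: str, text: str) -> dict:
    """Return the named groups captured by the rule, or {} if no match."""
    match = re.match(to_python_syntax(dotnet_pattern), text)
    return match.groupdict() if match else {}


# Made-up FASTA title line used only for illustration.
title_line = ">sp|P12345|EXAMPLE_HUMAN Example protein"

title_groups = apply_rule(DOTNET_TITLE_RULE, title_line)
print(title_groups)

# Assumption: the accession rule is applied to the captured AC1 value.
accession_groups = apply_rule(DOTNET_ACCESSION_RULE, title_groups["AC1"])
print(accession_groups)
```

If a rule returns an empty result for a representative title line from the FASTA file, the rule does not match that line and probably needs adjusting before it is saved.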