You can find a guide explaining the syntax of regular expressions on the Microsoft Developer Network (MSDN) home page (Regular Expression Language - Quick Reference page) at http://msdn.microsoft.com/en-us/library/az24scfc.aspx.

Procedure

  1. Select Administration > Maintain FASTA Parsing Rules.
  2. The FASTA Parsing Rules view opens.
  3. Select Add.
  4. Enter the new parsing rule as a regular expression in the parsing rule text area, and change the preset rule name, if needed.
  5. The title line rules and the accession rules use named capture groups to capture the accession and description of the protein.
  6. Title line rules (Title Line Rules area) must specify a named capture group, AC1, for the protein accession and a group, Desc1, for the protein description.
  7. Accession rules (Accession Rules area) require a capture group, AC, for the accession.
  8. Taxonomy rules (Taxonomy Rules area) require a matched taxonomy string.
  9. Avoid expression rules (Avoid Expression Rules area) require a matched string.
  10. To change or rename a regular expression, select the parsing rule in the list, and select Edit.
  11. You can now edit the rule in the parsing rule text area.