IUBio GIL .. BIOSCI/Bionet News .. Biosequences .. Software .. FTP

EMBL database: parsing problem.

Ramu Chenna chenna
Wed Jul 9 09:31:13 EST 1997


Hello all

There is a parsing problem with embl db. It does not parse more than one
line in the 'OC OS OG' fields.

To correct that insert the following productions for 
the token table  'org' and rebuild the index.


  org:       ~ {$In:[fields c:org] $Out} 
               (oc | os | og)* ~
  oc:        ~ ('OC' (/[^;.\n]+/ {$Wrt} /[^\n]/ )+) ln ~
  os:        ~  'OS' (/[^(\n]+/ {$Wrt:[s:$Trim:$Ct]})+ ~
  og:        ~  'OG' (/[^;.\n]+/ {$Wrt})+ ~



# old code...
#               ('OC' (/[^;.\n]+/ {$Wrt} | /[^\n]/)*)+ |
#                'OS' /[^(\n]+/ {$Wrt:[s:$Trim:$Ct]} 
#                ('(' (/[^ \n)]+/ | /[^)]/)+)? |
#                'OG'  /[^\n;.]+/ {$Wrt} ~

Ramu


-- 
________________________________________________________________________________
Chenna Ramu; EMBL; Postfach 10.2209; 69012 Heidelberg; Germany. _/_/_/  _/_/_/
Email: chenna at embl-heidelberg.de                     	        _/      _/   _/
   Url: http://www.embl-heidelberg.de/~chenna/        	        _/    . _/_/_/  
   Tel: (49) 6221 387530 (Off) ; Fax: (49) 6221 387517	        _/      _/   _/  
								_/_/_/  _/    _/





More information about the Bio-srs mailing list

Send comments to us at archive@iubioarchive.bio.net