PharmGKB is seeking a Java developer to work on a natural language processing (NLP) research project for information extraction from text.
This Java developer position requires experience working with Lucene and with large datasets. The developer will be extending software and data created by NLP researchers in the Altman lab, improving its efficiency, and integrating it into the current PharmGKB codebase to make it useful for curators and researchers. The developer will also be working closely with researchers, curators, and other developers to build a tool that enhances the features and overall quality of the current text annotation.
The project involves processing all research journal articles indexed by Medline (over 100 million sentences), extracting relationships between entities of interest at the sentence level via algorithms developed in the research lab, and ultimately integrating the results of this text mining project into our curators' workflow.
This is an hourly contract job for at least 30 hours/week that will end by September 2011. The developer is expected to work on-site in Palo Alto, California.
Skills
* Java
* Lucene
* RDBMS experience (e.g. Oracle)
* Experience with natural language processing/text mining (a plus, not required)
* Biology/Chemistry knowledge (a plus, not required)
Location: California