Cocoa: Compact cover annotator for biological noun phrases
Cocoa is a dense annotator for biological text. It annotates macromolecules, chemicals, protein/DNA parts, complexes, organisms, processes, anatomical parts, locations, physiological terms, parameters, values, experimental techniques, surgical procedures, and foods. This dense annotation cover should suffice to jumpstart a semantic frame-based relation annotator.
Cocoa is also accessible through a WebAPI, which returns markup in JSON and A1 (GENIA/Brat) formats for integration into text processing pipelines such as GATE. Cocoa is directly accessible for automatic annotation from within the Brat annotator (instructions here). Cocoa can also be used for named entity detection in the Turku event extraction pipeline (instructions here).
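For pipeline integration, A1 output can be read with a few lines of code. The sketch below is a minimal example that assumes the A1 output follows the usual Brat standoff layout for text-bound annotations (an ID, then a type with character offsets, then the covered text, separated by tabs). The sample document, the entity type labels, and the names `parse_a1` and `TextBound` are hypothetical and used only for illustration; the labels Cocoa actually emits may differ.

```python
from typing import List, NamedTuple, Tuple


class TextBound(NamedTuple):
    """One text-bound annotation (a 'T' line) in Brat standoff / A1 format."""
    id: str
    type: str
    spans: List[Tuple[int, int]]  # one (start, end) pair per fragment
    text: str


def parse_a1(a1_text: str) -> List[TextBound]:
    """Parse the text-bound ('T') lines of an A1 file; skip everything else."""
    entities = []
    for line in a1_text.splitlines():
        if not line.startswith("T"):
            continue  # not a text-bound annotation
        parts = line.split("\t")
        if len(parts) < 3:
            continue  # malformed line: expected ID, type+offsets, text
        ann_id, type_and_offsets, surface = parts[0], parts[1], parts[2]
        ann_type, _, offsets = type_and_offsets.partition(" ")
        spans = []
        # Discontinuous annotations separate fragments with ';'
        for fragment in offsets.split(";"):
            start, end = fragment.split()
            spans.append((int(start), int(end)))
        entities.append(TextBound(ann_id, ann_type, spans, surface))
    return entities


# Hypothetical sample: the type labels below are assumptions, not Cocoa's
# documented label set.
SAMPLE_TEXT = "p53 binds DNA in the nucleus."
SAMPLE_A1 = (
    "T1\tProtein 0 3\tp53\n"
    "T2\tMolecule 10 13\tDNA\n"
    "T3\tLocation 21 28\tnucleus\n"
)

entities = parse_a1(SAMPLE_A1)
```

Each parsed entity carries character offsets into the original text, so `SAMPLE_TEXT[start:end]` recovers the annotated phrase. This is what a downstream relation annotator or a GATE-style pipeline would typically need.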