Cocoa Compact cover annotator for biological noun phrases
Cocoa is a dense annotator for biological text. It annotates macromolecules, chemicals, protein/DNA parts, complexes, organisms, processes, anatomical parts, locations, physiological terms, parameters, values, experimental techniques, surgical procedures, and foods. The dense annotation cover should suffice to jumpstart a semantic frame -based relation annotator.
Cocoa is also accessible through a WebAPI
, which returns markup in JSON and A1
) formats for integration into text processing pipelines such as GATE
. Cocoa is directly accessible for automatic annotation from within the Brat annotator (instructions here
). Cocoa can also be used for named entity detection in the Turku event extraction pipeline