Skip to content

john-root/natbakke

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

To Do:

Better handling of segmented text.
Better OCR handling.
More accurate registration of hOCR and OCR'd text with annos.
Additional sources of linked open data.
Validate that custom matchers are working.
Box joining.



Sample OCR text:



Manual add of individual example e.g. Toadlena as an entity works, but the bulk add does not seem to be working.

Source of data for enrichment:

names.mongabay.com/data/indians.html

About

IIIF and Entity extraction toolset

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • HTML 53.8%
  • Python 46.2%