Skip to content

v0.6.0

Latest
Compare
Choose a tag to compare
@docling-ops docling-ops released this 02 Jul 09:01

Feature

  • Layout evaluation fixes, mode control and cleanup (#133) (629a451)
  • Introduce utility to export layout predictions from HF parquet files into pycocotools format. (#125) (54f7c81)
  • Add specific language support for XFUND dataset builder (#122) (4ca6a0e)
  • Tooling for CVAT validation, to DoclingDocument transformation, new Evaluators (#119) (2ee1104)

Fix

  • Move ibm-cos to hyperscaler (#135) (9aff6c1)
  • Update hyperscalers to support multiple image file types (#118) (a34f264)
  • Misc fixes (#131) (518e1ba)
  • CVAT to DoclingDoc: Ensure that nested list handling works across page boundaries (#129) (1b58377)
  • Important fixes for parquet serialization / deserialization, optimizations (#128) (53c22ef)
  • Fixes for the dataset visualizers (#127) (a127ea9)

Performance

  • Improve parquet writing with plain pyarrow (#134) (c08950b)