PySpect

Home

lists

Frequently asked questions

© 2025 PySpect

Package profile

docling

  • Summary: SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications.
  • Author: Christoph Auer <cau@zurich.ibm.com>, Michele Dolfi <dol@zurich.ibm.com>, Maxim Lysak <mly@zurich.ibm.com>, Nikos Livathinos <nli@zurich.ibm.com>, Ahmed Nassar <ahn@zurich.ibm.com>, Panos Vagenas <pva@zurich.ibm.com>, Peter Staar <taa@zurich.ibm.com>
  • Homepage: https://github.com/docling-project/docling
  • Source: https://github.com/docling-project/docling (Repo profile)
  • Number of releases: 124
  • First release: 0.1.0 on 2024-07-15
  • Latest release: 2.51.0 on 2025-09-05

Releases

Dates and sizes of releasesOctober2025AprilJulyRelease Date0.050.100.150.20Size in MB

PyPI Downloads

Loading PyPI statistics...

Dependencies

Docling has 37 dependencies, 10 of which optional.
Dependencies of docling (37).
DependencyOptional
acceleratefalse
beautifulsoup4false
certififalse
docling-corefalse
docling-ibm-modelsfalse
docling-parsefalse
easyocrfalse
filetypefalse
huggingface_hubfalse
lxmlfalse
markofalse
openpyxlfalse
pandasfalse
pillowfalse
pluggyfalse
polyfactoryfalse
pydanticfalse
pydantic-settingsfalse
pylatexencfalse
pypdfium2false
python-docxfalse
python-pptxfalse
requestsfalse
rtreefalse
scipyfalse
tqdmfalse
typerfalse
mlx-vlmtrue
modelscopetrue
ocrmactrue
onnxruntimetrue
openai-whispertrue
qwen-vl-utilstrue
rapidocrtrue
tesserocrtrue
transformerstrue
vllmtrue

Details