PySpect

© 2025 PySpect

Repository info

trafilatura

  • Summary: a summary of trafilatura
  • Description: Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
  • Stars: 4489
  • Number of forks: 301