Scrapinghub

Turn web content into useful data

Pinned

  1. splash Public

    Lightweight, scriptable browser as a service with an HTTP API

    Python · 4.2k stars · 516 forks

  2. dateparser Public

    Python parser for human-readable dates

    Python · 2.7k stars · 484 forks

  3. python-scrapinghub Public

    A client interface for Scrapinghub's API

    Python · 205 stars · 61 forks

  4. extruct Public

    Extract embedded metadata from HTML markup

    Python · 935 stars · 119 forks

  5. spidermon Public

    Scrapy extension for monitoring spider execution.

    Python · 547 stars · 100 forks

  6. python-crfsuite Public

    A Python binding for CRFsuite

    Python · 771 stars · 222 forks

Repositories

Showing 10 of 183 repositories
  • shub Public

    Scrapinghub Command Line Client

    Python · 130 stars · BSD-3-Clause · 81 forks · 48 issues (7 need help) · 16 pull requests · Updated Nov 3, 2025
  • scrapinghub-entrypoint-scrapy Public

    Scrapy entrypoint for Scrapinghub job runner

    Python · 26 stars · BSD-3-Clause · 16 forks · 6 issues · 1 pull request · Updated Nov 3, 2025
  • hcf-backend Public

    Crawl Frontier HCF backend

    Python · 8 stars · BSD-3-Clause · 6 forks · 2 issues · 1 pull request · Updated Oct 31, 2025
  • shub-workflow Public
    Python · 15 stars · BSD-3-Clause · 14 forks · 2 issues · 1 pull request · Updated Oct 30, 2025
  • web-poet Public

    Web scraping Page Objects core library

    Python · 101 stars · BSD-3-Clause · 18 forks · 16 issues (1 needs help) · 13 pull requests · Updated Oct 28, 2025
  • dateparser Public

    Python parser for human-readable dates

    Python · 2,739 stars · BSD-3-Clause · 484 forks · 298 issues (5 need help) · 54 pull requests · Updated Oct 28, 2025
  • docker-images Public
    Dockerfile · 33 stars · 8 forks · 0 issues · 5 pull requests · Updated Oct 20, 2025
  • scrapy-poet Public

    Page Object pattern for Scrapy

    Python · 123 stars · BSD-3-Clause · 28 forks · 13 issues (1 needs help) · 5 pull requests · Updated Oct 17, 2025
  • price-parser Public

    Extract price amount and currency symbol from a raw text string

    Python · 342 stars · BSD-3-Clause · 51 forks · 17 issues (4 need help) · 9 pull requests · Updated Oct 6, 2025
  • andi Public

    Library for annotation-based dependency injection

    Python · 24 stars · BSD-3-Clause · 6 forks · 4 issues · 1 pull request · Updated Oct 3, 2025