Skip to content
#

news-crawler

Here are 17 public repositories matching this topic...

A Scrapy package based web scraper for collecting Kurdish text data from websites. The tool recursively crawls specified domains, extracts article content using Trafilatura, and filters results by language using Facebook's FastText language identification model.

  • Updated Mar 29, 2026
  • Python

Improve this page

Add a description, image, and links to the news-crawler topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the news-crawler topic, visit your repo's landing page and select "manage topics."

Learn more