Michal Goral a36b7ab724 version 0.1.2 7 months ago

RSS Scrap

rss-scrap is a command-line utility that scrapes the contents of web pages and converts them to RSS feeds. A dedicated scraper must be implemented for each page.
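As a rough illustration of the scrape-then-convert idea, the sketch below pairs a page-specific scraper with a function that serializes the results as RSS 2.0. It is not rss-scrap's actual code: `Item`, `LinkScraper` and `build_rss` are hypothetical names, and the scraper simply treats every link on a page as a feed entry. Only the Python standard library is used.

```python
from dataclasses import dataclass
from html.parser import HTMLParser
import xml.etree.ElementTree as ET


@dataclass
class Item:
    """One scraped entry; the field names are illustrative assumptions."""
    title: str
    link: str
    description: str


class LinkScraper(HTMLParser):
    """Hypothetical page-specific scraper: every <a href> becomes an item."""

    def __init__(self):
        super().__init__()
        self.items = []
        self._href = None  # href of the <a> currently being read, if any
        self._text = []    # text fragments collected inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            title = "".join(self._text).strip()
            self.items.append(Item(title, self._href, title))
            self._href = None


def build_rss(title, link, description, items):
    """Serialize scraped items as a minimal RSS 2.0 document (a string)."""
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = title
    ET.SubElement(channel, "link").text = link
    ET.SubElement(channel, "description").text = description
    for item in items:
        node = ET.SubElement(channel, "item")
        ET.SubElement(node, "title").text = item.title
        ET.SubElement(node, "link").text = item.link
        ET.SubElement(node, "description").text = item.description
    return ET.tostring(rss, encoding="unicode")
```

A real scraper would select only the elements that matter on its target page, which is why each page needs its own implementation; the RSS serialization step can stay the same for all of them.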

rss-scrap works asynchronously, so many web pages can be scraped concurrently.
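The benefit of the asynchronous approach can be sketched with `asyncio` from the standard library. This is a minimal illustration, not rss-scrap's implementation: `fetch_page`, `scrape_all` and `run_demo` are hypothetical names, and `asyncio.sleep` stands in for a real network request so the example runs offline.

```python
import asyncio
import time


async def fetch_page(name, delay):
    """Stand-in for a network request; asyncio.sleep simulates I/O latency."""
    await asyncio.sleep(delay)
    return f"<html>{name}</html>"


async def scrape_all(pages):
    """Start every fetch at once and wait until all of them have finished."""
    bodies = await asyncio.gather(
        *(fetch_page(name, delay) for name, delay in pages.items())
    )
    # gather() returns results in the order the awaitables were passed in.
    return dict(zip(pages, bodies))


def run_demo():
    """Scrape three simulated pages and report the elapsed wall-clock time."""
    pages = {"economist": 0.2, "wikipedia": 0.2, "gis": 0.2}
    start = time.perf_counter()
    result = asyncio.run(scrape_all(pages))
    return result, time.perf_counter() - start
```

Because the three simulated requests wait at the same time, `run_demo()` should take roughly as long as the slowest request rather than the sum of all three.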

Scrapers are currently implemented for the following pages:

  • The Economist, World This Week section
  • Wikipedia Current Events
  • Warnings issued by Główny Inspektorat Sanitarny (Chief Sanitary Inspectorate, a Polish government agency)