GitHub Org Scraper Scripts

2025-05-25

This suite of scripts scrapes GitHub for organization information and produces a TSV file containing each organization's name, URL, location, and the star counts of selected repositories. `get_all_orgs.sh` fetches a list of all GitHub organizations; `get_orgs_tsv.py` scrapes data for each organization and writes the TSV; `sorry.sh` deals with GitHub's rate limiting, which requires the user to manually reset their IP address.
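As a rough illustration of the output format, here is a minimal sketch of how scraped records could be written as TSV. It is not the code in `get_orgs_tsv.py`: the column names, the `orgs_to_tsv` helper, the `repo:count` encoding of star counts, and the sample organization are all assumptions made for the example.

```python
import csv
import io

# Hypothetical column layout; the real script's header names are not documented here.
FIELDS = ["name", "url", "location", "stars"]


def orgs_to_tsv(orgs):
    """Serialize a list of organization dicts to a TSV string with a header row."""
    buf = io.StringIO()
    writer = csv.writer(buf, delimiter="\t", lineterminator="\n")
    writer.writerow(FIELDS)
    for org in orgs:
        # Assumed encoding: pack per-repository star counts into one cell
        # as "repo:count" pairs separated by semicolons.
        stars = ";".join(f"{repo}:{count}" for repo, count in org["stars"].items())
        # Organizations without a public location get an empty cell.
        writer.writerow([org["name"], org["url"], org.get("location") or "", stars])
    return buf.getvalue()


# Made-up sample record, for illustration only.
sample = [
    {
        "name": "example-org",
        "url": "https://github.com/example-org",
        "location": "Berlin",
        "stars": {"repo-a": 120, "repo-b": 45},
    }
]

print(orgs_to_tsv(sample), end="")
```

Using the `csv` module with a tab delimiter, rather than joining strings by hand, means a stray tab or quote character in an organization's name or location is quoted instead of silently breaking the column layout.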
