Python library + CLI to bulk-download images from Bing or DuckDuckGo. Embeddable API, async, typed errors, JSONL manifest for ML training data. Good first issues available.
-
Updated
Jun 13, 2026 - Python
Python library + CLI to bulk-download images from Bing or DuckDuckGo. Embeddable API, async, typed errors, JSONL manifest for ML training data. Good first issues available.
Synthetic Japanese business email generator for ML training data
Local-first Python + TypeScript SDK for public data — one interface for quants, ML pipelines, and AI agents. Adapters ship weather (METAR, ASOS, GHCNh, NWS CLI) + prediction-market settlements (Kalshi, Polymarket) today. SEC filings (EDGAR), Federal Reserve (FRED), court filings, FDA approvals, and equities structured data are next.
Add a description, image, and links to the ml-training-data topic page so that developers can more easily learn about it.
To associate your repository with the ml-training-data topic, visit your repo's landing page and select "manage topics."