OPEN_SOURCE ↗
PH · PRODUCT_HUNT// 20d agoPRODUCT LAUNCH
DataSieve 2.0 broadens offline data extraction
DataSieve 2.0 updates the offline text-to-data app formerly known as TextMine. It now handles text, PDFs, EPUBs, spreadsheets, Word docs, folders, and archives locally, then exports structured results without sending data to the cloud.
// ANALYSIS
This is the right kind of 2.0: it turns a narrow extractor into a genuinely useful document-ingestion tool. The standout value is privacy plus batch processing, which makes it feel more like lightweight ETL than a novelty app.
- –Batch extraction across multiple data types is the time saver here, especially for reports, inbox exports, and research dumps.
- –Local-only processing is a real trust differentiator for sensitive docs and avoids sending raw files to a cloud service.
- –Custom extractors make the app feel extensible enough for recurring analyst workflows, and it could slot into ETL or LLM preprocessing.
- –Early Reddit feedback already called out PDFs, repeated configuration, and multi-field extraction; this release appears to answer those pain points.
// TAGS
datasievedata-toolsautomationsearch
DISCOVERED
20d ago
2026-03-23
PUBLISHED
20d ago
2026-03-23
RELEVANCE
5/ 10
AUTHOR
[REDACTED]