GIT_FEED

apache/tika

The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).

View on GitHub
0Active

On the radar — signal detected

Stars
3.7k
Forks
921
Contributors
180
Language
Java

Score updated Apr 12, 2026

// SUBSCRIBE

The repos that moved this week, why they matter, and what to watch next. One email. No noise.