Apache Nutch

Apache Nutch
Original author(s)Doug Cutting, Mike Cafarella
Developer(s)Apache Software Foundation
Stable release
1.x1.20 / 24 April 2024 (2024-04-24)
2.x2.4 / 11 October 2019 (2019-10-11)
RepositoryNutch Github Repository
Written inJava
Operating systemCross-platform
TypeWeb crawler
LicenseApache License 2.0
Websitenutch.apache.org

Apache Nutch is a highly extensible and scalable open source web crawler software project.