Apache Nutch

Apache Nutch is an extensible and scalable open source web crawler software project. Nutch provides extensible interfaces such as Parse, Index, and ScoringFilter for custom implementations, e.g. Apache Tika for parsing.

8.0/10 (Expert Score) ★★★★★
Rated #30 in the Java Web Frameworks Software category
Ease of use: 8.2
Support: 7.9
Ease of Setup: 7.9

Customer Reviews

Apache Nutch Reviews

Narendra A.

Advanced user of Apache Nutch
★★★★★
Apache Nutch is a rockstar at large-scale data crawling.

What do you like best?

When I used Apache Nutch, I was amazed by the speed at which it crawls data and by the libraries and data structures it provides for customising the crawl and reading the data in the desired format. I was crawling all of IBM's data to gain insights and run text analytics on it. The support I got from the forums was also great. Overall, it was a nice experience using the Apache Nutch crawler.

What do you dislike?

What I disliked was the video support available for it on the Internet.

Recommendations to others considering the product:

It's nice to use and provides lots of flexibility.

What problems are you solving with the product? What benefits have you realized?

I was solving a data analytics problem in my organisation, where we automate the whole bidding process with text analytics.

Review source: G2.com
