Crawling the web, Harnessing the power of Nutch with Scala

Nutch is a very powerful, open source webcrawler written in Java. Apache Nutch can run very large crawls in parallel, downloading, indexing, and archiving millions of pages. In this talk we understand key architectural details about Nutch. We would see how it is easy to extend the Nutch behavior with Scala plugins.

[…]

Vikas Hazrati

Vikas is the co-founder and software craftsman at Knoldus. In his 16 years of experience he has become a recognized speaker, mentor, and practitioner in the software industry.

At Knoldus he is responsible for keeping the organization on the forefront of technology adoption curve. Knoldus is one of the very few organizations, if not the […]

Building Massively Scalable Applications with Akka

Historically writing correct concurrent, scalable and fault-tolerant applications has been very hard. Akka is an attempt to simplify writing concurrent, scalable and highly available software for the JVM. Akka has an API both for Scala and Java. Akka uses the Actor Model together with Software Transactional Memory (STM) to raise the abstraction level. For fault-tolerance […]