www.semtech-solutions.co.nz
info@semtech-solutions.co.nz
Mahout: What is it?
Machine learning
For large data
Based on Hadoop
But can work on a non-Hadoop cluster
Scalable
Licensed by Apache
Uses Hadoop MapReduce
Has many supplied algorithms
Supports four use cases
A branch of artificial intelligence
Systems that learn from data
Classify data after learning
Learn on training data sets
Generalisation: the ability to classify unseen data sets after learning
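
To make the idea of generalisation concrete, here is a minimal sketch in plain Python. It is not Mahout code: the nearest-centroid method, the function names train and classify, and the sample points are all assumptions chosen for illustration. The system learns one average point per label from labelled data, then classifies points it has not seen before.

```python
# Illustrative only: a tiny nearest-centroid classifier in plain Python.
# The data, labels and function names are made up for this sketch.

def train(samples):
    """Learn one centroid (mean point) per label from (features, label) pairs."""
    sums, counts = {}, {}
    for features, label in samples:
        acc = sums.setdefault(label, [0.0] * len(features))
        for i, value in enumerate(features):
            acc[i] += value
        counts[label] = counts.get(label, 0) + 1
    return {label: [s / counts[label] for s in acc] for label, acc in sums.items()}


def classify(model, features):
    """Return the label whose centroid is closest (squared Euclidean distance)."""
    def distance(centroid):
        return sum((a - b) ** 2 for a, b in zip(features, centroid))
    return min(model, key=lambda label: distance(model[label]))


# Training data: the system learns from these labelled points.
training_set = [
    ((1.0, 1.0), "small"), ((1.5, 2.0), "small"), ((2.0, 1.5), "small"),
    ((8.0, 8.0), "large"), ((9.0, 8.5), "large"), ((8.5, 9.5), "large"),
]
model = train(training_set)

# Unseen data: points that were not in the training set.
unseen = [(1.2, 1.8), (9.5, 9.0)]
predictions = [classify(model, point) for point in unseen]
print(predictions)  # ['small', 'large']
```

The two unseen points are classified correctly even though neither appears in the training set, which is what generalisation means here.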
Mahout Algorithms
Some of the available algorithms (among many others)
Collaborative filtering
  Narrow sense: make predictions about user interests by collecting preferences
  General sense: multi-agent collaboration for information filtering
Mode seeking, used for visual tracking
Find unique features
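
Collaborative filtering in the narrow sense can be sketched in a few lines of plain Python. This is not Mahout's API: the users, items, ratings, the choice of cosine similarity, and the function names are assumptions made for illustration. A missing rating is predicted as the similarity-weighted average of the ratings given by other users.

```python
# Illustrative only: user-based collaborative filtering in plain Python.
# The users, items and ratings are invented; this is not Mahout's API.
from math import sqrt

ratings = {
    "alice": {"book_a": 5.0, "book_b": 3.0, "book_c": 4.0},
    "bob":   {"book_a": 4.0, "book_b": 2.0, "book_c": 5.0, "book_d": 4.0},
    "carol": {"book_a": 1.0, "book_b": 5.0, "book_d": 2.0},
}


def cosine_similarity(a, b):
    """Cosine similarity over the items both users have rated."""
    shared = set(a) & set(b)
    if not shared:
        return 0.0
    dot = sum(a[i] * b[i] for i in shared)
    norm_a = sqrt(sum(a[i] ** 2 for i in shared))
    norm_b = sqrt(sum(b[i] ** 2 for i in shared))
    return dot / (norm_a * norm_b)


def predict(user, item):
    """Predict a rating as the similarity-weighted average of other users' ratings."""
    weighted, total = 0.0, 0.0
    for other, prefs in ratings.items():
        if other == user or item not in prefs:
            continue
        weight = cosine_similarity(ratings[user], prefs)
        weighted += weight * prefs[item]
        total += weight
    return weighted / total if total else None


# alice has not rated book_d; estimate it from bob's and carol's ratings.
print(round(predict("alice", "book_d"), 2))
```

Because alice's tastes are closer to bob's than to carol's, the prediction leans towards bob's rating of book_d.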
Mahout Install
So how do we install Mahout and test it?
Install Maven
  sudo apt-get install maven3
You will need Subversion installed
  svn co http://svn.apache.org/repos/asf/mahout/trunk
Go to the directory containing the pom.xml file
  mvn install    ## in ./trunk
Full details available in the Mahout install guide on our web site shop
To test the install:
  cd $MAHOUT_HOME/examples/bin
  ./build-reuters.sh
Choose option 1, kmeans clustering
Should finish with (see next slide)
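
For readers unfamiliar with k-means clustering, the following plain Python sketch shows the basic idea on a handful of 2-D points. It is only an illustration under stated assumptions: the points, the starting centroids and the function name kmeans are invented, and it makes no attempt to mirror how Mahout runs the Reuters example.

```python
# Illustrative only: basic k-means on 2-D points in plain Python.
# Mahout's own implementation is not shown or imitated here.

def kmeans(points, centroids, iterations=10):
    """Alternate between assigning points to the nearest centroid and re-averaging."""
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(
                range(len(centroids)),
                key=lambda i: sum((a - b) ** 2 for a, b in zip(p, centroids[i])),
            )
            clusters[nearest].append(p)
        # Keep the old centroid if a cluster ends up empty.
        centroids = [
            tuple(sum(dim) / len(c) for dim in zip(*c)) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
    return centroids, clusters


points = [(1.0, 1.0), (1.0, 2.0), (2.0, 1.0), (8.0, 8.0), (9.0, 8.0), (8.0, 9.0)]
# Starting centroids are picked by hand so the run is repeatable.
centroids, clusters = kmeans(points, centroids=[(0.0, 0.0), (10.0, 10.0)])
print(centroids)
print(clusters)
```

The six points separate into two groups of three, one near the origin and one near (8, 8), with each centroid at the average of its group.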
Contact Us
www.semtech-solutions.co.nz info@semtech-solutions.co.nz
We offer IT project consultancy
We are happy to hear about your problems
You can just pay for those hours that you need to solve your problems