Just how much Java does one need to use Hadoop and Mahout effectively?

It shouldn't be all that difficult to get data from PHP to Java for analysis using Mahout and Hadoop.

Up vote 4 down vote favorite 3 share g+ share fb share tw.

I'm a PHP developer. Let's just get that out of the way now. But Hadoop – and Mahout in particular – have piqued my interest.

I'm ready to take the dive into Java in order to use them. So from people experience enough to know, just how much Java will I need to be able to use these effectively? From what I've seen, programming mappers/reducers doesn't take all that much.

But with Mahout I'm not at all sure what I'm looking at when I look at the documentation. Also, just how hard will it be to take data from my PHP application for processing in Java via Hadoop and Mahout? I can't imagine it'd be that difficult, but I'm not experienced enough to say.

Java php hadoop mahout link|improve this question asked Jul 22 '10 at 18:21Josh Smith2,733839 88% accept rate.

It shouldn't be all that difficult to get data from PHP to Java for analysis using Mahout and Hadoop. Even easier is to process using Mahout and Hadoop off-line in a batch mode and to store the data products in a file system or database. PHP can then read these data products as easy as falling off a log.

For real-time use, the recommendations part of Mahout supports a variety of web-service interfaces that make it pretty easy to access from PHP. Hitting the model evaluation part of Mahout would require a bit more programming.

I'm not sure I've come across this so far myself. In the meantime, thanks for your answer! – Josh Smith Jul 22 '10 at 21:40 Nevermind.

I think I found it under the Taste documentation. For a noob like myself, though, would you mind expanding slightly on how PHP might be integrated to work with Mahout in a real-time application? I'd deeply appreciate it.

– Josh Smith Jul 22 '10 at 21:43 Sorry to be slow answering... but PHP is easy to integrate via web-services calls from PHP to Mahout's Taste components. Another alternative would be to use Quercus to run PHP from a Java environment and call Apache Mahout components directly. – Ted Dunning Sep 12 '10 at 8:28 So cwiki.apache.org/MAHOUT/recommender-documentation.html is enough to get a (PHP-friendly) Web service up and running, with nothing much more than a text file of ratings to configure.

It barely scratches the surface of what's possible but is a good way to get started. – Dan Brickley Feb 24 '11 at 21:58.

Beginner level of Java is sufficient. You can always dug deep on adhoc need basis.

I just did the same thing, and it's been years I did anything Java related. What I did was the following: Started off with simple Hadoop streaming examples Try my own analysis with PHP streaming Started experimenting with Pig Start experimenting with using PHP streaming inside Pig All without any Java!

For real-time recommendations you could also instantiate an instance of mahout in a java servlet class, then serve export that as a war to serve up on a tomcat server.

I cant really gove you an answer,but what I can give you is a way to a solution, that is you have to find the anglde that you relate to or peaks your interest. A good paper is one that people get drawn into because it reaches them ln some way.As for me WW11 to me, I think of the holocaust and the effect it had on the survivors, their families and those who stood by and did nothing until it was too late.

Related Questions