Capture data from html, parse add to db and genrate a new reformatted html [closed]?

Up vote -1 down vote favorite share g+ share fb share tw.

I have a html on my screen. I need to capture (scrap) specific data from it and place that data in a db. Then I need to recall and reformat that data and display back to the user.

The initial html is from a third party site. Web-scraping link|improve this question asked Mar 4 '11 at 22:43Todd1.

It exists a great tool for this: OutWit Hub, as a Firefox extension. It is dedicated to scraping data over the internet and putting it into a database.

You have a number of technologies on your side for parsing data from an HTML webpage: Java JSON API takes the HTML and returns the scraped data to a StringBuilder object which can be used to store in a database PHP There are some parsers available in its standard library like libxml2 which can do the extraction, or, you can use the easy but horribly inefficient way of writing a regular expression in PHP. Python There are built-in classes which you can easily instantiate to parse the data. There's a lot of details which can be shared here, but, I've just given a head-start!

I cant really gove you an answer,but what I can give you is a way to a solution, that is you have to find the anglde that you relate to or peaks your interest. A good paper is one that people get drawn into because it reaches them ln some way.As for me WW11 to me, I think of the holocaust and the effect it had on the survivors, their families and those who stood by and did nothing until it was too late.

Related Questions


Thank You!
send