Capture data from HTML, parse it, add it to a DB and generate new reformatted HTML [closed]


I have an HTML page on my screen. I need to capture (scrape) specific data from it and place that data in a DB. Then I need to recall and reformat that data and display it back to the user.

The initial HTML is from a third-party site.

Tagged: web-scraping. Asked Mar 4 '11 at 22:43 by Todd.

There is a great tool for this: OutWit Hub, a Firefox extension. It is dedicated to scraping data from the web and putting it into a database.

You have a number of technologies on your side for parsing data from an HTML web page:

Java: a JSON API takes the HTML and returns the scraped data in a StringBuilder object, which can then be stored in a database.

PHP: there are parsers available in its standard library, such as libxml2, which can do the extraction. Alternatively, you can go the easy but horribly inefficient way and write a regular expression in PHP.

Python: there are built-in classes which you can easily instantiate to parse the data.

There are a lot of details that could be shared here, but I've just given you a head start!
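To make the Python route a bit more concrete, here is a minimal sketch of the whole round trip (scrape, store, re-render) using only the standard library: `html.parser.HTMLParser` for the parsing and `sqlite3` for the database. Everything specific in it is an assumption made for illustration, since the question does not say what the third-party page looks like: the sample table, the `items` table with its `name` and `price` columns, and the helper names `CellCollector`, `scrape`, `store` and `render` are all hypothetical.

```python
import sqlite3
from html import escape
from html.parser import HTMLParser


class CellCollector(HTMLParser):
    """Collect the text of every <td> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []      # finished rows, each a tuple of cell strings
        self._row = None    # cells of the row being read, or None
        self._cell = None   # text fragments of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag == "td" and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Only keep text that appears inside a <td>.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag == "td" and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            if self._row:  # skips header rows that contain only <th>
                self.rows.append(tuple(self._row))
            self._row = None


def scrape(html_text):
    """Return (name, price) tuples found in the page (assumed 2-column rows)."""
    parser = CellCollector()
    parser.feed(html_text)
    parser.close()
    return [row for row in parser.rows if len(row) == 2]


def store(conn, rows):
    """Insert the scraped rows into a hypothetical 'items' table."""
    conn.execute("CREATE TABLE IF NOT EXISTS items (name TEXT, price TEXT)")
    conn.executemany("INSERT INTO items (name, price) VALUES (?, ?)", rows)
    conn.commit()


def render(conn):
    """Read the rows back and reformat them as an HTML list."""
    items = conn.execute("SELECT name, price FROM items ORDER BY name").fetchall()
    lines = ["<ul>"]
    for name, price in items:
        # Escape on output so stored text cannot inject markup.
        lines.append(f"  <li>{escape(name)}: {escape(price)}</li>")
    lines.append("</ul>")
    return "\n".join(lines)


# Stand-in for the third-party page; the real markup will differ.
SAMPLE_HTML = """
<table>
  <tr><th>Name</th><th>Price</th></tr>
  <tr><td>Widget</td><td>9.99</td></tr>
  <tr><td>Gadget &amp; Co</td><td>19.50</td></tr>
</table>
"""

if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    store(conn, scrape(SAMPLE_HTML))
    print(render(conn))
```

For a real page you would fetch the HTML first and adapt `CellCollector` to whatever tags actually wrap the data you need. The parameterized `INSERT` and the `escape` call on output are there because the input comes from a site you do not control.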

I can't really give you an answer, but what I can give you is a way to a solution: you have to find the angle that you relate to or that piques your interest. A good paper is one that people get drawn into because it reaches them in some way. As for me, when I think of WWII, I think of the Holocaust and the effect it had on the survivors, their families, and those who stood by and did nothing until it was too late.
