Regex that only matches text that's not part of HTML markup? (python)?

Since you are using Python anyway, if I were you, I would have a look at Beautiful Soup which is a Python HTML/XML parser Really, there are so many special cases and headaches with writing your own parser, it just doesn't worth the effort. Your regular expression will get unmanageably large and will still not yield the correct results in all of the cases Just use Beautiful Soup.

Since you are using Python anyway, if I were you, I would have a look at Beautiful Soup, which is a Python HTML/XML parser. Really, there are so many special cases and headaches with writing your own parser, it just doesn't worth the effort. Your regular expression will get unmanageably large and will still not yield the correct results in all of the cases.

Just use Beautiful Soup.

Since you are using Python anyway, if I were you, I would have a look at Beautiful Soup, which is a Python HTML/XML parser. Really, there are so many special cases and headaches with writing your own parser, it just doesn't worth the effort. Your regular expression will get unmanageably large and will still not yield the correct results in all of the cases.

I cant really gove you an answer,but what I can give you is a way to a solution, that is you have to find the anglde that you relate to or peaks your interest. A good paper is one that people get drawn into because it reaches them ln some way.As for me WW11 to me, I think of the holocaust and the effect it had on the survivors, their families and those who stood by and did nothing until it was too late.

Related Questions