Hi..to all today i learnt have to extract a webpage in python …here is the simple code…it’s so easy.. we need to install the following..
$ sudo apt-get install python-setuptools
$ sudo easy_install stripogram
import urllib
from stripogram import html2text
myurl=urllib.urlopen(“https://tuxbalaji.wordpress.com”)
html_string=myurl.read()
text= html2text( html_string )
print(text)
This is will print the source page of the given url as text as our output..
Thanks…enjoy coding in python….:)