html - Inspecting elements and using XPATH to get the correct data python -

- June 15, 2012

i'm trying scrape coinid's form website.

when inpecting element, id's seen here, when copying xpath get:

//*[@id="id-bitcoin"]

i'm planning on using python code:

from lxml import html  import requests   page = requests.get('http://coinmarketcap.com/all/views/all/') tree = html.fromstring(page.content)  id = tree.xpath('')  print id

but i'm not sure in element plug tree.xpath('')

i hoping

//span[@class="id"]/text()

i tried printing tree understand data better, it's printing `what's syntax see data, tree.getdata() ?

any info on how can these coin id names appreciated, thanks.

i suppose trying id's of tr tags. id attribute of tag, can this:

from lxml import html import requests  page = requests.get('http://coinmarketcap.com/all/views/all/') tree = html.fromstring(page.content)  trs = tree.xpath('//table[@id="currencies-all"]/tbody/tr')  tr in trs:     print tr.attrib.get('id')

you shall output this:

id-bitcoin id-ripple id-litecoin id-ethereum id-dash id-dogecoin ...

if want data each row tr, can find each td tag inside tr , extract text content.

for tr in trs:     tds = tr.findall('td')     data = [td.text_content().strip() td in tds]     print data

output:

['1', 'bitcoin', 'btc', '$ 6,815,160,833', '$ 452.70', '15,054,475', '$ 75,535,400', '-0.21 %', '5.19 %', '5.76 %'] ...

you might need cleanup data.

Search This Blog

Stadnd

html - Inspecting elements and using XPATH to get the correct data python -

Comments

Post a Comment

Popular posts from this blog

python - Statsmodels.api Logit model error ValueError: endog must be in the unit interval -

Capture and play voice with Asterisk ARI -

c++ - Can not find the "fiostream.h" file -