html - Inspecting elements and using XPATH to get the correct data python -
i'm trying scrape coinid's form website.
when inpecting element, id's seen here, when copying xpath get:
//*[@id="id-bitcoin"]
i'm planning on using python code:
from lxml import html import requests page = requests.get('http://coinmarketcap.com/all/views/all/') tree = html.fromstring(page.content) id = tree.xpath('') print id
but i'm not sure in element plug tree.xpath('')
i hoping
//span[@class="id"]/text()
i tried printing tree understand data better, it's printing `what's syntax see data, tree.getdata() ?
any info on how can these coin id names appreciated, thanks.
i suppose trying id
's of tr
tags. id attribute of tag, can this:
from lxml import html import requests page = requests.get('http://coinmarketcap.com/all/views/all/') tree = html.fromstring(page.content) trs = tree.xpath('//table[@id="currencies-all"]/tbody/tr') tr in trs: print tr.attrib.get('id')
you shall output this:
id-bitcoin id-ripple id-litecoin id-ethereum id-dash id-dogecoin ...
if want data each row tr
, can find each td
tag inside tr
, extract text content.
for tr in trs: tds = tr.findall('td') data = [td.text_content().strip() td in tds] print data
output:
['1', 'bitcoin', 'btc', '$ 6,815,160,833', '$ 452.70', '15,054,475', '$ 75,535,400', '-0.21 %', '5.19 %', '5.76 %'] ...
you might need cleanup data.
Comments
Post a Comment