返回一個(gè)列表每個(gè)元素都是Element類型,所有節(jié)點(diǎn)都包含在其中。
from lxml import etree
html=etree.parse('test',etree.HTMLParser())
result=html.xpath('//*') #//代表獲取子孫節(jié)點(diǎn),*代表獲取所有
print(type(html))
print(type(result))
print(result)
#
<class 'lxml.etree._ElementTree'>
<class 'list'>
[<Element html at 0x754b210048>, <Element body at 0x754b210108>, <Element div at 0x754b210148>, <Element ul at 0x754b210188>, <Element li at 0x754b2101c8>, <Element a at 0x754b210248>, <Element li at 0x754b210288>, <Element a at 0x754b2102c8>, <Element li at 0x754b210308>, <Element a at 0x754b210208>, <Element li at 0x754b210348>, <Element a at 0x754b210388>, <Element li at 0x754b2103c8>, <Element a at 0x754b210408>]
如要獲取li節(jié)點(diǎn),可以使用//后面加上節(jié)點(diǎn)名稱,然后調(diào)用xpath()方法
html.xpath('//li') #獲取所有子孫節(jié)點(diǎn)的li節(jié)點(diǎn)
更多建議: