阿葆的學習筆記: Python 網頁擷取(爬蟲) 實作 -以Yahoo拍賣為例

2016年7月7日星期四

Python 網頁擷取(爬蟲) 實作 -以Yahoo拍賣為例

以Yahoo拍賣為例

擷取標頭文字、價錢，如下圖畫紅圈

使用google chrome開發者工具，擷取網頁URL

使用InfoLite 擷取網頁原始碼

-----------------------------------------------------------------------------------------

擷取網頁區塊.srp-pdconten

下圖紅框部分

InfoLite

-----------------------------------------------------------------------------------------

擷取標頭文字 .srp-pdtitle

綠色部分

InfoLite

-----------------------------------------------------------------------------------------

擷取價錢 .srp-pdprice

黃色部分

InfoLite

-----------------------------------------------------------------------------------------

程式碼:

import requests

from bs4 import BeautifulSoup

res = requests.get("https://tw.search.bid.yahoo.com/search/product;_ylt=AlVvPYF4uAi_N312wTXpbl1yFbN8;_ylv=3?p=iphone+6+plus+%E6%89%8B%E6%A9%9F%E6%AE%BC&property=auction&sub_property=auction&srch=product&aoffset=0&poffset=0&pg=1&pptf=3&act=srp&rescheck=1&pmt=30&its=16&cid=4638850&clv=4&sort=etime&nst=1&fr=aucpromo&show=pic&show_flag=1&view=pic&hpp=hp_topkeyword_04_07&fr=aucpromo")

soup = BeautifulSoup(res.text)

count = 1

for item in soup.select(' .srp-pdcontent'):

print '======[',count,']========='

print item.select(' .srp-pdtitle')[0].text.strip()

print item.select(' .srp-pdprice')[0].text.strip()

count += 1

-----------------------------------------------------------------------------------------

執行結果:

阿葆的學習筆記

2016年7月7日星期四

Python 網頁擷取(爬蟲) 實作 -以Yahoo拍賣為例

沒有留言:

張貼留言

2016年7月7日 星期四

Python 網頁擷取(爬蟲) 實作 -以Yahoo拍賣為例

沒有留言:

張貼留言

2016年7月7日星期四