Sengoli sa Lingoloa
Baidu Spider e hapa lintlha tsa mokhelo: Ke lokela ho etsa eng haeba sokete e bala le ho ngola ka phoso?
Ka ho nka hore sebaka sa hau sa Marang-rang ha se so thathamisitsoe ke Baidu, u tlameha ho qala ka ho etsa tlhahlobo ea sekho seatleng sa Baidu.
Ke etse'ng ha Baidu crawler e hloleha ho khasa likhokahano tsa tlhahlobo?
Haeba tlhahlobo ea Baidu crawler e hloleha makhetlo a 'maloa, firewall e kanna ea thiba lenaneo la crawler.
Baidu Search Resource Platform > Crawl Diagnosis > Crawl Exception Information: Socket bala le ho ngola liphoso ▼
- Haholo-holo ha u sebelisa Cloudflare CDN, e thibetsoe ka ho sa feleng.
- Marang-rang, ho thoe ho eketsa aterese ea IP
xxx.xxx.xxx.xxx/24
- Leha ho le joalo, ke ile ka leka seo ha sea ka sa atleha.
Ha kea thibela likho tsa Baidu ho seva, kahoo bothata e lokela ho ba WAF ea Cloudflare!
Kena ho Cloudflare → Tšireletso → WAF → Melao ea Li-firewall → Theha Molao oa Firewall
- Fumana melao ea WAF e amanang le bakhanni ho Cloudflare, 'me u fumane khetho ea "seqhobi se nepahetseng sa liroboto" ▼
- Kamora ho theha melao ea li-firewall, emela metsotso e 10, ebe u tšoara tlhahlobo, 'me kaofela ha bona ba ile ba tšoaroa ka katleho!
Phoso ke efe ka Baidu crawler ho hloleha ho khasa Sitemap le nako ea ho tima?
Haeba u fana ka aterese ea faele ea Sitemap sethaleng sa lisebelisoa tsa ho batla tsa Baidu, ho tla ba le mathata a kang ho hloleha ho khasa le nako ea khokahano ▼
Tharollo ho hloleheng ha Baidu crawler ho nka 'mapa oa Sitemap
Kena ho Cloudflare → Tšireletso → WAF → Melao ea Li-firewall → Theha Melao ea Li-firewall ▼
- lebaleng, khetha "Moemeli oa mosebelisi"
- opareitara, kgetha E na
- Kenya mosebelisi e mocha, tobetsa ea ho qetela "Kapa"
- Boleng, ka ho latellana kenya mosebedisi o latelang wa Baidu Spider UA:
-
Baiduspider/2.0
-
Baiduspider-image
-
Baiduspider-render/2.0
-
http://www.baidu.com/search/spider.html
-
Mozilla/5.0 (compatible; Baiduspider/2.0; +http://www.baidu.com/search/spider.html)
-
Mozilla/5.0 (Linux;u;Android 4.2.2;zh-cn;) AppleWebKit/534.46 (KHTML,like Gecko) Version/5.1 Mobile Safari/10600.6.3 (compatible; Baiduspider/2.0; +http://www.baidu.com/search/spider.html)
Kamora ho phethela, leka ho lata hape, 'me sephetho se khutlisa sehlooho sa HTTP 200, se bontšang hore ho lata ho atlehile▼
-
抓取诊断 > 抓取详情
以下是百度Spider抓取结果及页面信息:
-
提交网址: https://www.etufo.org/sitemap_baidu.xml
-
抓取网址: https://www.etufo.org/sitemap_baidu.xml
-
抓取UA: Mozilla/5.0 (compatible; Baiduspider/2.0;
-
+http://www.baidu.com/search/spider.html)
-
抓取时间: 2022-11-11 19:03:44
-
网站IP: 172.***.***.149
-
下载时长: 0.868秒
-
返回HTTP头:HTTP/2 200
Basebelisi ba likho tse ling le lihahabi le bona ba ka ipatla ka tsela e tšoanang.
Hope Chen Weiliang Blog ( https://www.chenweiliang.com/ ) o ile a arolelana "Baidu Spider Crawl Failure Diagnosis Tlhahisoleseding e sa Tloaelehang Seo o ka se Etsang Haeba Khokahano ea Phoso ea Socket Read and Write e Feletsoe ke Nako", e leng thuso ho uena.
Rea u amohela ho arolelana sehokelo sa sengoloa sena:https://www.chenweiliang.com/cwl-29315.html
Rea u amohela ho mocha oa Telegraph oa blog ea Chen Weiliang ho fumana lintlha tsa morao-rao!
📚 Tataiso ena e na le boleng bo boholo, 🌟Ona ke monyetla o sa tloaelehang, se ke oa o fetoa! ⏰⌛💨
Share le rata haeba u rata!
Ho arolelana le lintho tseo u li ratang ke khothatso ea rona e tsoelang pele!