Kalozera wa Nkhani
Kangaude wa Baidu amakwawa ndikuzindikira zachilendo: Nditani ngati soketiyo yawerengedwa kapena kulembedwa?
Pongoganiza kuti tsamba lanu silinalembedwe ndi a Baidu, muyenera choyamba kudziwa kangaude pa nsanja ya Baidu.
Nditani ngati chokwawa cha Baidu chalephera kukwawa ulalo wozindikira matenda?
Ngati chokwawa cha Baidu chikalephera kukwawa ndikuzindikira kangapo, chotchingira chikhoza kutsekereza chokwawacho.
Baidu Search Resource Platform > Crawl Diagnosis > Crawl Abnormal Information: cholakwika chowerengera / kulemba ▼
- Makamaka mukamagwiritsa ntchito Cloudflare CDN, yomwe imatsekedwa mwachisawawa.
- Pa intaneti, pali nkhani yowonjezera adilesi ya IP
xxx.xxx.xxx.xxx/24
- Komabe, anayesa sizinaphule kanthu.
Sindikuletsa akangaude a Baidu pa seva, ndiye vuto liyenera kukhala WAF ya Cloudflare!
Lowani mu Cloudflare → Chitetezo → WAF → Malamulo Ozimitsa Moto → Pangani Lamulo Laziwopsezo
- Kuyang'ana malamulo a WAF okhudzana ndi zokwawa pa Cloudflare ndikupeza njira ya "Legal Robot Crawler" ▼
- Pambuyo popanga malamulo a firewall, dikirani kwa mphindi 10, ndiyeno gwirani matendawo, ndipo onsewo agwidwa bwino!
Baidu crawler Sitemap yalephera kukwawa, kulumikizana kwatha?
Ngati adilesi ya fayilo ya sitemap itumizidwa pa nsanja ya kufufuza kwa Baidu, padzakhala zovuta zokwawa ndikulephera kulumikiza kutha ▼
Baidu crawler walephera kukwawa njira ya mapu a Sitemap
Lowani mu Cloudflare → Chitetezo → WAF → Malamulo a Zozimitsa moto → Pangani Malamulo Oziteteza ▼
- m'munda, sankhani User-Agent
- opareta, sankhani "zili"
- Onjezani wogwiritsa ntchito watsopano, dinani "Kapena" kumapeto
- mtengo, lowetsani otsatirawa a Baidu Spider UA motsatana:
-
Baiduspider/2.0
-
Baiduspider-image
-
Baiduspider-render/2.0
-
http://www.baidu.com/search/spider.html
-
Mozilla/5.0 (compatible; Baiduspider/2.0; +http://www.baidu.com/search/spider.html)
-
Mozilla/5.0 (Linux;u;Android 4.2.2;zh-cn;) AppleWebKit/534.46 (KHTML,like Gecko) Version/5.1 Mobile Safari/10600.6.3 (compatible; Baiduspider/2.0; +http://www.baidu.com/search/spider.html)
Mukamaliza, yesani ndikutenganso, ndipo zotsatira zake zimabwezeretsa mutu wa HTTP 200, zomwe zikuwonetsa kuti kutengako kuli bwino▼
-
抓取诊断 > 抓取详情
以下是百度Spider抓取结果及页面信息:
-
提交网址: https://www.etufo.org/sitemap_baidu.xml
-
抓取网址: https://www.etufo.org/sitemap_baidu.xml
-
抓取UA: Mozilla/5.0 (compatible; Baiduspider/2.0;
-
+http://www.baidu.com/search/spider.html)
-
抓取时间: 2022-11-11 19:03:44
-
网站IP: 172.***.***.149
-
下载时长: 0.868秒
-
返回HTTP头:HTTP/2 200
Ogwiritsa ntchito akangaude ena ndi zokwawa amathanso kudzifufuza mwanjira yomweyo.
Hope Chen Weiliang Blog ( https://www.chenweiliang.com/ ) adagawana nawo "Kangaude wa Baidu akulephera kuzindikira socket yodziwika bwino werengani ndi kulemba zolakwika zoyenera kuchita pakutha kwa intaneti", zomwe ndi zothandiza kwa inu.
Takulandirani kugawana ulalo wa nkhaniyi:https://www.chenweiliang.com/cwl-29315.html
Takulandilani panjira ya Telegraph yabulogu ya Chen Weiliang kuti mupeze zosintha zaposachedwa!
📚 Bukuli lili ndi phindu lalikulu, 🌟Uwu ndi mwayi wosowa, musaphonye! ⏰⌛💨
Share ndi like ngati mukufuna!
Kugawana kwanu ndi zomwe mumakonda ndizomwe zimatilimbikitsa nthawi zonse!