Learn Link Analysis to Quickly Pinpoint Site SEO Problems
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><strong style="color: blue;"><span style="color: black;">[Main topics of this article]</span></strong></span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">(1) Checking for hidden spam links (black links): use log analysis to see which unexpected pages Baiduspider has crawled and whether any of them are black links. (I will hold most of this back for now because it is a big job in itself; this installment only touches on it.)</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">(2) External link analysis in Baidu Webmaster Tools: check for spam backlinks, black links and so on, see which parts of the site they point to, and decide how to handle them. (Also touched on in this installment.)</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">(3) Link analysis in Baidu Webmaster Tools: the three kinds of dead links (internal, outbound, inbound), batch-downloading the data, merging it, working on it in Excel, classifying it by a consistent logic, locating problems and fixing them. (Material on locating and fixing is thin, because most of the issues were already fixed and there is little left to show.)</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">(4) Other SEO-relevant findings from this data: useless crawling caused by junk search engines and spam backlinks, the crawl quota it wastes, and how to refuse it.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">(5) Using shell scripts to automatically find dead links crawled by Baiduspider, re-check them, and automatically submit the URLs confirmed as dead. (This installment is already long, so this is left for the next one.)</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">(6) The analysis tools: Firefox settings and add-ons, Excel, and batch processing in the Windows Command Prompt.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><strong style="color: blue;"><span style="color: black;">[New tricks you may pick up]</span></strong></span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">(1) Batch-downloading table data from Baidu Webmaster Tools. (The same trick works for downloading things from other sites if you like, 5118 for example. I hope the 5118 webmaster does not come after me.)</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">(2) Merging common text files such as txt and csv to make analysis and processing easier.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">(3) A basic approach to analyzing dead-link data and locating problems.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><strong style="color: blue;"><span style="color: black;">[Main tools used in this article]</span></strong></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">(These are just the ones used in the examples; if you have other tools that do the same job, use whatever you are used to.)</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">[Browser] Firefox; any version will do.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">[Add-on] Launch Clipboard</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Function: opens the URLs on the clipboard with one keystroke. (Note that the URLs may contain only Latin letters, digits and punctuation; URLs containing Chinese characters may not be recognized.) Shortcut: Alt + Shift + K, after copying one or more URLs.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><img src="http://mmbiz.qpic.cn/mmbiz/mYsbTZXlVl3okFgwqPmkGOjnJTG2DbiaBeViatdkjEt6y4kJWMfaia5CQOfwEBbfibpibF8qQ9bXKxNWIsFUQr5ntYA/640?wx_fmt=png&tp=webp&wxfrom=5&wx_lazy=1&wx_co=1" style="width: 50%; margin-bottom: 20px;"></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Settings: open Options and choose the folder where downloads are saved automatically. (I chose the desktop; you can also create a dedicated folder to keep the batch-downloaded files together.)</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><img src="http://mmbiz.qpic.cn/mmbiz/mYsbTZXlVl3okFgwqPmkGOjnJTG2DbiaBcXpX1sLLKHHicNlCNoG4OAx58DyAKB9VicxhRZEaHnYwS9557A8XEjMw/640?wx_fmt=png&tp=webp&wxfrom=5&wx_lazy=1&wx_co=1" style="width: 50%; margin-bottom: 20px;"></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">[Spreadsheets] Microsoft Office 2013 Excel</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">[Text editing] Notepad++</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">[Batch processing] The built-in Windows Command Prompt</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><strong style="color: blue;"><span style="color: black;">[Let's get started]</span></strong></span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Open Link Analysis in Baidu Webmaster Tools. It has two main sections: dead link analysis and external link analysis.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">1. Let's look at external link analysis first.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The main purpose of analyzing external link data is to find spam backlinks and proactively block the harm they can do to the site. The end goals are: (1) find the domains behind the spam backlinks and apply anti-hotlinking rules to them (requests whose referer is a spam domain get a 404 status code directly); (2) fix any pages on the site that may have problems.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">I will focus on the first goal; the second is simpler, so I will cover it only briefly.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Goal 1: identify the spam domains.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><img src="http://mmbiz.qpic.cn/mmbiz/fbac17wmeIFujC8MHlmcDeCnTm80OfGTv6y3YHniasErUdz31R3kK8XNGOa0cj6BXZlgUBRPTxbvuIutjjjXvtQ/640?wx_fmt=png&tp=webp&wxfrom=5&wx_lazy=1&wx_co=1" style="width: 50%; margin-bottom: 20px;"><span style="color: black;">Figure: the trend chart is clearly abnormal.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">We can download the external link data for a first-pass analysis.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><img src="http://mmbiz.qpic.cn/mmbiz/fbac17wmeIFujC8MHlmcDeCnTm80OfGTUU8vgottJuic1J0L5rsCHGtzkabLbzLAPZOxe7LUb3RXpTZ1LDRsFXg/640?wx_fmt=png&tp=webp&wxfrom=5&wx_lazy=1&wx_co=1" style="width: 50%; margin-bottom: 20px;"><span style="color: black;">Figure: the downloaded table file (comma-separated CSV).</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Raw data like this is hard to analyze as-is, so we need to organize it by some logic: here, by classifying rows on the [linked page URL] column.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">First, skim the data to get an intuitive sense of what kind of pages most of these are.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">For our site, the external links fall into two classes: normal backlinks and spam backlinks.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The spam backlinks in turn come in two kinds: links to on-site search result pages (with spam search terms) and links to black-link pages planted by attackers who broke into the site (already turned into dead links).</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Processing the data has two purposes. The first is to tell normal backlinks from spam backlinks and use the spam data to protect the site. The second is to make sure the pages targeted by spam links are neither crawled by search engines (which wastes crawl quota) nor indexed (so that the site's keyword profile is not polluted and its image and keyword rankings do not suffer).</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Step 1: filter out the site's search result pages.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><img src="http://mmbiz.qpic.cn/mmbiz/fbac17wmeIFujC8MHlmcDeCnTm80OfGTmc9ricWBEHWzSZzicia40cd9fpqTXLU7B2bVLJicA4kt5R8A2FJeEP9Bjg/640?wx_fmt=png&tp=webp&wxfrom=5&wx_lazy=1&wx_co=1" style="width: 50%; margin-bottom: 20px;"><img src="http://mmbiz.qpic.cn/mmbiz/fbac17wmeIFujC8MHlmcDeCnTm80OfGTsk4qBvic8jrqZZ6cG8PvdYBBrtuM92vmIKribfgfLicArcCaicz2AtnFhw/640?wx_fmt=png&tp=webp&wxfrom=5&wx_lazy=1&wx_co=1" style="width: 50%; margin-bottom: 20px;"><span style="color: black;">Figure: filter the rows, copy them to a new sheet, then delete the filtered rows from the original sheet to separate the classes.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">There are a few other search URL formats; handle them the same way.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Then deduplicate what is left in the original sheet (which also removes the blank rows) to get the remaining links.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><img src="http://mmbiz.qpic.cn/mmbiz/fbac17wmeIFujC8MHlmcDeCnTm80OfGTD75HI662qZ7icz1U3ibc7NibYN6v8Wqd8FJEFb91fUXec8gNkhTibsduqQ/640?wx_fmt=png&tp=webp&wxfrom=5&wx_lazy=1&wx_co=1" style="width: 50%; margin-bottom: 20px;"><span style="color: black;">Figure: simple deduplication of the remaining data.</span></p>
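<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">If you prefer a script to Excel, the same filter-and-deduplicate step could be sketched in Python roughly as follows. This is only a sketch under assumptions: the column names (source_url, target_url), the search URL markers and the sample rows are all made up for illustration, so replace them with the real header of your downloaded CSV and your site's real search URL formats.</span></p>

```python
import csv
import io

# Hypothetical markers of on-site search URLs; replace with your site's formats.
SEARCH_MARKERS = ("/search", "?q=", "?s=")


def split_rows(rows, url_column):
    """Split rows into (search_rows, other_rows), dropping blanks and duplicates."""
    search_rows, other_rows, seen = [], [], set()
    for row in rows:
        target = (row.get(url_column) or "").strip()
        if not target:
            continue  # blank row
        key = tuple(str(value) for value in row.values())
        if key in seen:
            continue  # exact duplicate of an earlier row
        seen.add(key)
        if any(marker in target for marker in SEARCH_MARKERS):
            search_rows.append(row)
        else:
            other_rows.append(row)
    return search_rows, other_rows


# Made-up sample data standing in for the downloaded CSV file.
SAMPLE = """source_url,target_url
http://spam.example/a,http://www.abc.com/search?q=casino
http://blog.example/post,http://www.abc.com/article/1
http://blog.example/post,http://www.abc.com/article/1
,
"""

rows = list(csv.DictReader(io.StringIO(SAMPLE)))
search_rows, other_rows = split_rows(rows, "target_url")
print(len(search_rows), "search-page rows;", len(other_rows), "other rows")
```

<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The two lists play the role of the new sheet and the cleaned-up original sheet in the Excel workflow.</span></p>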
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Next we need to filter out the black links. Black-link data usually has to come from analyzing the site logs first. (That is the most complete source, and for efficiency it needs shell scripts that run automatically, but it would take too much space here, so I will cover it in a later installment.)</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Alternatively, sort the table by the [linked page URL] column and go through the rows one by one, opening each page yourself. Be aware that attackers use tricks to keep us from seeing the spam content that search engines actually pick up. The most common one is a JavaScript redirect: when we visit in a browser we see completely different content, while a search engine crawling the page downloads the spam.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">For this we use the Firefox add-on NoScript, which blocks JavaScript on the site so that we see roughly what the search engine sees.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><img src="http://mmbiz.qpic.cn/mmbiz/fbac17wmeIFujC8MHlmcDeCnTm80OfGTAkmhRsZkrbbkvfHvlx2OHLmjjDJj9pju1UlpqqbZv6wLXTSnveeyRA/640?wx_fmt=png&tp=webp&wxfrom=5&wx_lazy=1&wx_co=1" style="width: 50%; margin-bottom: 20px;"><span style="color: black;">Figure: the add-on that blocks JavaScript in the browser.</span></p>
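<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The same idea, looking at the page without running its scripts, can also be approximated in code. The sketch below is an illustration and not part of the original workflow: it takes raw HTML that you have already fetched, separates the visible text from inline script source using Python's standard html.parser, and reports which spam terms appear in the text. The term list, the sample HTML and the "location" check for redirects are assumptions, and the redirect check is a crude heuristic only.</span></p>

```python
from html.parser import HTMLParser


class RawPageScanner(HTMLParser):
    """Separate visible text from inline script source in raw HTML."""

    def __init__(self):
        super().__init__()
        self._inside = None  # "script", "style", or None
        self.text_parts = []
        self.script_parts = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._inside = tag

    def handle_endtag(self, tag):
        if tag == self._inside:
            self._inside = None

    def handle_data(self, data):
        if self._inside == "script":
            self.script_parts.append(data)
        elif self._inside is None:
            self.text_parts.append(data)


def scan_raw_html(raw_html, spam_terms):
    """Report spam terms in the raw text and a hint of a JavaScript redirect."""
    scanner = RawPageScanner()
    scanner.feed(raw_html)
    scanner.close()
    text = " ".join(scanner.text_parts)
    scripts = " ".join(scanner.script_parts)
    return {
        "spam_terms_found": [term for term in spam_terms if term in text],
        # Crude heuristic: any mention of "location" in an inline script.
        "possible_js_redirect": "location" in scripts,
    }


# Made-up page: spam text in the HTML, hidden from browsers by a redirect.
SAMPLE_HTML = (
    "<html><head><title>casino bonus</title>"
    "<script>window.location.href = 'http://www.abc.com/';</script>"
    "</head><body><p>casino bonus codes</p></body></html>"
)

report = scan_raw_html(SAMPLE_HTML, ["casino", "lottery"])
print(report)
```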
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">There is also a less reliable screening method: search the engine for queries like [site:your-domain gambling], using keywords that have no business being on your site, and you will get plenty of links. (You need a way to export all those links in bulk; I will come back to that in a later installment.)</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">I have to skip the filtering walkthrough here; see the video for it.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><img src="http://mmbiz.qpic.cn/mmbiz/fbac17wmeIFujC8MHlmcDeCnTm80OfGTGIOzQgxeLPudLJPgoxKhAiawEdUrdicZgRica5Eibz4hEuJ8UvITqbR0YA/640?wx_fmt=png&tp=webp&wxfrom=5&wx_lazy=1&wx_co=1" style="width: 50%; margin-bottom: 20px;"><span style="color: black;">Figure: the black links filtered out of the site's data.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The reason for going to all this trouble is to record the domains behind the spam backlinks. Attackers tend to reuse those domains for new spam links, so with the list in hand we can refuse such links immediately: when Baiduspider follows a spam backlink to our site, it gets nothing back (a 404 status code, so the link is treated as dead). Over time the spam domains should lose weight, because they link out to dead links and get in the way of normal crawling. That way we protect ourselves and penalize the other side as well.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Concretely: collect the spam pages by merging the external link pages from the two sheets (search result pages and black links) into one, as shown in Sheet3.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><img src="http://mmbiz.qpic.cn/mmbiz/fbac17wmeIFujC8MHlmcDeCnTm80OfGTicpxuldLIbeucuEcGDDFzGeNs72cOJibNS0Ur60B7mmdYpLiaKoJSUyibQ/640?wx_fmt=png&tp=webp&wxfrom=5&wx_lazy=1&wx_co=1" style="width: 50%; margin-bottom: 20px;"><span style="color: black;">Figure: the merged list of spam backlink pages.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The next step uses a small tool to extract the main domain of each link quickly.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">https://www.benmi.com/getdomain.html</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><img src="http://mmbiz.qpic.cn/mmbiz/fbac17wmeIFujC8MHlmcDeCnTm80OfGT9FOksuJK7eaT1are6BoRicanGibDxabRoe69NiaZN2NibdunGBlKGRicskA/640?wx_fmt=png&tp=webp&wxfrom=5&wx_lazy=1&wx_co=1" style="width: 50%; margin-bottom: 20px;"><span style="color: black;">Figure: paste the links into the red box on the left and click the local-extract button; the results appear in the red box on the right.</span></p>
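<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">If you would rather not paste the links into an online tool, a rough local equivalent can be written with Python's standard urllib.parse. The sketch below is not how the tool above works internally (that is unknown to me). It returns the unique host names of the links; reducing a host name to its registrable main domain reliably needs the Public Suffix List, which this sketch does not attempt. The sample URLs are made up.</span></p>

```python
from urllib.parse import urlsplit


def extract_hosts(urls):
    """Return the unique host names of the given URLs, in first-seen order."""
    hosts = []
    for url in urls:
        url = url.strip()
        if not url:
            continue
        if "://" not in url:
            url = "http://" + url  # tolerate links pasted without a scheme
        host = urlsplit(url).hostname  # lower-cased, port removed
        if host and host not in hosts:
            hosts.append(host)
    return hosts


# Made-up spam backlink pages.
spam_pages = [
    "http://Spam.Example.com/a?x=1",
    "spam.example.com/b",
    "https://bad.example.net:8080/",
]
spam_hosts = extract_hosts(spam_pages)
print(spam_hosts)
```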
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Now we have the main domains of the spam backlink pages. All that is left is to configure anti-hotlinking on our server so that requests whose Referer is one of these domains are refused with an HTTP 404 status code.</span></p>
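<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The actual rule belongs in your web server's configuration, which depends on the server you run. The decision logic itself is simple, though, and can be sketched in Python as below. The blocked host names are hypothetical, and treating subdomains of a blocked domain as blocked too is my assumption rather than something stated above.</span></p>

```python
from urllib.parse import urlsplit

# Hypothetical spam domains collected in the previous step.
BLOCKED_REFERER_HOSTS = {"spam.example.com", "bad.example.net"}


def status_for_referer(referer, blocked_hosts=BLOCKED_REFERER_HOSTS):
    """Return 404 when the Referer belongs to a blocked domain, else 200."""
    if not referer:
        return 200  # no Referer header: nothing to match against
    host = urlsplit(referer).hostname or ""
    for blocked in blocked_hosts:
        # Assumption: subdomains of a blocked domain are blocked as well.
        if host == blocked or host.endswith("." + blocked):
            return 404
    return 200


print(status_for_referer("http://spam.example.com/page"))  # blocked domain
print(status_for_referer("http://www.abc.com/"))           # our own site
```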
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Goal 2: handle the search result pages from within the site. (Black-link handling is saved for the next installment because it leans heavily on Linux shell scripts.)</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">On-site search on a high-authority site must have anti-spam measures. Without them, once attackers exploit it, Baidu may crawl large numbers of search pages, and the attackers use the site's authority to rank quickly for porn, gambling and drug keywords. For our site that is a nightmare. Left unhandled, it can cause several problems: a large share of the crawl quota is wasted on junk pages; junk pages get indexed and pollute the site's keyword profile, hurting rankings for industry and brand terms; and the site's image suffers.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">An anti-spam strategy of this kind has to satisfy four requirements: on-site users can still search normally; search engines are not allowed to crawl these pages; visits arriving from spam backlinks are refused; and spam keywords never appear on the page.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">With the goals clear, the corresponding measures follow:</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">A. Restrict the source: refuse every search request that does not come from within the site.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">B. Do not echo the search term into key positions on the page such as the TKD (title, keywords, description).</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">C. Define filtering rules based on a sensitive-word list and replace every sensitive word with asterisks (*). This takes some development work.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">D. Declare in robots.txt that these pages must not be crawled.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">E. Add a meta robots tag to the page's head section declaring that the page must not be indexed (noindex).</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">These measures resolve most of the problems that on-site search pages are prone to. They are not limited to search pages: any page you do not want search engines to crawl or index can be treated the same way.</span></p>
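<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Measures C and D are easy to try out locally. The sketch below masks sensitive words with asterisks and uses Python's standard urllib.robotparser to check that a robots.txt rule really blocks the search path. The /search path, the robots.txt content and the word list are assumptions for illustration; use your site's real values. Keep in mind that a crawler which honors the Disallow rule does not fetch the page at all, so it will not see a noindex tag placed on that page.</span></p>

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt that blocks an assumed /search path for all crawlers.
ROBOTS_TXT = """\
User-agent: *
Disallow: /search
"""


def is_crawl_allowed(robots_txt, user_agent, url):
    """Check a URL against robots.txt content held in a string."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


def mask_sensitive(text, sensitive_words):
    """Replace each sensitive word with asterisks of the same length."""
    for word in sensitive_words:
        text = text.replace(word, "*" * len(word))
    return text


print(is_crawl_allowed(ROBOTS_TXT, "Baiduspider", "http://www.abc.com/search?q=casino"))
print(is_crawl_allowed(ROBOTS_TXT, "Baiduspider", "http://www.abc.com/article/1"))
print(mask_sensitive("cheap casino chips", ["casino"]))
```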
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><strong style="color: blue;"><span style="color: black;">2. Now let's look at dead link analysis.</span></strong></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Dead links are explained in detail in the help documentation for the dead link submission tool in Webmaster Tools, so I will only add a few points.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Dead links generally come in two kinds: internal and external.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Internal dead links are links on our own site that Baiduspider, for whatever reason, cannot fetch content from and therefore marks as dead. In most cases we can prevent them, so they are controllable. At the same time, the pages that link to them are all our own pages, and a page that links out to dead links is very unfriendly to search engines. If they are not fixed promptly, search engines may well be unable to crawl the site's valuable pages smoothly, which indirectly leads to a "partial demotion" (some pages are crawled at longer and longer intervals, snapshots update slowly, rankings stall, and so on).</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Internal dead links are the more serious problem, so fix them first.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Dead-link data is easy to get from Baidu Webmaster Tools; we can then sort and group it by some logic to locate problems. The rest of this section is about analyzing that data.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Previewing dead-link information on the page needs no explanation. You do not need to download and analyze the tables every day. A quick daily look is enough to see whether dead links have suddenly appeared, so you can find the cause and fix it (large-scale outbreaks are easy to notice and need urgent handling). In addition, run a more thorough dead-link analysis at regular intervals to catch problems you missed day to day (these usually affect a small area and are hard to notice, but left alone for long they can grow into big problems).</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><img src="http://mmbiz.qpic.cn/mmbiz/fbac17wmeIFujC8MHlmcDeCnTm80OfGTxC2xCvlkVGWUhZ8NSLvB7rDWtPkjdYKK8mibcWlicvptBmpK0Ivh873w/640?wx_fmt=png&tp=webp&wxfrom=5&wx_lazy=1&wx_co=1" style="width: 50%; margin-bottom: 20px;"><span style="color: black;">Figure: a sudden flood of dead links is easy to notice, and the cause is usually easy to pin down.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><img src="http://mmbiz.qpic.cn/mmbiz/fbac17wmeIFujC8MHlmcDeCnTm80OfGTWvOIqoic5ThlpOz0wRiaibHTF2wYW6BqhE82qKzCYibes2icaLx8k5iczkIw/640?wx_fmt=png&tp=webp&wxfrom=5&wx_lazy=1&wx_co=1" style="width: 50%; margin-bottom: 20px;"><span style="color: black;">Figure: this problem was identified early on. A fix was proposed but the developers ignored it, and it recently flared up. Even small problems deserve attention. (It was handled promptly once it broke out, so nothing too serious came of it.)</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Next, a brief look at batch-downloading dead-link data from Baidu Webmaster Tools and merging it for unified processing.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Internal dead links (subdomain A pointing to subdomain A) and outbound dead links (subdomain A pointing to subdomains B, C, D, ...) are generally easy to analyze, so let's do some batch processing on inbound dead links (subdomains B, C, D, ... pointing to subdomain A).</span></p>
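<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The three definitions above can be written down as a small classification function. This is a sketch of the definitions only, not of how Baidu Webmaster Tools decides internally; the host names are placeholders, and the fourth "unrelated" result is my addition for pairs that do not involve the site at all.</span></p>

```python
from urllib.parse import urlsplit


def classify_dead_link(source_url, dead_url, site_host):
    """Classify a dead link relative to site_host (our 'subdomain A')."""
    source_host = urlsplit(source_url).hostname
    dead_host = urlsplit(dead_url).hostname
    if source_host == site_host and dead_host == site_host:
        return "internal"   # A points to A
    if source_host == site_host:
        return "outbound"   # A points to B, C, D, ...
    if dead_host == site_host:
        return "inbound"    # B, C, D, ... point to A
    return "unrelated"      # neither end is on A


SITE = "www.abc.com"  # placeholder for subdomain A
print(classify_dead_link("http://www.abc.com/a", "http://www.abc.com/gone", SITE))
print(classify_dead_link("http://www.abc.com/a", "http://bbs.abc.com/gone", SITE))
print(classify_dead_link("http://bbs.abc.com/t/1", "http://www.abc.com/gone", SITE))
```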
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><img src="http://mmbiz.qpic.cn/mmbiz/fbac17wmeIFujC8MHlmcDeCnTm80OfGTib3FHz15m7genS82c1CfTtV5iavftht06CnC10BCf40Uib5wZ7UYV8icTw/640?wx_fmt=png&tp=webp&wxfrom=5&wx_lazy=1&wx_co=1" style="width: 50%; margin-bottom: 20px;"></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><img src="data:image/svg+xml,%3C%3Fxml version=1.0 encoding=UTF-8%3F%3E%3Csvg width=1px height=1px viewBox=0 0 1 1 version=1.1 xmlns=http://www.w3.org/2000/svg xmlns:xlink=http://www.w3.org/1999/xlink%3E%3Ctitle%3E%3C/title%3E%3Cg stroke=none stroke-width=1 fill=none fill-rule=evenodd fill-opacity=0%3E%3Cg transform=translate(-249.000000, -126.000000) fill=%23FFFFFF%3E%3Crect x=249 y=126 width=1 height=1%3E%3C/rect%3E%3C/g%3E%3C/g%3E%3C/svg%3E" style="width: 50%; margin-bottom: 20px;"><span style="color: black;">图注:<span style="color: black;">能够</span>对数据进行下载,格式为csv(逗号分隔符),<span style="color: black;">能够</span>方便地<span style="color: black;">运用</span>excel进行处理</span><span style="color: black;">;并且下方有官方的<span style="color: black;">帮忙</span>文档。</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">到<span style="color: black;">这儿</span>,你<span style="color: black;">能够</span>试着点击【下载数据】,<span style="color: black;">这般</span>火狐浏览器就会自动把文件下载到你设置好的位置。</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><img src="data:image/svg+xml,%3C%3Fxml version=1.0 encoding=UTF-8%3F%3E%3Csvg width=1px height=1px viewBox=0 0 1 1 version=1.1 xmlns=http://www.w3.org/2000/svg xmlns:xlink=http://www.w3.org/1999/xlink%3E%3Ctitle%3E%3C/title%3E%3Cg stroke=none stroke-width=1 fill=none fill-rule=evenodd fill-opacity=0%3E%3Cg transform=translate(-249.000000, -126.000000) fill=%23FFFFFF%3E%3Crect x=249 y=126 width=1 height=1%3E%3C/rect%3E%3C/g%3E%3C/g%3E%3C/svg%3E" style="width: 50%; margin-bottom: 20px;"><span style="color: black;"><span style="color: black;">这儿</span>告诉<span style="color: black;">大众</span>一个小技巧,<span style="color: black;">能够</span>点击下载列表中的对应文件,复制下载链接,<span style="color: black;">而后</span>粘贴出来。</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">http://zhanzhang.baidu.com/inbound/deadlist?site=http://www.abc.com/&download=1&type=3&day=2016-02-30&f=dead_link&key=</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">I'm sure the good-looking readers have already spotted it: site=http://www.abc.com/ specifies your site's domain, and day=2016-02-30 specifies the date you want. type=3 selects the [inbound dead links] data, type=2 is outbound dead links, and type=1 is internal dead links. The remaining parameters need no further attention.</span></p>
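<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">To make the parameter breakdown concrete, here is a minimal Python sketch that pulls the three parameters out of the example URL with the standard library. The names EXPORT_URL, DEAD_LINK_TYPES and describe_export_url are illustrative only, and the type labels simply restate the description above.</span></p>

```python
from urllib.parse import urlsplit, parse_qs

# Example export URL copied from the article (the key value is blank there).
EXPORT_URL = (
    "http://zhanzhang.baidu.com/inbound/deadlist"
    "?site=http://www.abc.com/&download=1&type=3"
    "&day=2016-02-30&f=dead_link&key="
)

# Meaning of the type parameter, as described in the text above.
DEAD_LINK_TYPES = {
    "1": "internal dead links",
    "2": "outbound dead links",
    "3": "inbound dead links",
}


def describe_export_url(url):
    """Return the site, day and dead-link type encoded in an export URL."""
    # keep_blank_values=True so the empty key= parameter is not dropped.
    params = parse_qs(urlsplit(url).query, keep_blank_values=True)
    return {
        "site": params["site"][0],
        "day": params["day"][0],
        "type": DEAD_LINK_TYPES.get(params["type"][0], "unknown"),
    }


info = describe_export_url(EXPORT_URL)
print(info)
```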
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The more imaginative among you will be wondering: if I vary the date parameter, can I download these files in bulk? Yes, you can. This is where Excel's fill feature comes in.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">First build two rows of URLs by hand and select both. Then hold the left mouse button on the bottom-right corner of the selection and drag down; Excel automatically fills in the rest of the URLs. Very convenient.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Release the left button and you have the result you wanted.</span></p>
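<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The same date fill can also be scripted. The sketch below is a minimal Python version, assuming the export URL keeps the parameter layout of the example above. BASE, export_urls and its arguments are illustrative names, not part of any Baidu tool, and the key parameter is left blank as in the article.</span></p>

```python
from datetime import date, timedelta
from urllib.parse import urlencode

# Endpoint taken from the article's example URL.
BASE = "http://zhanzhang.baidu.com/inbound/deadlist"


def export_urls(site, start, end, link_type=3, key=""):
    """Build one export URL per calendar day from start to end (inclusive)."""
    urls = []
    day = start
    while day <= end:
        query = urlencode(
            {
                "site": site,
                "download": 1,
                "type": link_type,
                "day": day.isoformat(),
                "f": "dead_link",
                "key": key,
            },
            safe=":/",  # keep the site URL readable, as in the article's example
        )
        urls.append(f"{BASE}?{query}")
        day += timedelta(days=1)
    return urls


urls = export_urls("http://www.abc.com/", date(2016, 2, 27), date(2016, 3, 1))
print("\n".join(urls))
```

<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Stepping through real calendar dates also avoids producing days that do not exist, such as February 30.</span></p>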
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Next, copy these URLs, switch to Firefox, and use the Launch Clipboard add-on we installed earlier: its shortcut Alt + Shift + K opens all of the copied links in one go, and Firefox automatically downloads the files to the location we specified.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Now, let's look at what we have collected:</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Looks pretty good, doesn't it? But do I really have to open all these spreadsheets one by one?</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">Of course not. Let's see what one of the spreadsheets looks like. See? Each row has a recorded time.</p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">In other words, if we can find a way to merge all these files, we can still tell the dates apart.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Right, let's get to it.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">(1) Open the Command Prompt: press Windows + R, type cmd, and press Enter.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">(2) In the Command Prompt, type cd followed by a space, then go to where the CSV files are saved and drag the whole folder into the Command Prompt window; the path is filled in automatically.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">If you leave out cd and the space, you get an error, as in the figure below. (cd changes to the specified directory; if the folder is on a different drive, use cd /d instead.)</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Once that succeeds, you can merge all the CSV files by entering this command:</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">copy *.csv ..\ok.csv</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">This copies every file with the .csv extension into a single file, ok.csv, in the parent directory.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">That completes the merge.</span></p>
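<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">For readers who are not on Windows, or who want to keep only one header row, the merge can be sketched in Python. This is not the author's method, just a rough equivalent. It assumes every export starts with the same header row and is UTF-8 encoded; the article confirms neither, so adjust the encoding argument to match your files. merge_csv is a hypothetical helper name.</span></p>

```python
import csv
import glob
import os


def merge_csv(folder, out_path, encoding="utf-8"):
    """Concatenate every .csv in folder into out_path, keeping one header row.

    Returns the number of data rows written. out_path should live outside
    folder (like ..\\ok.csv in the copy command) so it is not merged into itself.
    """
    header_written = False
    rows_written = 0
    with open(out_path, "w", newline="", encoding=encoding) as out:
        writer = csv.writer(out)
        for path in sorted(glob.glob(os.path.join(folder, "*.csv"))):
            with open(path, newline="", encoding=encoding) as src:
                reader = csv.reader(src)
                header = next(reader, None)  # assumed header row
                if header is None:
                    continue  # skip empty files
                if not header_written:
                    writer.writerow(header)
                    header_written = True
                for row in reader:
                    writer.writerow(row)
                    rows_written += 1
    return rows_written
```

<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Calling merge_csv("downloads", "ok.csv") would then play the role of the copy command, with "downloads" standing in for the folder that holds the exports.</span></p>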
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Shall we open ok.csv and take a look? A simple deduplication pass comes next.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Caption: after simple deduplication, we can still skim through the data.</span></p>
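<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The article does the deduplication in Excel. As a rough scripted equivalent, the sketch below drops exact duplicate rows while keeping the first occurrence. The column names in the sample data are made up for illustration, since the article does not list the real ones.</span></p>

```python
def dedupe_rows(rows):
    """Drop exact duplicate rows, keeping the first occurrence and the order."""
    seen = set()
    unique = []
    for row in rows:
        # Compare on stripped cell values so stray spaces do not hide duplicates.
        key = tuple(cell.strip() for cell in row)
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


# Hypothetical sample: a repeated header row and one repeated data row.
rows = [
    ["dead_link", "referrer", "time"],
    ["http://www.abc.com/a", "http://x.example/1", "2016-02-27"],
    ["dead_link", "referrer", "time"],
    ["http://www.abc.com/a", "http://x.example/1", "2016-02-27"],
    ["http://www.abc.com/a", "http://x.example/1", "2016-02-28"],
]
unique_rows = dedupe_rows(rows)
print(len(unique_rows))
```

<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Rows that differ only in the recorded time are kept, so the same dead link seen on two days still shows up twice.</span></p>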
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">We notice that many of the dead links' referring pages sit in similar directories under different domains. Let's save those pages separately.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Caption: filtering out all pages under zx123.cn subdomains that contain the xiaoqu directory.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Then we notice some pages containing baidu.com/. These are usually crawled because the URLs were pushed to Baidu, so we set them aside in their own group for now.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Caption: Baidu's crawl data.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">What remains is external dead links, and some of those are spam links that we need to pick out.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Caption: sorted by dead-link URL.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Put the spam dead links in their own group as well; what is left is the genuine external dead links.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Caption: time to check the results.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Following some simple logic, we have split the data into four groups: [external dead links], [spam links], [Baidu], and [subdomains (which also count as internal dead links)].</span></p>
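<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The four-way split was done by filtering and sorting in Excel. A rough Python sketch of the same idea follows. It classifies each row by the host of its referring page, which is stricter than searching for the text baidu.com/. It also assumes the spam hosts have already been identified by hand and passed in as spam_hosts. classify_referrer and its parameters are hypothetical names.</span></p>

```python
from urllib.parse import urlsplit


def classify_referrer(referrer, own_domain="zx123.cn", spam_hosts=frozenset()):
    """Assign a dead link's referring page to one of the four groups."""
    # Assumes referrer is a full URL with a scheme, e.g. http://...
    host = (urlsplit(referrer).hostname or "").lower()
    if host == own_domain or host.endswith("." + own_domain):
        return "subdomain"
    if host == "baidu.com" or host.endswith(".baidu.com"):
        return "baidu"
    if host in spam_hosts:
        return "spam"  # hosts flagged during manual review
    return "external"


# Made-up referring pages for illustration only.
examples = [
    "http://wuhan.zx123.cn/xiaoqu/1.html",
    "http://www.baidu.com/s?wd=test",
    "http://spam.example/list",
    "http://other.example/page",
]
groups = [classify_referrer(u, spam_hosts={"spam.example"}) for u in examples]
print(groups)
```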
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The group to focus on is the dead links that appear under [subdomains]. The subdomains are part of our own site, so dead links on these pages are bound to hurt those pages' SEO, and we need to find the cause as soon as possible.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">After talking with the engineering team, I confirmed the main cause: data synchronization between our servers sometimes fails, or the connection between servers occasionally drops. This is hard to avoid for now, so all we can do is have the engineers change the response in these cases from 404 (not found, which the crawler treats as permanently inaccessible) to 503 (temporarily unavailable).</span></p>
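<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The article does not describe the site's server stack, so the following is only a framework-neutral sketch of the decision, not the team's actual change. response_for and its arguments are hypothetical; Retry-After is the standard HTTP header for telling clients when to try again.</span></p>

```python
from http import HTTPStatus


def response_for(page_found, backend_in_sync, retry_after_seconds=300):
    """Pick a status code and headers for a page request.

    page_found: whether this server could load the page's data.
    backend_in_sync: False when the lookup failed because of a sync or
    connection problem rather than because the page does not exist.
    """
    if page_found:
        return HTTPStatus.OK, {}
    if not backend_in_sync:
        # Temporary failure: ask clients and crawlers to come back later.
        return HTTPStatus.SERVICE_UNAVAILABLE, {"Retry-After": str(retry_after_seconds)}
    # The page really does not exist.
    return HTTPStatus.NOT_FOUND, {}


status, headers = response_for(page_found=False, backend_in_sync=False)
print(int(status), headers)
```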
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The dead links in the [Baidu] group have the same cause. The only difference is that the spider reached these URLs through active push. After we switched to returning 503, the situation improved.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">[Spam links] were already covered to some extent in the external-link analysis, which you can refer to.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">[External dead links] do not need much attention: the site affected by a dead link is not ours but the one that links out to it. Still, a closer look now and then turns up some interesting patterns.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">For example, the data I am looking at now has one thing in common: the dead-link URLs are all incomplete, either abbreviated with dots in the middle or cut off at the end. Opening the referring pages shows that each dead link appears on the page as a bare URL (with no anchor text). Most of the referring pages resemble search engine results pages, and the anchor links on them are all marked nofollow.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Caption: these are all junk search engines whose purpose is to scrape other sites' content for their own use and build spam site networks.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">As you can see, most of the [spam links] and [external dead links] also come with malicious intent. At this point we may need to consider an anti-crawler strategy to stop these junk search engines from crawling our site at will. (I plan to try a feature on anti-crawler strategy in the future.)</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Well, that is about it for this installment. Let's sum up.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">(1) The purpose of analyzing link data: to make sure search engines crawl and index the site normally, and to avoid losses from being exploited by malicious parties.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">(2) The means of analyzing link data: a few tools, plus some simple logic.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">(3) Build good working habits and awareness: glance over these data every day, analyze them carefully at regular intervals, and handle each of these steps in a controlled way.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Author: 响1亮2的3名4字 Source: Baidu Webmaster Platform (百度站长平台)</span></p>