How Google Search Works: The Secrets Are All Here
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">One morning in September 2020, people in Northern California woke up to find the sky over the West Coast glowing orange-red behind wildfire smoke. It looked like a scene straight out of the film Blade Runner, something many people had probably never seen in real life.</span></p>
<div style="color: black; text-align: left; margin-bottom: 10px;"><img src="https://p3-sign.toutiaoimg.com/pgc-image/0dc97eaf66db4fa98279b807fabf0399~noop.image?_iz=58558&from=article.pc_detail&lk3s=953192f4&x-expires=1723731220&x-signature=WYTvu85A2mX4Pe58KV11V5DsOhg%3D" style="width: 50%; margin-bottom: 20px;">
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">Image: LA Times</p>
</div>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">What on earth had happened?</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Naturally, just as most internet users in China would open Baidu or even Zhihu to look for answers, Californians flocked to Google and typed in queries such as "</span><strong style="color: blue;"><span style="color: black;">why is the sky orange</span></strong><span style="color: black;">". Questions like these might seem a little nonsensical to a search engine, yet they still received detailed and timely answers in the form of information cards and featured news.</span></p>
<div style="color: black; text-align: left; margin-bottom: 10px;"><img src="https://p3-sign.toutiaoimg.com/pgc-image/445fc6f86dcc429f9b27b2bfa786e30a~noop.image?_iz=58558&from=article.pc_detail&lk3s=953192f4&x-expires=1723731220&x-signature=bUnDON7%2B4hjEQS24QnVj%2Bhzyehg%3D" style="width: 50%; margin-bottom: 20px;">
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">The Google Search results page at the time</p>
</div>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">This is a case Google shared not long ago. Once we lift Google Search out of this event and look at it closely, many people will probably have questions. How does Google know what users want to search for? Why does local news for California rank at the top of the page? Would people in other regions who search for the same question get similar answers? What role does the knowledge panel on the right-hand side of the results page play in a search like this?</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">To help you learn a bit more about the world's most popular search engine, Google has been sharing details and principles of Google Search on its blog, The Keyword, since 2018. If you have the same questions, follow along as this article explores the secrets behind Google Search.</span></p>
<h1 style="color: black; text-align: left; margin-bottom: 10px;">How do search suggestions "pop up"?</h1>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">We deal with search engines every day, and each time we use Google to look something up, a stream of search suggestions expanded from the words already typed keeps popping up below the search box. Is Google a fortune-teller that already knows what is on your mind?</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Behind this "fortune-telling" is a Google feature called Autocomplete. As soon as we start typing, Google begins to display its guesses at the search query below the search box. If any one of those guesses hits, we can finish typing quickly.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">These guesses (officially called "predictions") come from the system continuously looking up likely completions for the words we have typed so far. As we keep typing, the text suggested below the search box is adjusted to match the updated predictions. This is also why, on a poor network connection, search suggestions may lag or fail to appear at all.</span></p>
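<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The lookup described above can be pictured with a minimal Python sketch. It assumes a small, made-up query log with popularity counts (<code>QUERY_COUNTS</code>) and a hypothetical <code>suggest</code> function that returns the most popular known queries starting with the typed prefix. It only illustrates the idea of prefix-based prediction and is not a description of Google's actual system.</span></p>

```python
from bisect import bisect_left

# Hypothetical query log: query text -> how often it was searched.
# The entries and counts are invented for illustration.
QUERY_COUNTS = {
    "why is the sky blue": 90,
    "why is the sky orange": 120,
    "why is the sea salty": 40,
    "weather today": 300,
}

# Sorting once lets us jump straight to the first query that could match.
SORTED_QUERIES = sorted(QUERY_COUNTS)


def suggest(prefix, limit=3):
    """Return up to `limit` known queries starting with `prefix`, most popular first."""
    prefix = prefix.lower()
    start = bisect_left(SORTED_QUERIES, prefix)
    matches = []
    for query in SORTED_QUERIES[start:]:
        if not query.startswith(prefix):
            break  # sorted order: no later query can share the prefix
        matches.append(query)
    # Rank by popularity, breaking ties alphabetically.
    matches.sort(key=lambda q: (-QUERY_COUNTS[q], q))
    return matches[:limit]


# Each extra keystroke narrows the predictions.
for typed in ("why", "why is the sk"):
    print(typed, "->", suggest(typed))
```

<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Calling <code>suggest</code> again after every keystroke mirrors how the suggestion list keeps changing as the input grows.</span></p>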
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">To improve the hit rate of these suggestions, Google also brings in further factors to calibrate its predictions: the searcher's location, currently trending topics, and even the device being used all affect the suggestions that Autocomplete produces. And of course, as many people know, the search history we keep with Google and our various search settings also affect the specific predictions.</span></p>
<div style="color: black; text-align: left; margin-bottom: 10px;"><img src="https://p3-sign.toutiaoimg.com/pgc-image/bfc97dfccd7645e88cb4e527bdd0b08e~noop.image?_iz=58558&from=article.pc_detail&lk3s=953192f4&x-expires=1723731220&x-signature=sEQM0bhC4L8eg%2BBE7N27VBVmx6s%3D" style="width: 50%; margin-bottom: 20px;">
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">Search settings affect search results, but they are only one of many factors</p>
</div>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">For example, in Europe and North America, where Google Search is more widely used, Google often infers from a searcher's location whether they use British or American English and presents different content accordingly. In British English "football" usually means soccer, while in American English it usually means American football, and Google draws the same distinction. Correspondingly, Google also adapts its spelling suggestions, for example choosing between "center" and "centre" according to the searcher's region.</span></p>
<div style="color: black; text-align: left; margin-bottom: 10px;"><img src="https://p3-sign.toutiaoimg.com/pgc-image/6d3b266150f949c39461d0052f3634a8~noop.image?_iz=58558&from=article.pc_detail&lk3s=953192f4&x-expires=1723731220&x-signature=iE3iltZ3ffmB3L%2Fr%2Bz02S6kItk4%3D" style="width: 50%; margin-bottom: 20px;">
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">Note the location and the spelling of the word in the image</p>
</div>
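<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The "center" versus "centre" behaviour can be sketched as a small post-processing step. The region table, the spelling pairs, and the <code>localize</code> function below are all assumptions made for illustration; the article does not say how Google implements this.</span></p>

```python
# Hypothetical mapping from a region code to the preferred spelling variant.
REGION_VARIANT = {"US": "american", "GB": "british"}

# A few well-known spelling pairs: (American, British).
SPELLING_PAIRS = [("center", "centre"), ("color", "colour")]


def localize(suggestion, region):
    """Rewrite known spelling variants in `suggestion` to match `region`."""
    # Unknown regions fall back to American spelling (an arbitrary choice here).
    variant = REGION_VARIANT.get(region, "american")
    words = []
    for word in suggestion.split():
        for american, british in SPELLING_PAIRS:
            if word in (american, british):
                word = american if variant == "american" else british
                break
        words.append(word)
    return " ".join(words)


print(localize("shopping centre near me", "US"))
print(localize("shopping center near me", "GB"))
```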
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">From this we can draw a conclusion: every search each person runs on Google is highly personalized. Even if we use the browser's private browsing mode to exclude our personal search and browsing history, the actual results are still adjusted according to other factors.</span></p>
<h1 style="color: black; text-align: left; margin-bottom: 10px;">Featured snippets: no digging required, ask and get an answer</h1>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">I just want an answer; I don't want to open a web page.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">People who often use search engines to find information have surely had a similar thought. One of the reasons they developed this habit is very likely the information card that Google often generates right at the top of the results page: direct and crisp. You ask, it answers.</span></p>
<div style="color: black; text-align: left; margin-bottom: 10px;"><img src="https://p3-sign.toutiaoimg.com/pgc-image/d1fea0d538214dcc940945980b87257c~noop.image?_iz=58558&from=article.pc_detail&lk3s=953192f4&x-expires=1723731220&x-signature=%2BCuwt1ZmXKkveJugGxUEjpvLdgQ%3D" style="width: 50%; margin-bottom: 20px;"></div>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Where does this answer come from?</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">First, this card has a specific name: featured snippets. To borrow a saying, "life is like a duck on the water: calm and composed on the surface, paddling furiously underneath." Featured snippets come about in much the same way. While we type and search, Google appears merely to search and load pages at a leisurely pace, but in those few tenths of a second it is "paddling furiously" behind the scenes. The search algorithms retrieve relatively authoritative, high-quality web pages for the question we asked, extract the key content from those pages to generate a summary, and finally present that summary to us as the "featured snippet" described above.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">But an algorithm is still an algorithm, and it slips up from time to time. The most famous example is the question "How did the ancient Romans tell time at night?", for which Google initially answered:</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Sundials. The Romans initially used sundials to measure the passage of time. With this method they could not only determine sunrise, sunset, and noon fairly accurately, but also estimate other times of day from the length of the shadow. The introduction of the sundial as a new tool gave the Romans a better way to measure time…</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">So how do you tell time with a sundial at night, when there is no sun? Google's featured snippet did not know either back then. Doesn't it feel a bit like your school days, when you gave an off-topic answer but were determined to fill the whole exam sheet anyway…</span></p>
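<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">One way to see how an extract-and-summarize pipeline can produce this kind of off-topic answer is a deliberately naive Python sketch. It assumes a hypothetical <code>pick_snippet</code> function that scores candidate passages purely by how many words they share with the query; the passages and the <code>example</code> source names are invented. Google's real system is not described in the article and is certainly more sophisticated, so this is only a toy model of the failure.</span></p>

```python
import re


def tokenize(text):
    """Lower-case the text and return its set of alphabetic words."""
    return set(re.findall(r"[a-z]+", text.lower()))


def pick_snippet(query, passages):
    """Return the (source, text) pair whose text shares the most words with the query.

    `passages` is a list of (source, text) tuples taken from pages that are
    assumed to have already been judged trustworthy.
    """
    query_terms = tokenize(query)
    return max(passages, key=lambda p: len(query_terms & tokenize(p[1])))


# Invented passages for illustration only.
passages = [
    ("history.example", "Romans used sundials to tell time during the day."),
    ("clocks.example", "Water clocks let people measure hours after dark."),
]

source, text = pick_snippet("How did Romans tell time at night?", passages)
print(source, "->", text)
```

<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The sundial passage wins because it shares "Romans", "tell", and "time" with the query, even though it ignores the one word that matters, "night".</span></p>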
<h1 style="color: black; text-align: left; margin-bottom: 10px;">Knowledge Graph: a powerful information supplement</h1>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">We have now met featured snippets and seen them talk nonsense. So what do we do when we realize a featured snippet seems to be making things up? Or when the results page has no featured snippet at all?</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><strong style="color: blue;"><span style="color: black;">You may already have this habit: look to the right</span></strong><span style="color: black;">. A knowledge panel may appear on the right side of the page. It contains information related to the topic of the current search and may well come in handy. This Knowledge Panel is closely tied to the Knowledge Graph that Google painstakingly built in earlier years.</span></p>
<div style="color: black; text-align: left; margin-bottom: 10px;"><img src="https://p3-sign.toutiaoimg.com/pgc-image/ec1f0a1185a643149451e244ba8e0242~noop.image?_iz=58558&from=article.pc_detail&lk3s=953192f4&x-expires=1723731220&x-signature=3rfhIBTlKe4jYJm4rfN1NydCO%2Fs%3D" style="width: 50%; margin-bottom: 20px;">
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">Image from Wikipedia</p>
</div>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Put simply, the Knowledge Graph is a small "knowledge base" made up of information from many different pages and sources. For each topic, Google uses semantic algorithms to automatically organize and consolidate related information from different content, and that information is updated automatically as the original source pages change.</span></p>
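<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">As a rough mental model, such a knowledge base can be sketched as a store of facts keyed by entity, where each fact remembers the source it came from. The <code>KnowledgeGraph</code> class, its method names, and the <code>example.com</code> URLs below are hypothetical; the sketch only shows the shape of the idea, not Google's data model.</span></p>

```python
class KnowledgeGraph:
    """Toy entity store: entity -> attribute -> (value, source)."""

    def __init__(self):
        self._facts = {}

    def add_fact(self, entity, attribute, value, source):
        # A later fact for the same attribute replaces the earlier one,
        # mimicking an update when the source page changes.
        self._facts.setdefault(entity, {})[attribute] = (value, source)

    def panel(self, entity):
        """Return the attribute -> value view that a panel would display."""
        facts = self._facts.get(entity, {})
        return {attribute: value for attribute, (value, _source) in facts.items()}


graph = KnowledgeGraph()
# Facts gathered from different (placeholder) source pages.
graph.add_fact("Apple", "type", "Company", "https://example.com/a")
graph.add_fact("Apple", "headquarters", "Cupertino, California", "https://example.com/b")
print(graph.panel("Apple"))
```

<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Keeping the source next to each value is a design choice of this sketch: it makes it possible to refresh a single fact when its source page changes.</span></p>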
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">So when we search for people, places, organizations, and the like, the relevant content can be gathered directly into a knowledge panel placed on the right side of the results page. The panel now holds quite rich content. Taking Apple's knowledge panel as an example, we can find right in the panel a basic introduction to the company, stock price information, lines of business, the customer service phone number, social media profiles, popular products, and even the battery replacement page. Compared with being sent to some obscure corner of the internet before finding the official site, a knowledge panel like this can greatly improve the efficiency of looking up information on a topic.</span></p>
<div style="color: black; text-align: left; margin-bottom: 10px;"><img src="https://p3-sign.toutiaoimg.com/pgc-image/60feaea62e7d407ab9fb7e930ef9f505~noop.image?_iz=58558&from=article.pc_detail&lk3s=953192f4&x-expires=1723731220&x-signature=MOEzHZ3Qgd8lR4Uynvxn0OkJXR4%3D" style="width: 50%; margin-bottom: 20px;"></div>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Although the knowledge panel sits off in a corner, Google takes it quite seriously. According to Google, as of May 2020 the Knowledge Graph behind it had amassed more than 500 billion facts about roughly 5 billion entities. Calling it an "encyclopedia" hidden inside Google Search is hardly an exaggeration, is it?</span></p>
<h1 style="color: black; text-align: left; margin-bottom: 10px;">Which results come first? Money doesn't decide</h1>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Whether featured snippets or knowledge panels, all of these can be loosely filed under quick answers. If the whole search process is a hearty meal, featured snippets and knowledge panels are merely the appetizers; the search results in the main body of the page are the main course.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">So when browsing Google's results, with the mouse wheel spinning and the blue links flashing past, many people naturally form a tentative little thought: how are all these results ordered, and have the first few paid for their place, as on certain other search engines?</span></p>
<div style="color: black; text-align: left; margin-bottom: 10px;"><img src="https://p3-sign.toutiaoimg.com/pgc-image/172b73f073b7468ab9e08274997bbdbc~noop.image?_iz=58558&from=article.pc_detail&lk3s=953192f4&x-expires=1723731220&x-signature=wVSx%2BiMebrRaM8rK80tQziGCWvM%3D" style="width: 50%; margin-bottom: 20px;">
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">What the "suspect" has to say</p>
</div>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">This question brings us to search ranking algorithms.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The best-known search ranking algorithm is probably PageRank, which is also the earliest algorithm Google used to rank web pages. Yes, it is the name in the back of your mind: Larry Page. The algorithm is named after one of Google's founders.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Although Google makes most of its money from advertising, what mainly determines the ranking of search results is still the algorithm itself. But nothing is perfect, and algorithms have problems too. One of PageRank's flaws is that older pages tend to rank higher than newer ones, and the algorithm also became a loophole that some people exploited to game the rankings. As a result, Google shut down public access to PageRank data in 2016.</span></p>
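<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The core idea of PageRank can be sketched as a simple power iteration in which each page repeatedly shares its score among the pages it links to. The version below is a textbook-style sketch: the damping factor of 0.85 is the commonly cited value, and the three-page link graph is made up. It is not Google's production ranking system.</span></p>

```python
def pagerank(links, damping=0.85, iterations=50):
    """Compute PageRank scores for a link graph given as {page: [linked pages]}."""
    pages = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(pages)
    rank = {page: 1.0 / n for page in pages}  # start from a uniform distribution

    for _ in range(iterations):
        # Every page receives a small baseline score.
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # Split this page's score evenly among its outgoing links.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A page with no outgoing links spreads its score over all pages.
                share = damping * rank[page] / n
                for other in pages:
                    new_rank[other] += share
        rank = new_rank
    return rank


# Invented three-page web: a links to b and c, b links to c, c links to a.
ranks = pagerank({"a": ["b", "c"], "b": ["c"], "c": ["a"]})
for page, score in sorted(ranks.items(), key=lambda item: -item[1]):
    print(page, round(score, 3))
```

<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">In this model a page with no inbound links receives only the baseline score, which is consistent with the bias toward older, already well-linked pages noted above.</span></p>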
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">As the saying goes, all roads lead to Rome. Times change and algorithms change, but Google says its original commitment to the quality of search rankings has not. According to Google, today's ranking system is quality-oriented and consists of a whole series of algorithms. During a search, the words of our query, the relevance and usability of the target pages, the expertise of the sources, and more all influence the algorithms and a page's final ranking. The nature of the topic being searched also affects how content is ordered on the page.</span></p>
<div style="color: black; text-align: left; margin-bottom: 10px;"><img src="https://p3-sign.toutiaoimg.com/pgc-image/f998bb340adb4c5087682ec30a5f0ae8~noop.image?_iz=58558&from=article.pc_detail&lk3s=953192f4&x-expires=1723731220&x-signature=Y2IyDiqiF9qTwNLuRdHZixKXUAI%3D" style="width: 50%; margin-bottom: 20px;"></div>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">So to some extent, Google's current ranking algorithm is something of a "black box". It is not as open and transparent as PageRank was in earlier years, but it still maintains high-quality rankings. Of course, the ads that Google relies on to "put food on the table" still often appear above the search results; fortunately, like those on the Sspai website, they are clearly labeled.</span></p>
<h1 style="color: black; text-align: left; margin-bottom: 10px;">Using people to safeguard search quality</h1>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">That's right: after all this talk of predictions, the Knowledge Graph, and algorithms, </span><strong style="color: blue;"><span style="color: black;">the last line of defense for the quality of Google's search results turns out to be people</span></strong><span style="color: black;">.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Just like the joke above about Romans telling time at night with a sundial, search results do sometimes miss the point or answer a different question altogether, and it is hard for an algorithm to catch this on its own. To reduce such cases, Google draws on the judgment of several groups of people:</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><strong style="color: blue;"><span style="color: black;">Experts and authoritative institutions</span></strong><span style="color: black;">. When we search topics such as health, finance, civic information, and crisis situations, we see information from authoritative bodies such as local governments, health agencies, and election authorities prioritized directly in the results, so we get reliable information from the source.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><strong style="color: blue;"><span style="color: black;">Google's internal teams</span></strong><span style="color: black;">. Two teams deserve mention here: a dedicated research team and a content enforcement team. The former improves the quality of personalized search through "field studies" of local conditions around the world; the latter handles, under Google's policies, the violating content that the automated systems failed to catch.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><strong style="color: blue;"><span style="color: black;">Search Quality Raters</span></strong><span style="color: black;">. They are the people who give search quality its E-A-T ratings, which reflect the Expertise, Authoritativeness, and Trustworthiness of search results. Raters also help Google evaluate our actual experience when searching. According to Google's figures, more than 10,000 raters currently take part in this work.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">P.S. Before they begin providing ratings, raters must study the Search Quality Rater Guidelines published by Google and pass a corresponding exam. All evaluation work must also follow the Guidelines.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Beyond using human judgment to make up for the shortcomings of algorithms, Google has not given up on improving the algorithms themselves. Take "the relevance and usability of web pages": Google has multiple language understanding systems, some that handle spelling mistakes, synonyms, and the like, and others that are based on AI. Through these systems Google is able to identify the results most relevant to our search and improve them.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Combined with a series of human-led efforts such as side-by-side experiments and live traffic experiments, Google is ultimately able to safeguard our actual experience in Google Search. According to data disclosed by Google, in 2019 it ran 383,605 search quality tests with Search Quality Raters, along with 62,937 side-by-side experiments and 17,523 live traffic experiments, and these efforts helped Google make more than 3,600 improvements to its search algorithms.</span></p>
<div style="color: black; text-align: left; margin-bottom: 10px;"><img src="https://p3-sign.toutiaoimg.com/pgc-image/35027b4423c64637b802d5ce5ea25159~noop.image?_iz=58558&from=article.pc_detail&lk3s=953192f4&x-expires=1723731220&x-signature=iT%2FJH61ZSZFEAvhpLdh173cjhc4%3D" style="width: 50%; margin-bottom: 20px;">
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">The featured snippet answer before and after the correction</p>
</div>
<h1 style="color: black; text-align: left; margin-bottom: 10px;">Summary</h1>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">A simple search and an utterly ordinary results page rest on algorithms, principles, components, and human effort that are all complex and intricate.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The sun is new every day, the internet keeps moving forward, and our search needs keep rising with it. Looking back, it is precisely Google's continuous improvement and optimization of "search" that has made it the most reliable first choice in many people's minds.</span></p>