{"id":1073,"date":"2009-01-07T00:01:37","date_gmt":"2009-01-07T05:01:37","guid":{"rendered":"http:\/\/arxivblog.com\/?p=1073"},"modified":"2009-01-06T19:12:20","modified_gmt":"2009-01-07T00:12:20","slug":"next-generation-search-engines-could-rank-sites-by-talent","status":"publish","type":"post","link":"http:\/\/arxivblog.com\/?p=1073","title":{"rendered":"Next generation search engines could rank sites by &#8220;talent&#8221;"},"content":{"rendered":"<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-1074\" title=\"experience-v-talent\" src=\"http:\/\/arxivblog.com\/wp-content\/uploads\/2009\/01\/experience-v-talent.jpg\" alt=\"experience-v-talent\" width=\"329\" height=\"230\" srcset=\"http:\/\/arxivblog.com\/wp-content\/uploads\/2009\/01\/experience-v-talent.jpg 565w, http:\/\/arxivblog.com\/wp-content\/uploads\/2009\/01\/experience-v-talent-300x209.jpg 300w\" sizes=\"auto, (max-width: 329px) 100vw, 329px\" \/><\/p>\n<p>How will the next generation of search engines outperform Google&#8217;s all-conquering Pagerank algorithm?<\/p>\n<p>One route might be to hire Vwani Roychowdhury at the University of California, Los Angeles and his buddies who have found a fascinating new way to tackle the problem of website rankings.<\/p>\n<p><!--more-->Their breakthrough is to have found that the structure of the web is determined by three factors: the number of inbound links to a page, the rate at which pages are created and deleted and the likelihood that somebody visiting a page will link to it.<\/p>\n<p>This last factor is the forehead smacker. 
Google&#8217;s PageRank cannot easily identify new sites with huge potential because their very newness means they don&#8217;t have a large number of inbound links and so feature poorly in the rankings.<\/p>\n<p>But by looking at the ratio of visitors to incoming links, Roychowdhury and co can get a good handle on a site&#8217;s potential, even when it is new.<\/p>\n<p>In fact, these sites stick out like sore thumbs. It turns out that in the year&#8217;s worth of data the team examined, only 6.5 per cent of the 10 million or so sites they monitored received more than two new incoming links.<\/p>\n<p>It is these up-and-coming sites that go on to displace more established sites in the popularity stakes, leading to the constant slow churn of content we see on conventional PageRank-based search results. The UCLA team simply see them earlier.<\/p>\n<p>That&#8217;s a fascinating and valuable insight.<\/p>\n<p>Roychowdhury and co liken this process to the age-old battle between &#8220;experience&#8221; (well-established sites with many incoming links) and &#8220;talent&#8221; (up-and-coming sites with potential).<\/p>\n<p>Their algorithm won&#8217;t replace PageRank but it could help to significantly fine-tune it, and that could pique the interest of a well-known company based in Mountain View, not to mention numerous other pretenders to the search engine crown.<\/p>\n<p>Ref: <a href=\"http:\/\/arxiv.org\/abs\/0901.0296\">arxiv.org\/abs\/0901.0296<\/a>: Experience Versus Talent Shapes the Structure of the Web<\/p>\n","protected":false},"excerpt":{"rendered":"<p>How will the next generation of search engines outperform Google&#8217;s all-conquering Pagerank algorithm? 
One route might be to hire Vwani Roychowdhury at the University of California, Los Angeles and his buddies who have found a fascinating new way to tackle the problem of website rankings.<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[23,22],"tags":[],"class_list":["post-1073","post","type-post","status-publish","format-standard","hentry","category-mountain-climbin","category-nets-n-webs"],"_links":{"self":[{"href":"http:\/\/arxivblog.com\/index.php?rest_route=\/wp\/v2\/posts\/1073","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/arxivblog.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/arxivblog.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/arxivblog.com\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"http:\/\/arxivblog.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=1073"}],"version-history":[{"count":1,"href":"http:\/\/arxivblog.com\/index.php?rest_route=\/wp\/v2\/posts\/1073\/revisions"}],"predecessor-version":[{"id":1075,"href":"http:\/\/arxivblog.com\/index.php?rest_route=\/wp\/v2\/posts\/1073\/revisions\/1075"}],"wp:attachment":[{"href":"http:\/\/arxivblog.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=1073"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/arxivblog.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=1073"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/arxivblog.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=1073"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}