The PageRank Citation Ranking: Bringing Order to the Web (1998) [pdf]

by niicoon 10/21/2016, 7:12 PMwith 39 comments

by hbbioon 10/21/2016, 7:51 PM

I love how the (now famous) researchers dismiss at the time what has become Google's main chase:

"These types of personalized PageRanks are virtually immune to manipulation by commercial interests. For a page to get a high PageRank, it must convince an important page, or a lot of non-important pages to link to it. At worst, you can have manipulation in the form of buying advertisements(links) on important sites. But, this seems well under control since it costs money"

by jzlon 10/22/2016, 1:45 AM

And here's the original patent. I never knew until now that Sergey Brin wasn't on it.

Will expire in about a year and three months!

https://www.google.com/patents/US6285999

by gigatexalon 10/21/2016, 7:51 PM

the best part of the paper was this: "To test the utility of PageRank for search we built a web search engine called Go ogle" now a company worth hundreds of billions.

by andrewstuart2on 10/21/2016, 7:35 PM

Also fun and related: http://infolab.stanford.edu/~sergey/

And another paper by Page and Brin: http://infolab.stanford.edu/~backrub/google.html

by karakalon 10/21/2016, 9:17 PM

Is it still possible in today's Academia setting to conceive a company like Google without oweing anything to the institution in which it was created? It looks like the first version of Google was even hosted on Stanford's computers.

by androidfoxon 10/21/2016, 7:29 PM

This is 1998 Link. Is this still at least partially valid

by ssnon 10/22/2016, 11:07 AM

But does it work in practice? It doesn't seem to. PageRank is mostly a marketing tool.

https://www.microsoft.com/en-us/research/publication/hits-on...

"The fact that in-degree features outperform PageRank under all measures is quite surprising. A possible explanation is that link-spammers have been targeting the published PageRank algorithm for many years, and that this has led to anomalies in the web graph that affect PageRank."

by jacquesmon 10/22/2016, 8:41 AM

Any act of measuring a system changes that system. Pagerank did more to destroy the value of the web of links than any other technology before it because it was so good at measuring its value. Extracting that value then instantly leads to diminishing it because others (in this case the link spammers) want a slice of that huge pie.

I believe that each and every technology that successfully manages to index the web in a new and useful way will further diminish the value of the web.

by Xeoncrosson 10/22/2016, 1:21 AM

Are there any other good reads on how this technology is implemented, the shortcomings since it was first released, improvements in the algorithm, splitting the calculations up into a map-reduce for the modern (larger) web, and anything else that might be a good read?

I just googled for some implementations like this one in Go: https://github.com/dcadenas/pagerank

by teddyknoxon 10/22/2016, 2:24 AM

This reminds me of the new Black Mirror season 3 pilot