Wikipedia talk:What Google liked

Untitled text[edit]

I really think that this material isn't appropriate for an encyclopedia. It is more like news, and becomes out-of-date very quickly. If perhaps this was just an external link, that'd be OK. Wikipedia is the place for long-term knowledge, not ephemeral statistics. -- ansible

I think you are missing the point. What Google Likes is to point to the pages that might need to be looked over since we are getting outside hits to them. It is what would be considered a meta article rather than a regular encyclopedia entry. --- Jagged

Like Jagged said. This page is not actually part of the encyclopedia, but is posted by Jimbo Wales (our esteemed co-founded) to let us know why people are coming to us. It's been around since early on in the project. -- Stephen Gilbert

Ansible has a point, though, and one that I have pondered myself. This isn't an encyclopedia article, and so it should probably be moved to a more appropriate meta article location. How about Wikipedia/What Google Likes? -- Jimbo Wales

The problem is we have lots of "About Wikipedia" pages that should probably be out of the article namespace. I vote we wait until Magnus' software is ready before moving them all. -- Stephen Gilbert

This page is now really out of date. But the information is quite interesting. Anybody able to update? WillSmith (Malaysia) 16:32 Oct 26, 2002 (UTC)

Second that. Is there a developer handy to update these statistics? Or a way for a normal person to do so? Atorpen 02:01 Feb 9, 2003 (UTC)

Thirded, just to add another prod ;-) - Khendon 13:48 Mar 3, 2003 (UTC)

I renamed this page from "Wikipedia:What Google likes" to "Wikipedia:List of articles frequently visited through Google", because it is the combined result of what users of Google like and how Google indexes. - Patrick 11:03 Apr 25, 2003 (UTC)

I thought the old title was more memorable. Google, as an inanimate object, can't "like" things, so I think the possibility of confusion is minimal. Ahh well - aesthetics... Martin

It was not clear, it could mean something like Wikipedia:Subjects for which a Wikipedia article is in the Google top 10, A-K. - Patrick 11:44 Apr 25, 2003 (UTC)

---

How do one see the statistics related to this page? It would be easier, then, to update it. Yves 17:34 May 15, 2003 (UTC)

For that matter, it would be helpful if we could somehow see failed searches within Wikipedia, where "failed" maybe means a search that was performed without clicking on any of the results (I know this may be hard, particularly for non-cookied users, but perhaps something is better than nothing). A lower-tech tactic would be to include a prominent link to the requested articles page on the search page, to make it easy for users to record what they were looking for. Zashaw 02:39, 19 Aug 2003 (UTC)

This page hasn't been updatd since March and is getting a bit stale. Do we have an update coming soon? --Raul654 19:27, 18 Oct 2003 (UTC)

From VfD[edit]

Wikipedia:List of articles frequently visited through Google hasn't been updated since early March. I posted in the talk pages in early October asking if there was any plan to update, and no one responded. Super-stale pages like this give a false impression of where effort should be concentrated. --Raul654 11:14, 14 Dec 2003 (UTC)
- Unfortunately, I have to agree. If a time-sensitive page isn't regularly updated, we are better off without it. Rossami 16:00, 14 Dec 2003 (UTC)
  - This may sound obvious, but why not just *update* the thing instead of deleting it? --Raul654 15:06, 15 Dec 2003 (UTC)
    - Good question. Answer: 1) Because I don't know how. 2) Because other things interest me (and apparently everyone else) more. Rossami
- Don't delete, but restore to former title "What Google Likes." I've edited the page some, to make it so it is not so time-sensitive. It is linked from quite a few places under stats under the heading "what google likes." The present heading is inappropriate and should be updated regularly or deleted. The specific data should be updated regularly, but at present is listed as a case study. However, posting at least weekly, possibly even daily a list of what pages Google is referring would be a good quality control check to make sure those pages are in good shape. Sterlingda 07:59, 16 Dec 2003 (UTC)
  - While what you are saying sounds good (I for one would love to see the page updated rapidly), as a computer engineer, I'm inclined to think it is a logistical nightmare. Specifically, running such an intensive task on such a large log (whatever webserver wiki is running) would probably take a LONG time to complete - on the order of hours for each run. And last I had heard, wiki is short on developers as it is. --Raul654 08:11, 16 Dec 2003 (UTC)
- Keep and try to update it. Its a useful tool to know which articles we should try to improve. Muriel Victoria 11:29, 17 Dec 2003 (UTC)
  - There seems to be no consensus for deletion at this point, so I've removed it from Vfd. Angela . 00:46, 20 Dec 2003 (UTC)

Given the lack of updates, if anybody cares to try, I'm moving this content from Wikipedia:Ranking items by Google hit count:

In case it's useful to anyone, a perl script to rank a long list of items by "importance", defined by number of Google hits. Rather crude, but could be helpful in cases where it's a first stab at an NPOV ordering -- developed it in order to pick a few people from an alumni list for a highlight list of particularly famous alumni.

This is not the most beautiful perl script. But for what it's worth (maybe not much!) you may copy it under GPL, additional to your rights under GFDL.


#!/usr/bin/perl

# simple, but crude script to rank terms by "importance", defined by
# number of Google hits.
#
# runs on unix; takes on stdin a list of terms - one per line
#
# outputs numbers of hits as it progresses, followed by an HTML table
# of sorted hit counts at the end.
#
# requires lynx
#======================================================================

while ($term=<STDIN>)
{
    $term =~ s/^\s+//;
    $term =~ s/\s+$//;

    $n{$term}=ngoogle("\"$term\"");  # NB adds quotes (\") around item
                                     # remove them if you don't want that.
    print "$term $n{$term}\n";
}
print "=============\n";

print "<table><tr><th>Term</th><th>Google hits</th></tr>\n";
for $term (sort {$n{$b}<=>$n{$a}} keys %n) {
    print "<tr><td>$term</td><td>$n{$term}</td></tr>\n"
}
print "</table>\n";


sub ngoogle
{
    my $term=shift;

    my $url="http://www.google.com/search?num=1&q=".escape($term);
    
    open GOOGLE,"lynx -source '$url'|" or die $!;

    my $n;
    while (<GOOGLE>) {
	if (m{Results <b>.*</b> of about <b>([0-9,]+)</b>}) {
	    ($n=$1) =~ s/,//g;
	    last;
	}
    }
    close GOOGLE;
    return $n;
}

sub escape {
    #
    # turn string into "escaped" format for GET url
    #
    my $s = shift;
    $s =~ s/([^a-zA-Z0-9_\-.])/uc sprintf("%%%02x", ord($1))/ge;
    $s =~ tr/ /+/;
    return $s;
}

Why does this page even need to exist?

MfD debate[edit]

This article survived a deletion debate. The discussion can be found here. -Splash^talk 21:32, 2 December 2005 (UTC)[reply]