#wikiteam 2012-09-04,Tue

Time Nickname Message
07:54 Nemo_bis alard: do you have a Yahoo! search API key?
07:55 alard No.
07:56 alard Do I need one?
07:56 Nemo_bis alard: I'm trying to get someone (or myself?) to run http://www.cs.brown.edu/~pavlo/mediawiki/ again
07:56 Nemo_bis to list all the MediaWiki wikis around
07:58 Nemo_bis alard: do your scripts for Google use keys or are they not needed? I don't understand the requirements for http://graffiti.cs.brown.edu/svn/graffiti/src/webcrawler/crawler/googlecrawler.py
07:58 alard The Google script doesn't need a key. You do need a (free) IPv6 tunnel.
10:39 Nemo_bis alard: I don't know if the script supports it though
10:39 Nemo_bis emijrp: can't Pavlo's crawler be used with Google only? One doesn't need a key for that
10:44 emijrp try it, I'm busy with WLM
10:57 alard Nemo_bis: You could look at whether you can modify the script. From what I've seen it's a two-step process: first it does a random search for "This is MediaWiki"-like phrases, which is something you could do with any Google crawl script; the second part is looking at each URL you've found to see if it is really a MediaWiki wiki.
12:22 Nemo_bis alard: is the google.search that it imports a standard library of some sort?
12:23 Nemo_bis oh, it seems to be this one: http://web.archive.org/web/20110625003240/http://blackcodeseo.com/google-search-api-for-python/
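
The second step alard describes at 10:57 (checking whether each URL found is really a MediaWiki wiki) could be sketched roughly as below. This is not Pavlo's crawler code. It is a minimal sketch that assumes the page HTML has already been fetched, and the names `looks_like_mediawiki` and `filter_candidates` are hypothetical. It relies on the generator meta tag that MediaWiki normally emits, plus the "Powered by MediaWiki" footer text as a weaker fallback.

```python
import re
from typing import Dict, List

# Assumed markers: MediaWiki pages normally carry a generator meta tag, and
# many skins show a "Powered by MediaWiki" footer badge. The exact attribute
# order matched here is an assumption about typical output.
GENERATOR_RE = re.compile(
    r'<meta\s+name=["\']generator["\']\s+content=["\']MediaWiki\b',
    re.IGNORECASE,
)
POWERED_BY_RE = re.compile(r"Powered by MediaWiki", re.IGNORECASE)


def looks_like_mediawiki(html: str) -> bool:
    """Return True if the page source carries a typical MediaWiki marker."""
    return bool(GENERATOR_RE.search(html) or POWERED_BY_RE.search(html))


def filter_candidates(pages: Dict[str, str]) -> List[str]:
    """Keep (sorted) the URLs whose already-fetched HTML looks like MediaWiki."""
    return sorted(url for url, html in pages.items() if looks_like_mediawiki(html))
```

The footer-text fallback can give false positives (any page that merely quotes the phrase), so a stricter check would keep only the generator tag. The first step, the search itself, is left out because it depends on whichever Google crawl script is used.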
