Time |
Nickname |
Message |
07:54
🔗
|
Nemo_bis |
alard: do you have a Yahoo! search API key? |
07:55
🔗
|
alard |
No. |
07:56
🔗
|
alard |
Do I need one? |
07:56
🔗
|
Nemo_bis |
alard: I'm trying to get someone (or myself?) run http://www.cs.brown.edu/~pavlo/mediawiki/ again |
07:56
🔗
|
Nemo_bis |
to list all the mediawikis around |
07:58
🔗
|
Nemo_bis |
alard: do your scripts for Google use keys or are they not needed? I don't understand the requirements for http://graffiti.cs.brown.edu/svn/graffiti/src/webcrawler/crawler/googlecrawler.py |
07:58
🔗
|
alard |
The Google script doesn't need a key. You do need a (free) ipv6 tunnel. |
10:39
🔗
|
Nemo_bis |
alard: I don't know if the script supports it though |
10:39
🔗
|
Nemo_bis |
emijrp: can't Pavlo's crawler be used with Google only?one doesn't need a key for that |
10:44
🔗
|
emijrp |
try it, im busy with WLM |
10:57
🔗
|
alard |
Nemo_bis: You could look if you can modify the script. From what I've seen it's a two-step process: it does a random search for "This is MediaWiki"-like phrases, that's something that you could do with any google crawl script, and the second part is looking at each url you've found to see if it is really a MediaWiki-wiki. |
12:22
🔗
|
Nemo_bis |
alard: is that google.search it imports a standard library of some sort? |
12:23
🔗
|
Nemo_bis |
oh, this one it seems http://web.archive.org/web/20110625003240/http://blackcodeseo.com/google-search-api-for-python/ |