Time |
Nickname |
Message |
01:16
🔗
|
SketchCow |
http://archive.org/details/archiveteam-tvtropes-2012-09 |
01:31
🔗
|
BlueMax |
oh good, someone grabbed TVT |
01:44
🔗
|
SketchCow |
A just in case copy. |
01:44
🔗
|
SketchCow |
54gb. |
01:56
🔗
|
godane |
hey SketchCow |
01:56
🔗
|
SketchCow |
Yo |
02:04
🔗
|
godane |
i uploaded the last of april 2012 episodes of gbtv |
03:18
🔗
|
bsmith095 |
so is dice on the warrior? |
03:23
🔗
|
SketchCow |
No, not yet. |
03:23
🔗
|
SketchCow |
We're still plotting. |
03:23
🔗
|
SketchCow |
Finding everything geeknet owns |
03:54
🔗
|
edsu_ |
anyone around know if internet archive's s3 api lets you create folders? |
03:54
🔗
|
edsu_ |
mine appear to be getting url encoded, e.g. http://archive.org/details/wikitweets |
03:54
🔗
|
chronomex |
hrm. |
03:54
🔗
|
chronomex |
what is your command? |
03:57
🔗
|
edsu_ |
i believe i'm doing a PUT to http://wikitweets.s3.us.archive.org/2012/09/19/030901.json |
03:57
🔗
|
chronomex |
hmmmm |
03:57
🔗
|
chronomex |
I don't pretend to know much about ias3, but that should do what you expect it to |
03:58
🔗
|
edsu_ |
ok |
03:59
🔗
|
SketchCow |
11G JPGMAG-2009-01.zip |
03:59
🔗
|
SketchCow |
11gb JPG Magazine, which shut down some time ago. |
04:37
🔗
|
underscor |
Ias3 does not allow folders atm |
04:37
🔗
|
underscor |
cc chronomex / edsu_ |
04:38
🔗
|
chronomex |
Ok |
05:46
🔗
|
Coderjoe |
geeknet? what's up with geeknet? |
05:48
🔗
|
SketchCow |
Sold. |
05:48
🔗
|
SketchCow |
To Dice.com |
05:48
🔗
|
Coderjoe |
oh crap |
05:48
🔗
|
SketchCow |
http://www.youtube.com/watch?v=TcxpbhM0DaA |
05:57
🔗
|
Coderjoe |
i know this is part of the plotting that has alread been going on, but is there already an archive of slashdot? |
05:57
🔗
|
Coderjoe |
I fear sourceforge |
05:58
🔗
|
Coderjoe |
huge, and some of the backup procedures have been problematic for me in the past |
06:00
🔗
|
godane |
i know that slashdot.com does index the stories in a more easy manner to grab |
06:01
🔗
|
godane |
example: http://slashdot.org/story/1 |
06:07
🔗
|
godane |
now this is close number: http://slashdot.org/story/174999 |
06:08
🔗
|
godane |
so we are talking about 175000+ stories just about |
06:08
🔗
|
Coderjoe |
i think it is more like 1.5 million |
06:09
🔗
|
godane |
oh |
06:09
🔗
|
Coderjoe |
1457243 |
06:09
🔗
|
* |
Coderjoe tries that |
06:09
🔗
|
chronomex |
don't forget, I'm pretty sure that it doesn't give you all the comments for a story on a single request |
06:09
🔗
|
Coderjoe |
hmm |
06:10
🔗
|
godane |
if there is that many stories then how does 174999 show a story from 2 days ago? |
06:11
🔗
|
Coderjoe |
hmm |
06:11
🔗
|
Coderjoe |
125000 gets me nothing |
06:11
🔗
|
godane |
try 125001 |
06:11
🔗
|
Coderjoe |
the story I am looking at for the geeknet sale says story/12/09/18/1457243 |
06:12
🔗
|
Coderjoe |
which is where I got the number, but it didn't work |
06:12
🔗
|
godane |
12/09/18/2249200/ |
06:13
🔗
|
godane |
got that from that from "Feds Add 9 Felony Charges Against Swartz For JSTOR Hack" |
06:13
🔗
|
Coderjoe |
*sigh* |
06:13
🔗
|
Coderjoe |
JSTOR hack... |
06:13
🔗
|
Coderjoe |
"hack" |
06:19
🔗
|
DFJustin |
someone wanna help this guy http://archive.org/details/proust-panic-download |
07:21
🔗
|
godane |
i'm on my way to having all of underground gamer forums |
07:22
🔗
|
godane |
just know some are going to be older then others by a few days |
15:41
🔗
|
SketchCow |
OK, who wants to take on a downloading of a forum? |
15:41
🔗
|
SketchCow |
http://mmoquests.com/2012/09/19/vanguard-to-get-new-forums-save-your-old-favorite-posts-vgd-vanguard/ |
15:41
🔗
|
SketchCow |
http://forums.station.sony.com/vg/posts/list.m?topic_id=59004 |
17:58
🔗
|
Wack0 |
SketchCow, what forum software |
17:59
🔗
|
SketchCow |
Not clear. |
17:59
🔗
|
SketchCow |
Sony never credits anybody |
17:59
🔗
|
alard |
I've just made a small Lua script for it. |
18:00
🔗
|
alard |
Now testing to see if it works. |
18:02
🔗
|
balrog_ |
alard: link plz? |
18:02
🔗
|
alard |
balrog_: Not yet. |
18:52
🔗
|
SketchCow |
http://makerbot.com/ |
18:52
🔗
|
SketchCow |
Quick, go |
19:11
🔗
|
alard |
Here's the download script for forums.station.sony.com. I think it works. https://gist.github.com/d9559ca9899d7a7f341f |
19:11
🔗
|
alard |
Is there anyone who wants to run it on the Vanguard forums? balrog_ ? |
19:11
🔗
|
balrog_ |
alard: ask me later... |
19:12
🔗
|
alard |
I probably won't be here later, but maybe someone will remember. |
19:15
🔗
|
alard |
There are more forums of the same type on forums.station.sony.com. Are they going too? |
19:16
🔗
|
alard |
See the list on the left: http://forums.station.sony.com/vg/user/profile.m?user_id=393 |
19:17
🔗
|
alard |
They can be saved with the same script. |
19:18
🔗
|
frame_at |
Makerbot press event now live. homepage is still broken for me, http://www.livestream.com/makerbotindustries this link here is better |
19:20
🔗
|
alard |
New location of the Sony script: https://gist.github.com/e046d761b820bfb34de8 |
19:20
🔗
|
underscor |
alard: do we have the recommended wget incantation somewhere? |
19:20
🔗
|
underscor |
I've not used wget with lua hooks yet |
19:22
🔗
|
alard |
Download-and-build-script of a recent-enough version: https://raw.github.com/ArchiveTeam/cityofheroes-grab/master/get-wget-lua.sh |
19:22
🔗
|
alard |
The code is here: https://github.com/alard/wget-lua/tree/lua |
19:23
🔗
|
underscor |
kk, got the binary |
19:28
🔗
|
mistym |
alard: Are you keeping that repo up-to-date? I remember at one point it seemed to lag behind the tarballs in the various grabs' downloads. |
19:29
🔗
|
underscor |
alard: do you have a standard/recommended set of args? |
19:29
🔗
|
alard |
mistym: Yes, it's now up to date, I think. I've restarted the repo, so it's now based on the Wget git repository. |
19:30
🔗
|
SmileyG |
you hit slashdot. |
19:30
🔗
|
mistym |
alard: Thanks! I'll move my Homebrew build script over to that. |
19:30
🔗
|
alard |
underscor: Yes, they're at the top of the script. |
19:30
🔗
|
SmileyG |
http://news.slashdot.org/story/12/09/19/1846211/all-the-tv-news-since-2009-now-available-at-the-internet-archive |
19:31
🔗
|
underscor |
alard: oops, sorry :$ |
19:31
🔗
|
underscor |
heh heh |
19:45
🔗
|
mistym |
alard: Hm, yikes. sed is whining in the ./bootstrap. Have you seen that before? |
19:53
🔗
|
alard |
mistym: No, I don't know about that. I've merged the most recent Wget commits now, so maybe that helps. (There are no changes for bootstrap.) |
20:02
🔗
|
mistym |
alard: Oh, I see what's causing it. Looks like an upstream gnulib bug actually, so I'll bug the wget maintainers to update their bootstrap script. |
20:03
🔗
|
mistym |
(It's been fixed in gnulib already.) |
20:03
🔗
|
alard |
Didn't we have that same problem before? Or is this a new bug? |
20:03
🔗
|
mistym |
This is a new one. |
20:05
🔗
|
mistym |
There's a syntax error in the sed line in the warn() function, which was being called because I didn't have a few deps up=to-date in my PATH |
20:10
🔗
|
ersi |
Hey, I wanna post it too! |
20:10
🔗
|
ersi |
http://rss.slashdot.org/~r/Slashdot/slashdot/~3/lJhjRHvOhB4/all-the-tv-news-since-2009-now-available-at-the-internet-archive |
20:31
🔗
|
godane |
i'm grabbing the utlm.org site |
20:37
🔗
|
SketchCow |
New feature added to archive.org. |
20:37
🔗
|
SketchCow |
http://archive.org/derive-wait.php |
20:37
🔗
|
SketchCow |
Tells you the current wait times for the derivations of items. |
20:40
🔗
|
mistym |
Neat! |
20:53
🔗
|
chronomex |
spiffy |
21:06
🔗
|
SketchCow |
Only hardcores need it, but there it is |
21:42
🔗
|
chronomex |
alard: are you aware that you commited an executable binary? https://github.com/ArchiveTeam/cityofheroes-grab -> wget-lua |
21:43
🔗
|
Wack0 |
SketchCow, pm |
21:47
🔗
|
alard |
chronomex: Yes, that's the binary for the warrior. (The fastest way to get it there.) |
21:47
🔗
|
chronomex |
ah, ok |
21:49
🔗
|
chronomex |
hey, what does wget-warc do when you point it at a ftp url? |
21:49
🔗
|
chronomex |
? |
21:49
🔗
|
chronomex |
any idea |
21:53
🔗
|
alard |
It writes the download to the warc file. |
21:54
🔗
|
chronomex |
ok, I guess I'll investigate further myself |
21:58
🔗
|
alard |
http://derive-wait.herokuapp.com/ |
23:02
🔗
|
* |
chronomex hacking together a warc-writing http proxy |