#warrior 2013-04-15,Mon

↑back Search

Time Nickname Message
12:12 🔗 vrmlguy So, I like to peek at the URLs that I'm downloading. This week-end I found something both interesting and huge. Is there any way to clone the archive that's being built so I don't have to run my own wget?
12:18 🔗 ersi What do you mean by cloning the archive?
12:18 🔗 ersi Do you mean downloading all the data that others have downloaded?
12:28 🔗 Smiley I think he just wants that single warc?
12:30 🔗 ersi Ah.
12:55 🔗 vrmlguy yeah, I just want a copy of what I'm downloading, to avoid duplicate work
12:58 🔗 ersi The current scripts will remove what you've downloaded as soon as it's done with the uploading
12:58 🔗 ersi but you could modify it so that it'll copy/ssh it somewhere before doing that I guess - or do it manually when it's in the rsync stage
13:02 🔗 vrmlguy that sounds good. I guess I need to log into my warrior instead of treating it as a black box
13:04 🔗 ersi could always find and download that warc later on though :)
13:05 🔗 vrmlguy re-read "black box". where do the ward's wind up?
13:05 🔗 vrmlguy warc
13:06 🔗 ersi archive.org
13:06 🔗 ersi sooner or later
13:06 🔗 vrmlguy ah, of course. thanks
13:07 🔗 vrmlguy - Downloaded 49840 URLs
13:07 🔗 vrmlguy Starting WgetDownload for Item import-rkyd.posterous.com
13:14 🔗 Smiley yah we hitting massive users :o
20:17 🔗 alard Tip for vrmlguy (who's no longer here) and others: you can log in and make a hard-link to the warc file while it's downloading, so you can still get it when it's done.
21:29 🔗 ersi I'll tip him off if he arrives again

irclogger-viewer