#archiveteam 2013-08-22,Thu

↑back Search

Time Nickname Message
01:00 🔗 SketchCow OK BACK
01:00 🔗 SketchCow Sorry, I went on a huge bender.
01:02 🔗 SketchCow Hey, clarification on the Yahoo! Geocities thing. Yes, those are paid users, and the way that Yahoo!'s system is built, they can't shut off the original geocities link without also shutting off the paid link, so they have to leave it on.
01:02 🔗 SketchCow In parlance, it would be called "deprecated" and assuming anyone ever figured it out, the geocities link would absolutely die.
03:16 🔗 ivan` someone please WARC all of pr2support.willowgarage.com, there are 30GB+ of DVD ISOs that I have no disk for
03:16 🔗 ivan` it might go down http://www.willowgarage.com/blog
03:28 🔗 omf_ ivan`, is it just he isos on the download page or are there more
03:28 🔗 omf_ http://pr2support.willowgarage.com/wiki/Download
03:34 🔗 ivan` omf_: I don't know
03:34 🔗 ivan` those are the ones my wget found
13:49 🔗 omf_ groklaw is still going. 174,750 pages and counting
13:50 🔗 omf_ They have a lot of comments
17:12 🔗 frame_at_ Hi guys. Are there currently zero projects? I didn't check my archive warrior VM in a while, was suprised that it's idling.
17:21 🔗 Baljem I think the tracker's down at the moment - but don't quote me on that - just noticed I get an error page trying to look at the xanga leaderboard
17:31 🔗 omf_ frame_at_, Baljem thanks for the heads up. I notified the admins.
17:33 🔗 frame_at_ I'm getting an "HTTP error 502" thrown to the console when I start the warrior VM, might be related.
17:56 🔗 yipdw ugh, cache busters are annoying
17:56 🔗 yipdw patch.com does the Javascript pagination thing and appends two current timestamps when it makes said call
17:57 🔗 yipdw how the hell do you archive that?
17:57 🔗 yipdw I mean, sure, I can capture all the data, but the URLs are not going to match up
17:57 🔗 yipdw you'd need some way to rewrite that or freeze time
18:04 🔗 frame_at_ yipdw: sometimes sites do this to prevent caching. did you try to omit it altogether?
18:04 🔗 yipdw frame_at_: I can't omit t
18:04 🔗 yipdw it's generated by the Javascript that does the pagination
18:05 🔗 yipdw I mean, yes, in theory, I guess I could rewrite patch.com's Javascript to not do that
18:05 🔗 yipdw in practice, nope
18:05 🔗 frame_at_ yes. you would need to inject a js that removes the timestampes once the page is loaded.
18:05 🔗 frame_at_ that's ok for a one time thing, locally.
18:05 🔗 yipdw this isn't a one-time thing
18:06 🔗 yipdw though, the number of pages to deal with isn't the problem
18:06 🔗 yipdw the problem is that I'd be contaminating the WARC
18:06 🔗 yipdw I think it's far better to just do that when the WARC is viewed
18:07 🔗 yipdw i.e. for now, fuck it
18:24 🔗 closure anyone have experience with this? http://quvi.sourceforge.net/
18:24 🔗 closure seems rather nicer than youtube-dl or cclive since you get out just an url, that anything can download
18:25 🔗 yipdw closure: huh, thanks
18:25 🔗 yipdw that's neat
18:27 🔗 closure I don't know how often it breaks though
21:15 🔗 xmc yipdw: yeah, could release an "as captured" and a "as made working" version
21:19 🔗 yipdw xmc: yeah, I think that's what frame_at_ was getting at, too
21:19 🔗 xmc mmm ok
21:20 🔗 yipdw xmc: that, or adding the option to a WARC viewer
21:20 🔗 yipdw a heuristic to factor out cache busters
21:20 🔗 xmc or release a .warc overlay that makes it work
21:20 🔗 xmc hmmm
21:20 🔗 xmc extra annotation records
21:21 🔗 yipdw oh
21:21 🔗 yipdw I totally forgot about annotation records
21:22 🔗 yipdw those could be used to log processing instructions
22:36 🔗 xmc tracker is powered off for maintenance
22:36 🔗 xmc whatever
22:37 🔗 xmc probably sooner than that, giving myself some breathing space
22:38 🔗 wp494 shameless plug alert: in the meantime, while the main tracker is down, chfoo's own tracker is running the puush archival project, details in #pushharder
22:38 🔗 wp494 it just went over the 1M items done mark too
22:38 🔗 xmc :D
22:48 🔗 xmc tracker appears to be functional again
22:49 🔗 xmc that's more like it
22:52 🔗 sethish I would like to put together a timeline of the price of laser printers through the 80s. Anybody have an idea where to start looking? SketchCow
22:57 🔗 SketchCow archive.org/details/computermagazines
22:58 🔗 sethish Imagen is one laserprinter company I know from the 80s
22:58 🔗 sethish awesome. I'll start looking
23:43 🔗 ivan` ftp://78.46.141.148/videos/ HQ demoscene captures via https://www.youtube.com/watch?v=KTjnt_WSJu8&
23:44 🔗 ivan` a lot more on that ftp server

irclogger-viewer