[00:06] can I use ""'s for a metadata value when uploading to IA?
[01:00] Maybe.
[01:00] I do everything with the tool now.
[03:12] the aol experiment is generating some good results now
[03:13] scraping aol:// URLs from the old client?
[03:14] yep
[03:14] just downloads right now
[03:16] awesome. that should be fun to look at. i recall stumbling through the old client with a few folks here and i found lots of 90s wonderfulness. like MTV stuck in a time capsule from 1996, etc.
[03:17] will it eventually traverse menu tree structure, etc to find content or do you envision feeding in a URL list?
[03:18] atz0: want to help write a client for it?
[03:18] not sure- currently, it would be both. If you've got ideas, please join us
[03:18] think custom proprietary arcane low-level protocol with a similar high level protocol on top
[03:19] ideas i have in plentifulness. ability to implement any, not so much
[03:24] well, if you've got AOL documentation, experience with AutoIT scripting, or just classic pages about AOL, we'd love the help
[06:05] -------------------------------------------
[06:05] Jason is back, joy joy
[06:05] If he hasn't gotten to you with whosis
[06:05] write him or ping him here or elsewhere
[06:05] -------------------------------------------
[06:47] ok everyone!
[06:47] let's create a channel for http://quizilla.teennick.com/
[06:47] it's shutting down october 1st
[06:47] and has tens of millions of pages
[06:47] actually around 24 million
[06:52] #quitzilla ?
[06:56] hah
[06:56] is that going to be our standard ? :P
[07:00] #fizzilla
[07:02] that's much better
[07:03] yeah, i like that one.
[07:30] voting for #fizzilla
[07:31] I guess I'll put that up on the front page
[07:31] let's do #fizzilla
[07:31] yep do that
[08:05] fizzilla is ok. Use it.
[11:31] I forgot to check one of ArchiveBot's earlier crawls, and the URL doesn't appear to link to it anymore. Does ABot expire links after awhile?
[12:16] could be completed or failed
[12:46] midas, it's completed, yes. Several months ago IIRC.
[12:46] well it disappears from the tracker after its done :)
[12:48] ah right. So it then would be uploaded to the IA, presumably after that.
[12:51] yep
[12:51] which site was it?
[12:52] midas, userscripts.org
[12:52] ah yes
[12:52] that one was cancelled
[12:52] server went down after X gb
[12:53] really? I remember seeing its progress at about 18GB
[12:53] Had assumed it finished. Do canceled crawls still get uploaded to IA?
[12:53] we grabbed what we could using port 8080
[12:54] https://web.archive.org/web/20140531035630/http://userscripts.org:8080/
[12:55] Ah, thanks. At least there's a fairly recent copy.
[12:55] yep
[12:56] It's a pity the admin didn't just decide to make the site read-only for the time being.
[12:57] As from what I read they had spam problems.
[13:44] xiaomi.eu has closed
[13:44] http://xiaomi.eu/community/ is still available for a few more days
[13:44] http://miuiandroid.com/ is apparently operated by the same guys
[14:36] Hey anybody there that can give me a hand? Warrior won't let me change the bandwidth limit
[14:38] What is the point of these chats if nobody replies?
[14:39] er
[14:44] don't expect a reply in 2 minutes
[14:48] Your call is important to us, please hold.
[14:48] * antomatic plays some music
[15:26] Have you tried our new archive team credit card? With low introductory rates and a host of benefits, you could be saving the internet AND a whole lot of money.
[15:27] And, you can take advantage of our loyalty program - each archived item earns you one (1) karma point!
[15:48] LOL
[16:02] You know what we /should/ do?
[16:02] Set up an ArchiveTeam proxy server. Then we can archive every single thing everyone looks at. :)
[16:02] there's an idea
[16:02] WarcProxy doesn't handle POST requests properly, it converts them to GET
[16:03] but other than that it's serviceable, I used it for a few weeks straight
[16:04] it also wouldn't work with ssl
[16:05] aye
[16:05] well that's OK, we wouldn't want to archive people's bank accounts :)
[16:05] I don't see why not ...
[16:06] yeah I'm sure there are times when you wish you had your old balance back ;)
[16:35] ------------------------------------------------------------------------
[16:36] Statement by Jason Scott on Archive Team
[16:36] http://ascii.textfiles.com/archives/4387
[16:36] ------------------------------------------------------------------------
[16:39] "You cannot arrest an idea. Or run it over with a bus."
[16:39] - Confucius
[16:41] Avoid the back of buses also. You can get exhausted.
[16:52] antomatic: that exists, somewhat
[16:52] webrecorder.io
[16:52] Ah.
[16:52] it's one of Ilya Kreymer's things (also authored pywb)
[18:06] Wired now wants an interview.
[18:10] kick his ass, sea bass
[18:16] http://blog.easel.io/blog/2014/09/17/easel-is-shutting-down/
[18:16] closes on "Oct 31st at 11:59pm pacific time"
[18:19] " To those of you who continue to depend on easel.io, we are very sorry."
[18:19] ha
[18:19] " To those of you who continue to depend on easel.io, SUCKERS"
[18:22] This is a bittersweet day for us. We love Easel, and we use it daily. We are extremely thankful and appreciative of our loyal users. To those of you who continue to depend on easel.io, we are very sorry.
[18:23] easel.io is only a building tool and not a hosting site right?
[18:23] That is the gist I got.
[18:25] yeah. it was a website designer thing.
[18:26] ok good. had to make sure
[19:26] the tracker will be unavailable for a bit while i release some claims
[19:27] ok, done
[19:37] can anyone tell me if there is a mirror for the warrior VM image download? the one linked on the wiki is paiiiinfully slow for me.
[19:39] yeah sure
[19:39] try wayback machine
[19:39] maybe that's faster
[19:40] actually it's already in wayback, heh
[19:40] i can mega it
[19:40] in IA*
[19:40] oh, ok
[19:41] either would be amazing, seriously getting <10k/s off the wiki link. wtf.
[19:45] onoj: yeah, it slows down occasionally
[20:12] Hello. I hope this is the right place to ask. I have a collection of saved threads (text-only, no images) from 4chan and other imageboard / chan sites. It is in total ~300GB when compressed with 7z, I don't know how big it is uncompressed. There are no http header logs or warc output, ie no provenance. Is this something the archive team would be interested in adding to the collection? I don't know much about this so sor
[20:13] -sorry if it is a stupid question. Thank you for your time
[20:13] sure, upload it to an item on archive.org
[20:13] Definitely.
[22:08] http://www.geekwire.com/2014/archive-team-twitpic-blocking-us-downloading-photos-shutdown/
[22:56] maybe public pressure will help change the situation