#archiveteam 2014-09-17,Wed

↑back Search

Time Nickname Message
00:06 🔗 dashcloud can I use ""'s for a metadata value when uploading to IA?
01:00 🔗 SketchCow Maybe.
01:00 🔗 SketchCow I do everything with the tool now.
03:12 🔗 dashcloud the aol experiment is generating some good results now
03:13 🔗 atz0 scraping aol:// URLs from the old client?
03:14 🔗 dashcloud yep
03:14 🔗 dashcloud just downloads right now
03:16 🔗 atz0 awesome. that should be fun to look at. i recall stumbling through the old client with a few folks here and i found lots of 90s wonderfulness. like MTV stuck in a time capsule from 1996, etc.
03:17 🔗 atz0 will it eventually traverse menu tree structure, etc to find content or do you envision feeding in a URL list?
03:18 🔗 balrog atz0: want to help write a client for it?
03:18 🔗 dashcloud not sure- currently, it would be both. If you've got ideas, please join us
03:18 🔗 balrog think custom proprietary arcane low-level protocol with a similar high level protocol on top
03:19 🔗 atz0 ideas i have in plentifulness. ability to implement any, not so much
03:24 🔗 dashcloud well, if you've got AOL documentation, experience with AutoIT scripting, or just classic pages about AOL, we'd love the help
06:05 🔗 SketchCow -------------------------------------------
06:05 🔗 SketchCow Jason is back, joy joy
06:05 🔗 SketchCow If he hasn't gotten to you with whosis
06:05 🔗 SketchCow write him or ping him here or elsewhere
06:05 🔗 SketchCow -------------------------------------------
06:47 🔗 arkiver ok everyone!
06:47 🔗 arkiver let's create a channel for http://quizilla.teennick.com/
06:47 🔗 arkiver it's shutting down october 1st
06:47 🔗 arkiver and has tens of million of pages
06:47 🔗 arkiver actually around 24 million
06:52 🔗 garyrh #quitzilla ?
06:56 🔗 arkiver hah
06:56 🔗 arkiver is that going to be our standard ? :P
07:00 🔗 vantec #fizzilla
07:02 🔗 ivan` that's much better
07:03 🔗 garyrh yeah, i like that one.
07:30 🔗 wp494 voting for #fizzilla
07:31 🔗 wp494 I guess I'll put that up on the front page
07:31 🔗 arkiver let's do #fizzilla
07:31 🔗 arkiver yep do that
08:05 🔗 SketchCow fizzilla is ok. Use it.
11:31 🔗 Sum I forgot to check one of ArchiveBot's earlier crawls, and the URL doesn't appear to link to it anymore. Does ABot expire links after awhile?
12:16 🔗 midas could be completed or failed
12:46 🔗 Sum midas, it's completed, yes. Several months ago IIRC.
12:46 🔗 midas well it dissapears from the tracker after its done :)
12:48 🔗 Sum ah right. So it then would be uploaded to the IA, presumably after that.
12:51 🔗 midas yep
12:51 🔗 midas which site was it?
12:52 🔗 Sum midas, userscripts.org
12:52 🔗 midas ah yes
12:52 🔗 midas that one was cancelled
12:52 🔗 midas server went down after X gb
12:53 🔗 Sum really? I remember seeing its progress at about 18GB
12:53 🔗 Sum Had assumed it finished. Do canceled crawls still get uploaded to IA?
12:53 🔗 midas we grabbed what we could using port 8080
12:54 🔗 midas https://web.archive.org/web/20140531035630/http://userscripts.org:8080/
12:55 🔗 Sum Ah, thanks. At least there's a fairly recent copy.
12:55 🔗 midas yep
12:56 🔗 Sum It's a pity the admin didn't just decide to make the site read-only for the time being.
12:57 🔗 Sum As from what I read they had spam problems.
13:44 🔗 joepie91 xiaomi.eu has closed
13:44 🔗 joepie91 http://xiaomi.eu/community/ is still available for a few more days
13:44 🔗 joepie91 http://miuiandroid.com/ is apparently operated by the same guys
14:36 🔗 n00b288 Hey anybody there that can give me a hand? Warrior wont let me change the bandwidth limit
14:38 🔗 n00b288 What is the point of these chats if nobody replies?
14:39 🔗 midas er
14:44 🔗 DFJustin don't expect a reply in 2 minutes
14:48 🔗 antomatic Your call is important to us, please hold.
14:48 🔗 * antomatic plays some music
15:26 🔗 SketchCow Have you tried our new archive team credit card? With low introductory rates and a host of benefits, you could be saving the internet AND a whole lot of money.
15:27 🔗 lhobas And, you can take advantage of our loyalty program - each archived item earns you one (1) karma point!
15:48 🔗 raylee LOL
16:02 🔗 antomatic You know what we /should/ do?
16:02 🔗 antomatic Set up an ArchiveTeam proxy server. Then we can archive every single thing everyone looks at. :)
16:02 🔗 xmc there's an idea
16:02 🔗 xmc WarcProxy doesn't handle POST requests properly, it converts them to GET
16:03 🔗 xmc but other than that it's serviceable, I used it for a few weeks straight
16:04 🔗 raylee it also wouldn't work with ssl
16:05 🔗 xmc aye
16:05 🔗 antomatic well that's OK, we wouldn't want to archive people's bank accounts :)
16:05 🔗 xmc I don't see why not ...
16:06 🔗 DFJustin yeah I'm sure there are times when you wish you had your old balance back ;)
16:35 🔗 SketchCow ------------------------------------------------------------------------
16:36 🔗 SketchCow Statement by Jason Scott on Archive Team
16:36 🔗 SketchCow http://ascii.textfiles.com/archives/4387
16:36 🔗 SketchCow ------------------------------------------------------------------------
16:39 🔗 antomatic "You cannot arrest an idea. Or run it over with a bus."
16:39 🔗 antomatic - Confucious
16:41 🔗 vantec Avoid the back of buses also. You can get exhusted.
16:52 🔗 yipdw antomatic: that exists, somewhat
16:52 🔗 yipdw webrecorder.io
16:52 🔗 antomatic Ah.
16:52 🔗 yipdw it's one of Ilya Kreymer's things (also authored pywb)
18:06 🔗 SketchCow Wired now wants an interview.
18:10 🔗 yipdw kick his ass, sea bass
18:16 🔗 garyrh http://blog.easel.io/blog/2014/09/17/easel-is-shutting-down/
18:16 🔗 garyrh closes on "Oct 31st at 11:59pm pacific time"
18:19 🔗 yipdw " To those of you who continue to depend on easel.io, we are very sorry."
18:19 🔗 yipdw ha
18:19 🔗 yipdw " To those of you who continue to depend on easel.io, SUCKERS"
18:22 🔗 RedType This is a bittersweet day for us. We love Easel, and we use it daily. We are extremely thankful and appreciative of our loyal users. To those of you who continue to depend on easel.io, we are very sorry.
18:23 🔗 arkiver easel.io is only a building tool and not a hosting site right?
18:23 🔗 aaaaaaaaa That is the gist I got.
18:25 🔗 garyrh yeah. it was a website designer thing.
18:26 🔗 arkiver ok good. had to make sure
19:26 🔗 chfoo the tracker will be unavailable for a bit while i release some claims
19:27 🔗 chfoo ok, done
19:37 🔗 onoj can anyone tell me if there is a mirror for the warrior VM image download? the one linked on the wiki is paiiiinfully slow for me.
19:39 🔗 arkiver yeah sure
19:39 🔗 arkiver try wayback machine
19:39 🔗 arkiver maybe that's faster
19:40 🔗 arkiver actually it's already in wayback, heh
19:40 🔗 Rotab i can mega it
19:40 🔗 arkiver in IA*
19:40 🔗 Rotab oh, ok
19:41 🔗 onoj either would be amazing, seriously getting <10k/s off the wiki link. wtf.
19:45 🔗 joepie91 onoj: yeah, it slows down occasionally
20:12 🔗 n00b121 Hello. I hope this is the right place to ask. I have a collection of saved threads (text-only, no images) from 4chan and other imageboard / chan sites. It is in total ~300GB when compressed with 7z, I don't know how big it is uncompressed. There are no http header logs or warc output, ie no provenance. Is this something the archive team would be interested in adding to the collection? I don't know much about this so sor
20:13 🔗 n00b121 -sorry if it is a stupid question. Thank you for your time
20:13 🔗 xmc sure, upload it to an item on archive.org
20:13 🔗 antomatic Definitely.
22:08 🔗 SketchCow http://www.geekwire.com/2014/archive-team-twitpic-blocking-us-downloading-photos-shutdown/
22:56 🔗 amerrykan maybe public pressure will help change the situation

irclogger-viewer