[02:54] can you have a multi-word tag (like animated gifs) or does it have to be one word?
[02:55] hm?
[02:56] I'm uploading an item to IA, and wondering if I can have multi word tags, or just single word tags?
[02:56] oh, dunno sorry
[02:57] http://archive.org/search.php?query=mediatype%3Amovies%20AND%20collection%3ANIST_WTC_Repository%20AND%20subject%3A%22No%20Tags%22
[02:57] if tags == subject, then yes, by the look of that
[02:57] s/tags/keywords/
[02:59] where should shareware and related CDs go? community texts or community media?
[02:59] community texts i think
[02:59] * winr4r pokes godane, who will know for sure
[03:10] just decided to go with community texts- it'll have to get moved anyway
[03:12] yes
[03:37] got a pretty cool upload here: https://archive.org/details/22000Animatedgifs a CD from the mid-90's with 22k animated gifs, and advertising support for Windows, Mac, and Linux (still in the process of deriving)
[03:38] These Exciting Active Images Keep Your Web Pages Stand Out from Others
[03:40] dashcloud: hey, is the cover image meant to be broken
[03:41] nice find btw
[04:21] so, who is on URLteam here and has a local copy of the URL lists
[04:22] and is awesome because they want to grep it for me
[05:14] that's a lot of gifs
[05:53] winr4r: Well, that might be soultcer if he has time and if he still has all the data. Otherwise, you'll need to download the URLTeam torrent releases and grep 'em yourself.
[05:54] ersi: yeah, the downside is my bandwidth cap :<
[05:54] but if that's what i gotta do
[06:09] so this is fun
[06:09] hak5 909 hd is out of sync for some reason
[09:18] Hi! So I've been downloading non-binary Usenet the last few months.
Got 2GB so far (turns out it's slow to do using a custom Python script over NNTP)
[09:18] Anyway, British Internet censorship seems to be tightening up, and my partner's worried I'll accidentally download a few thousand illegal posts
[09:19] I just wanted to check that some other people have Usenet covered before I stop, basically
[09:20] ZoeB: illegal how
[09:20] unless they are cp.
[09:20] and that should be blocked already by the INF (might have that name wrong) blacklist either way.
[09:20] IWF, there we go
[09:21] well eg anything depicting rape may become illegal unless it has enough artistic merit or something arbitrary sounding, so there's alt.sex.stories.moderated out or not, depending on the interpretation
[09:21] Oh, I thought that was images only.
[09:21] (I'm UK based too btw)
[09:21] I have no idea if it's images only...
[09:22] I also have no idea if my automated script is getting binary stuff that was, for whatever reason, posted outside of alt.binaries
[09:22] yeah, thinking about it I don't either.
[09:22] For all I know, 50 Shades of Grey is illegal here until picked up by a major publisher... :) it's crazy and arbitrary sounding, which is dangerous in itself
[09:22] nod.
[09:23] Anyway, my partner's getting freaked about legal implications of my digital hoarding, as she calls it :)
[09:23] Nod.
[09:23] I get that, I sometimes worry myself.
[09:23] Hmmmmm
[09:23] I can't say if the usenet stuff has been grabbed or not.
[09:24] You can get a pretty thorough backup of usenet using the Utzoo tapes, Walnut Creek CD-ROMs (which I haven't found yet), and eg MaximumUsenet.com (which I'm currently leeching via), or anyone else with decades-old retention
[09:25] I'm happy to stop for the sake of my partner's sanity, but just as that would help her sleep better at night, it'd help me sleep better at night to know that someone, anyone, is preserving all those decades of usenet.
[09:25] have you published your scripts?
[09:26] Yep: https://github.com/ZoeB/arcmesg
[09:26] good job :)
[09:26] if that counts
[09:26] It grabs over e-mail and usenet, and puts each message in a well-organised directory based on the hash of the unique message ID
[09:26] I've also nabbed lots of mailing lists (music, GNU, etc) and will import those into my stash of messages
[09:26] is it me or did zoe blade actually just show up in here
[09:27] I do that occasionally... not often. :)
[09:27] Where do you know me from? :D
[09:27] ZoeB: get lamp soundtrack
[09:28] Ah, well spotted!
[09:28] and other things!
[09:28] Ooh
[09:30] Anyway, yeah, there's the script if anyone wants it. Find a news server with decades-lasting retention, run that script, come back a few months or so later. :) Mine's running on a raspberry pi, which is slow but works very well. It's just a shame I have to stop
[09:36] hm :/
[09:37] yeah i don't know how "i just ran this script and didn't pay any attention to what the fuck it was grabbing" is much of a legal defense here
[09:37] and i can see why you would not like to be the first person to find out one way or the other
[09:38] Thats you/~? yey
[09:39] Yeah, I mean, I want to preserve usenet, but not enough to serve time... Especially as we're watching Orange is the New Black and it's so depressing... Heh
[09:40] * ZoeB stops the script -.-
[09:41] * winr4r plays funeral music
[09:41] Well it was fun while it lasted
[09:41] on the other hand, the bullshit laws aren't here *yet*
[09:44] At least I'm still archiving a mailing list or two...
[09:45] which is plenty useful!
[09:45] Yeah, it's a pretty interesting one to a small niche of people including myself, heh.
[09:48] given enough time, everything becomes interesting
[09:48] who gave a shit about saving some guy's announcement that he was making a unix clone in 1983, right
[09:48] Haha, Nina's still giving me funny looks over you "spotting" me
[09:48] * ZoeB hides
[09:49] Stallman?
Yeah, in hindsight it's easier to spot what's useful.
[09:49] Or the announcement of Berners-Lee's newfangled hypertext protocol
[09:50] Or how we should try using ":-)" to depict jokes so people don't take them seriously
[09:51] yes
[09:51] Nina being your OH?
[09:51] Yes. :)
[09:52] She's sticking Teflon to her mouse right now by the looks of things
[09:52] (may have been a cat, my cats give me funny looks all the time :))
[09:52] Haha
[09:52] No, she's my partner. :)
[09:52] And right about now I say we should take the offtopic chat to -bs if you don't mind :)
[09:52] Sorry
[09:58] omg, Zoe blade
[09:58] that's so cool
[09:59] ZoeB: I'll probably end up kicking off the scripts on a box here at archive.org, too
[10:00] ^_^
[10:00] Thanks
[10:00] Heh, Nina suspects you're humouring me now ;)
[10:00] Oh he's not. He loves abusing anything he can at IA.
[10:00] :D
[10:00] Underscor, the sadist
[10:01] fifty shades of IA
[10:01] ZoeB: I am a big fan from the soundtrack-y stuff you've done for GL, 8-bit, etc
[10:01] winr4r: Oh yes, paddle me with the 4tb one this time~
[10:01] Thanks!
[10:02] :)
[10:02] You should watch the Defcon documentary when that comes out. :)
[10:02] I've seen it!
[10:02] * underscor was part of the crew
[10:03] I wanna see it
[10:03] Oh, cool! Hmm, should we take this to -bs too?
[10:03] The audio is quite lovely.
[10:03] Oh, bah, I suppose ;P
[11:17] OK, time for me to get more work done... See you lovely people later o/
[12:11] o/
[13:05] MORNING team.
[13:05] We
[13:06] We'll see how many people from this event will join archive team.
[13:32] Did they record it?
[14:07] I did.
[14:12] SketchCow: what event?
[14:13] NDSA: National Agenda Roundtable
[14:14] aha
[14:24] http://www.ask-kalena.com/wp-content/uploads/2013/02/jason-scott-webstock2013.jpg
[14:24] Whoops
[14:24] http://www.bloomberg.com/news/2013-07-23/selling-off-detroit-s-art-could-depress-global-market.html
[14:25] hmmmm
[14:25] why is my snapjoy upload on S3 still not showing :(
[14:27] anything showing in https://archive.org/catalog.php?justme=1 ?
[14:29] snapjoy_images - History Mgr 167733559ia600904 Xamayon derive.php(2.5 hours) djsmiley2k@gmail.comdir=/26/items/snapjoy_ima..
[14:29] yeah that has to finish before it will show up
[14:29] SketchCow: you did the same switcheroo on twitter just now
[14:30] DFJustin: I always forget that URL for checking ;D
[14:30] redfishmagazinedeleted - History Mgr 126165850iw600701 Xamayon derive.php(294.3 days) << ol
[15:33] The backup of Toile Libre is up https://archive.org/details/pix_toile-libre_org 70gb of images plus sql database dumps.
[15:36] If no one else has backed up buzzdata that will be next on my list. I am going to scrape the user list and then use the api to grab all the datasets
[15:41] [10:28] snapjoy_images - History Mgr 167733559ia600904 Xamayon derive.php(2.5 hours) djsmiley2k@gmail.comdir=/26/items/snapjoy_ima.. <--- what?
[15:42] Xamayon: he downloaded snapjoy
[15:42] give him a hug
[15:44] Xamayon: oh, your name got in there because i think he posted from a tab-separated file, with X as one of the columns :)
[15:45] ahhhh, okay... thought someone might be impersonating me for a second :D
[16:14] http://www.webcitation.org/
[16:15] WebCite is under threat of shutdown
[16:15] unless fundraising goal is reached
[16:15] code red
[16:16] joepie91: of not taking new submissions, which is a bit different
[16:17] winr4r: which is usually followed by...
[16:18] SketchCow: t jason can you upload the audio, i can try to clean it up if it needs it and if you don't have time
[16:18] joepie91: yeah, let's hope not
[16:22] joepie91: Yes, it's been like that for quite a while unfortunately
[16:24] winr4r: Tell me what URLs to look for and I can search the urlteam data. It will take a couple of days though.
[16:29] soultcer: *.webtv.net
[16:30] kk
[17:01] it makes sense that we'd just approach webcite's people and ask for a copy of the db if they go under?
[17:16] yeah they don't seem like the types to just throw the hdds into the wood chipper when the money runs out
[17:20] it's worth supporting as a tool to help ordinary people preserve stuff though
[18:29] something for the gaming archivists: gamebanana file servers/mirrors(?) seem easy to mirror: http://dl-files.templeservers.com/ http://8.bitgekko.net/gbanana/ are two random ones i found by accident
[19:52] and my wifi keeps sucking
[20:36] OK, taking a plane to Kansas now.
[20:43] l8r!
[21:02] SketchCow: have fun!