[00:07] hey sanqui... I don't see why it would be a problem.
[00:08] okay, that's good to hear.
[00:08] really I just needed to hear somebody tell me it's fine
[00:08] otherwise I'd still have that feeling of doing something wrong unknowingly :p
[00:08] I mean, it sounds to me that you have a log of an interesting technological and sociological experiment.
[00:09] And here's the deal as I understand it for archive.org...
[00:11] as long as what you're uploading isn't *totally* squirrely you're probably ok... you know, just don't upload several gigs of random bytes (at least not without some context of what those bytes are)
[00:11] alright, cool
[00:11] yeah, it's definitely interesting
[00:11] video should probably be upload in hand, but I don't have that
[00:15] uploaded*
[00:32] sanqui: for short time availability try filebin.net
[00:33] oh, I was way up in my backlog
[00:33] see you got answers
[01:48] heh, good to know I'm not the only one that lurks in some Twitch chat channels through an IRC client
[01:49] though I haven't been able to be arsed to do so for TPP since apparently the twitch police moved it to its own server
[01:58] as for video, you can do it through downloadhelper on firefox if you're on one of the recorded videos and not the stream itself
[01:58] but IIRC twitch will only let you pull down 30 minute segments
[01:58] making things tricky, unless there's some other method that I'm not aware about
[11:49] wp494: I haven't just been "lurking" there, I've been keeping this page with live stats: open('/home/sanky/twitchplayspokemon/fillgap/#twitchplayspokemon.converted.log', 'r')
[11:49] oops, sorry.. http://sanqui.rustedlogic.net/etc/tpp/
[11:52] nice
[13:09] anyone here know a good program for downloading deviantart galleries?
[13:56] uploaded: https://archive.org/details/tpp_logs
[15:36] i'm starting to upload dpk magazine: https://archive.org/details/DPK_1998_12
[15:36] its another russian computer magazine
[15:59] now the japanese version of kojima productions podcasts is being uploaded: https://archive.org/details/kojima_productions_hidechanradio_01
[19:00] Hugs and kisses.
[19:00] I'm a little not well.
[19:00] SketchCow: you have the cold or something?
[19:00] I assume so.
[19:01] I nyquil'd the fuck out of myself.
[19:01] Let's operate equipment!
[19:01] that sucks
[19:01] I don't get sick all that often.
[19:01] SketchCow: i asked you about a 4tb hard drive last week
[19:01] Anyway, is sanqui still around?
[19:02] When he or she sees the highlight: I can help you upload the material, or you can point to where it is and I'll help get it all into the Internet Archive.
[19:05] SketchCow: it's already up :) https://archive.org/details/tpp_logs
[19:05] Good.
[19:05] Let me see if I can find a better place for it.
[19:05] please let me know if I should change anything about the metadata, though
[19:06] The thing with an item like that is how weird it is.
[19:06] Like, where does the ancillary discussion or activity related to an online event GO?
[19:06] This isn't your problem, it's Internet Archive/My problem
[19:07] it's not just discussion! it's also the actual *inputs* used to play the game
[19:07] Yeah, I know.
[19:07] But it's the contextual items around a specific event.
[19:07] Trust me, it's a classification nightmare.
[19:07] But I'm going to put it in software for now.
[19:07] though, of course, it couldn't be used to reproduce the event, due to different network lag to the other computer (obviously)
[19:07] Because texts is a nightmare shitbin.
[19:07] and, I see
[19:07] Either way, your url will never ever change.
[19:07] That's permanent.
[19:08] That's great, I've already thrown it around.
[19:08] And yeah. I was thinking it should go under web, since it was content on the internet at some point, but maybe that's only for actual websites?
[19:08] I was sick and unable to answer your question before, but I agree that events like this need as much material saved from them as possible.
[19:09] That's okay!
[19:09] Because then someone in 5 years goes "but what if I wanted to duplicate that event - where are the materials"
[19:10] 5 years? I've already had over ten people email me asking for the logs for study purposes!
[19:12] Those people are hip and connected.
[19:13] I'm talking about everyone else.
[19:13] This stuff gets passed around, sure.
[19:13] But then come the people who weren't connected to it.
[19:13] They're always sad.
[19:13] No recordings, no materials, nowhere to find them.
[19:13] So this helps fix that.
[19:13] I understand!
[19:13] One of the FOS drives is down to 1tb of material on it, of which 900gb of that is hacker con materials.
[19:13] I'm happy to make this data available, actually. Way better to have it on archive.org than some shady user-hostile download site.
[19:13] So that's good.
[19:14] I should dump all that.
[19:14] I will also start aggressively downloading FTP sites shortly.
[19:14] So I can get those off the to-do.
[19:14] (I have e-mails with lists of FTP sites.)
[19:16] i'm dumping DPK magazine into my godaneinbox
[19:16] another computer magazine from russia
[19:16] i have to restart my ftp downloads again.
[19:55] I'm trying to desperately push myopera out of the box.
[19:57] SketchCow: any chance of getting that hard drive i asked in PM?
[19:57] it was a few days ago
[19:58] i got some big projects that i can do
[19:59] I need to speak with the Archive.
[19:59] Or we can get people in here to all go in and buy you one.
[19:59] WHO WANTS TO HELP BUY GODANE A 4GB
[19:59] I MEAN A 4TB
[19:59] AND A 4GB, SO HE CAN PUT IT UNDER THE 4TB SO THE 4TB DOESN'T GET DIRTY
[20:00] i want one first to offload the jamendo ogg vorbis collection =)
[20:00] i found stuff like this project: http://uk.reuters.com/embeddedVideo?videoId=2850
[20:00] reuters has meta for videos going back to 2007
[20:00] actually... joepie91 are you on the flac jamendo archiving? maybe we should just let that go through and then check what albums i have that were no longer available today
[20:04] also the other project is a fox news collection
[20:04] with 170k+ urls so far
[20:05] good news is i may be able to throw out 63gb later today
[20:05] cause i'm uploading cnn money videos that i have currently
[20:05] that only goes up to 2009-08
[20:06] that way i can just download a month then upload and delete
[20:08] SketchCow: i'm also grabbing world news from hulu
[20:08] turns out that they really did a bad job with their podcasts from 2010 to 2011-03
[20:10] also the hulu rips may end up going into another collection
[20:20] actually... joepie91 are you on the flac jamendo archiving? maybe we should just let that go through and then check what albums i have that were no longer available today
[20:20] have not gotten around to that yet
[20:21] past two weeks have been a mess
[20:24] one thing that would be good now that the tpp logs are available is to go through and archive all of the urls linked in the chat
[20:24] because they have all sorts of images and such made for the event
[20:25] I tried to run the ones I saw through web.archive.org/save but I only watched a tiny fraction of the stream
[20:32] DFJustin: Agreed
[20:55] DFJustin: I'm downloading the links from the first log now
[20:56] yay
[20:59] http://pastebin.com/TSNQcQje
[20:59] list of first log:
[20:59] downloading as warc
[21:12] second log:
[21:12] http://pastebin.com/riqwteQx
[21:15] third log:
[21:15] http://pastebin.com/uTvyJRbL
[21:19] If I want to summon ArchiveTeam to mirror a site, is there a standard process for that?
[21:23] what site is it about?
[21:23] creature: you can drop a url here and someone will feed it to the archiveteam bot if possible
[21:23] DFJustin: and last log:
[21:23] http://pastebin.com/257Md61Y
[21:25] I guess it's slightly complicated by me not being sure when's the best time to do so.
[21:25] See, there is this thing called FAWM. February Album Writing Month. A small community where each year, people attempt to write 14 songs in 28 days.
[21:26] The site's only open to posting new tracks in February, and I understand it remains open for comments throughout March (to give people time to listen to stuff). But then... it vanishes. And each year, when it returns, the old stuff is gone.
[21:26] .org?
[21:26] http://fawm.org/ is the URL.
[21:26] ah
[21:27] archive.org gets some of it, normally, but I wish there were a complete snapshot.
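A minimal sketch of the link-harvesting step suggested at [20:24], collecting every URL linked in the chat logs so they can be archived. This is not the tool anyone in the channel actually used; the function name extract_urls and the deliberately simple regex are assumptions made for illustration.

```python
import re

# Assumption: a URL is "http://" or "https://" followed by non-whitespace.
# This is intentionally simple and will not handle every edge case.
URL_RE = re.compile(r"https?://\S+")


def extract_urls(lines):
    """Return the unique URLs found in an iterable of log lines, in first-seen order."""
    seen = set()
    urls = []
    for line in lines:
        for match in URL_RE.findall(line):
            # Trim punctuation that commonly trails a URL in chat text.
            url = match.rstrip(".,;:!?)\"'")
            if url not in seen:
                seen.add(url)
                urls.append(url)
    return urls


if __name__ == "__main__":
    sample = [
        "[20:59] http://pastebin.com/TSNQcQje",
        "[21:26] http://fawm.org/ is the URL.",
    ]
    for url in extract_urls(sample):
        print(url)
```

The resulting list could be written out one URL per line and handed to whichever crawler is doing the actual download.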
[21:28] I just added it to archivebt
[21:28] archivebot*
[21:28] it will be downloaded completely then
[21:28] keep an eye on it, it may miss the mp3 embeds
[21:29] hmm yes
[21:29] it will miss them
[21:29] i think wget won't get it since the url is in a "data-mp3-url" tag
[21:30] Is the archivebot a one-time thing, or will it keep its copy up to date?
[21:31] archivebot just creates a copy as it downloads the pages
[21:31] and that's it
[21:31] it won't keep it up to date
[21:31] but we can add it multiple times
[21:31] it can be re-run later
[21:31] unless you re-run it
[21:31] yes
[21:31] lol
[21:31] 3x same
[21:31] Okay. Thanks. :)
[21:35] chfoo: I can create a list of links from that website, open the links in my browser automatically one by one, get the link to the audio file
[21:35] and download the audio files
[21:35] that should work I think
[21:36] arkiver: sounds good
[21:36] shall I do that? :)
[21:37] creature: do you have an exact date when the event ends?
[21:37] and do you have an indication when the website goes down again?
[21:38] arkiver: I don't. I think it stays open for comments throughout March, and then... maybe goes read-only? Or maybe stays up, but essentially falls dormant?
[21:39] hmm
[21:39] I will do all the urls for the songs next week
[21:39] I'll keep you informed
[21:41] Yeah, the songs are now static. They're not going to change.
[21:41] Hmm. Although they *might* get added to; people can update pages they've already uploaded.
[21:41] So if you upload a song without a demo, you can still edit it.
[21:42] so a demo can be added later too?
[21:42] Yep.
[21:42] hmm
[21:42] maybe I'll do a weekly audio file rip
[21:43] to be sure to have everything
[21:44] It's hard for me to say what's most important - the songs or the comments/pages around it.
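A rough sketch of the problem raised at [21:29]: a crawler that only follows ordinary links will skip audio whose URL sits in a "data-mp3-url" attribute, so those URLs have to be pulled out separately. The attribute name comes from the chat; the class name Mp3UrlCollector, the function find_mp3_urls, and the sample markup are hypothetical, and this is not the browser-based approach arkiver describes.

```python
from html.parser import HTMLParser


class Mp3UrlCollector(HTMLParser):
    """Collect the value of every data-mp3-url attribute in a page."""

    def __init__(self):
        super().__init__()
        self.urls = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases attribute names before passing them in.
        for name, value in attrs:
            if name == "data-mp3-url" and value:
                self.urls.append(value)


def find_mp3_urls(html):
    """Return the data-mp3-url values found in an HTML string, in document order."""
    collector = Mp3UrlCollector()
    collector.feed(html)
    collector.close()
    return collector.urls


if __name__ == "__main__":
    # Hypothetical markup; the real song pages may be structured differently.
    page = '<div class="player" data-mp3-url="http://example.com/song.mp3"></div>'
    print(find_mp3_urls(page))
```

Run over each saved song page, this would yield a list of audio URLs to download alongside the crawled pages.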
[21:46] the comments/pages will be downloaded by archivebot
[21:46] I'll do the songs
[21:54] https://web.archive.org/web/20130603145229/http://fawm.org/forums/ is a snapshot of the website from June last year, suggesting that the site will stay up past March.
[21:59] creature: thank you, I leaving now
[21:59] I'm*
[21:59] Thank *you* for taking a snapshot. :)
[22:00] will keep you informed about the progress... ;)
[22:00] http://fawm.org/forums/topic/4223/ is a forum post where I asked about this; Burr (the guy who runs FAWM) basically said "We have vague plans to restore some old content, but nothing concrete."
[22:13] oh haha
[22:13] look:
[22:13] http://files.fawmers.com/FAWM_14/
[22:13] :P
[22:14] or hmpf
[22:15] doesn't work for every song
[22:15] http://matthewdivito.com/music/2014_fawm/14_feel_heartbeat.mp3
[22:15] http://fawm.org/songs/43106/
[22:15] however really going now, later everyone!