#archiveteam 2014-03-02,Sun

↑back Search

Time Nickname Message
00:07 🔗 SadDM hey sanqui... I don't see why it would be a problem.
00:08 🔗 sanqui okay, that's good to hear.
00:08 🔗 sanqui really I just needed to hear somebody tell me it's fine
00:08 🔗 sanqui otherwise I'd still have that feeling of doing something wrong unknowingly :p
00:08 🔗 SadDM I mean, it sounds to me that you have a log of an interesting technological and sociological experiment.
00:09 🔗 SadDM And here's the deal as I understand it for archive.org...
00:11 🔗 SadDM as long as what you're uploading isn't *totally* squirrely you're probably ok... you know, just don't upload several gigs of random bytes (at least not without some context of what those bytes are)
00:11 🔗 sanqui alright, cool
00:11 🔗 sanqui yeah, it's definitely interesting
00:11 🔗 sanqui video should probably be upload in hand, but I don't have that
00:15 🔗 sanqui uploaded*
00:32 🔗 Atluxity sanqui: for short time availability try filebin.net
00:33 🔗 Atluxity oh, I was way up in my backlog
00:33 🔗 Atluxity see you got answers
01:48 🔗 wp494 heh, good to know I'm not the only one that lurks in some Twitch chat channels through an IRC client
01:49 🔗 wp494 though I haven't been able to be arsed to do so for TPP since apparently the twitch police moved it to its own server
01:58 🔗 wp494 as for video, you can do it through downloadhelper on firefox if you're on one of the recorded videos and not the stream itself
01:58 🔗 wp494 but IIRC twitch will only let you pull down 30 minute segments
01:58 🔗 wp494 making things tricky, unless there's some other method that I'm not aware about
11:49 🔗 sanqui wp494: I haven't just been "lurking" there, I've been keeping this page with live stats: open('/home/sanky/twitchplayspokemon/fillgap/#twitchplayspokemon.converted.log', 'r')
11:49 🔗 sanqui oops, sorry.. http://sanqui.rustedlogic.net/etc/tpp/
11:52 🔗 Schbirid nice
13:09 🔗 kelenken anyone here know a good program for downloading deviantart galleries?
13:56 🔗 sanqui uploaded: https://archive.org/details/tpp_logs
15:36 🔗 godane i'm starting to upload dpk magazine: https://archive.org/details/DPK_1998_12
15:36 🔗 godane its another russian computer magazine
15:59 🔗 godane now the japanese version of kojima productions podcasts is being uploaded: https://archive.org/details/kojima_productions_hidechanradio_01
19:00 🔗 SketchCow Hugs and kisses.
19:00 🔗 SketchCow I'm a little not well.
19:00 🔗 godane SketchCow: you have the cold or something?
19:00 🔗 SketchCow I assume so.
19:01 🔗 SketchCow I nyquil'd the fuck out of myself.
19:01 🔗 SketchCow Let's operate equipment!
19:01 🔗 godane that sucks
19:01 🔗 SketchCow I don't get sick all that often.
19:01 🔗 godane SketchCow: i asked you about a 4tb hard drive last week
19:01 🔗 SketchCow Andyway, is sanqui still around?
19:02 🔗 SketchCow When he or she sees the highlight: I can help you upload the material, or you can point to where it is and I'll help get it all into the Internet Archive.
19:05 🔗 sanqui SketchCow: it's already up :) https://archive.org/details/tpp_logs
19:05 🔗 SketchCow Good.
19:05 🔗 SketchCow Let me see if I can find a better place for it.
19:05 🔗 sanqui please let me know if I should change anything about the metadata, though
19:06 🔗 SketchCow The thing with an item like that is how weird it is.
19:06 🔗 SketchCow Like, where does the ancillary discussion or activity related to an online event GO?
19:06 🔗 SketchCow This isn't your problem, it's Internet Archive/My problem
19:07 🔗 sanqui it's not just discussion! it's also the actual *inputs* used to play the game
19:07 🔗 SketchCow Yeah, I know.
19:07 🔗 SketchCow But it's the contextual items around a specific event.
19:07 🔗 SketchCow Trust me, it's a classification nightmare.
19:07 🔗 SketchCow But I'm going to put it in software for now.
19:07 🔗 sanqui though, of course, it couldn't be used to reproduce the event, due to different network lag to the other computer (obviously)
19:07 🔗 SketchCow Because texts is a nightmare shitbin.
19:07 🔗 sanqui and, I see
19:07 🔗 SketchCow Either way, your url will never ever change.
19:07 🔗 SketchCow That's permanent.
19:08 🔗 sanqui That's great, I've already thrown it around.
19:08 🔗 sanqui And yeah. I was thinking it should go under web, since it was content on the internet at some point, but maybe that's only for actual websites?
19:08 🔗 SketchCow I was sick and unable to answer your question before, but I agree that events lik this need as much material saved from them as possible.
19:09 🔗 sanqui That's okay!
19:09 🔗 SketchCow Because then someone in 5 years goes "but what if I wanted to duplicate that event - where are the materials"
19:10 🔗 sanqui 5 years? I've already had over ten people email me asking for the logs for study purposes!
19:12 🔗 SketchCow Those people are hip and connected.
19:13 🔗 SketchCow I'm talking about everyone else.
19:13 🔗 SketchCow This stuff gets passed around, sure.
19:13 🔗 SketchCow But then come the people who weren't connected to it.
19:13 🔗 SketchCow They're always sad.
19:13 🔗 SketchCow No recordings, no materials, nowhere to find them.
19:13 🔗 SketchCow So this helps fix that.
19:13 🔗 sanqui I understand!
19:13 🔗 SketchCow One of the FOS drives is down to 1tb of material on it, of which 900gb of that is hacker con materials.
19:13 🔗 sanqui I'm happy to make this data available, actually. Way better to have it on archive.org than some shady user-hostile download side.
19:13 🔗 SketchCow So that's good.
19:14 🔗 SketchCow I should dump all that.
19:14 🔗 SketchCow I will also start aggressively downloading FTP sites shortly.
19:14 🔗 SketchCow So I can get those off the to-do.
19:14 🔗 SketchCow (I have e-mails with lists of FTP sites.)
19:16 🔗 godane i'm dumping DPK magazine into my godaneinbox
19:16 🔗 godane another computer magazine from russia
19:16 🔗 midas i have to restart my ftp downloads again.
19:55 🔗 SketchCow I'm trying to desperately push myopera out of the box.
19:57 🔗 godane SketchCow: any chance of getting that hard drive i asked in PM?
19:57 🔗 godane it was a few days ago
19:58 🔗 godane i got some big projects that i can do
19:59 🔗 SketchCow I need to speak with the Archive.
19:59 🔗 SketchCow Or we can get people in here to all go in and buy you one.
19:59 🔗 SketchCow WHO WANTS TO HELP BUY GODANE A 4GB
19:59 🔗 SketchCow I MEAN A 4TB
19:59 🔗 SketchCow AND A 4GB, SO HE CAN PUT IT UNDER THE 4TB SO THE 4TB DOESN'T GET DIRTY
20:00 🔗 Schbirid i want one first to offload the jamendo ogg vorbis collection =)
20:00 🔗 godane i found stuff like this project: http://uk.reuters.com/embeddedVideo?videoId=2850
20:00 🔗 godane reuters has meta for videos going back to 2007
20:00 🔗 Schbirid actually... joepie91 are you on the flac jamendo archiving? maybe we should just let that go through and then check what albums i have that were no longer available today
20:04 🔗 godane also the other project is a fox news collection
20:04 🔗 godane with 170k+ urls so far
20:05 🔗 godane good news is i maybe able to throw out 63gb later today
20:05 🔗 godane cause i'm uploading cnn money videos that i have currently
20:05 🔗 godane that only goes up to 2009-08
20:06 🔗 godane that way i can just download a month then upload and delete
20:08 🔗 godane SketchCow: i'm also grabbing world news from hulu
20:08 🔗 godane turns out that they really did a bad job with there podcasts from 2010 to 2011-03
20:10 🔗 godane also the hulu rips may end up going into another collection
20:20 🔗 joepie91 <Schbirid>actually... joepie91 are you on the flac jamendo archiving? maybe we should just let that go through and then check what albums i have that were no longer available today
20:20 🔗 joepie91 have not gotten around to that yet
20:21 🔗 joepie91 past two weeks have been a mess
20:24 🔗 DFJustin one thing that would be good now that the tpp logs are available is to go through and archive all of the urls linked in the chat
20:24 🔗 DFJustin because they have all sorts of images and such made for the event
20:25 🔗 DFJustin I tried to run the ones I saw through web.archive.org/save but I only watched a tiny fraction of the stream
20:32 🔗 SketchCow DFJustin: Agreed
20:55 🔗 arkiver DFJustin: I'm downloading the links from the first log now
20:56 🔗 DFJustin yay
20:59 🔗 arkiver http://pastebin.com/TSNQcQje
20:59 🔗 arkiver list of first log:
20:59 🔗 arkiver downloading as warc
21:12 🔗 arkiver second log:
21:12 🔗 arkiver http://pastebin.com/riqwteQx
21:15 🔗 arkiver third log:
21:15 🔗 arkiver http://pastebin.com/uTvyJRbL
21:19 🔗 creature If I want to summon ArchiveTeam to mirror a site, is there a standard process for that?
21:23 🔗 arkiver what site is it about?
21:23 🔗 midas creature: you can drop a url here and someone will feed it to the archiveteam bot if possible
21:23 🔗 arkiver DFJustin: and last log:
21:23 🔗 arkiver http://pastebin.com/257Md61Y
21:25 🔗 creature I guess it's slightly complicated by me not being sure when's the best time to do so.
21:25 🔗 creature See, there is this thing called FAWM. February Album Writing Month. A small community where each year, people attempt to write 14 songs in 28 days.
21:26 🔗 creature The site's only open to posting new tracks in February, and I understand it remains open for comments throughout March (to give people time to listen to stuff). But then... it vanishes. And each year, when it returns, the old stuff is gone.
21:26 🔗 midas .org?
21:26 🔗 creature http://fawm.org/ is the URL.
21:26 🔗 arkiver ah
21:27 🔗 creature archive.org gets some of it, normally, but I wish there were a complete snapshot.
21:28 🔗 arkiver I just added it to archivebt
21:28 🔗 arkiver archivebot*
21:28 🔗 arkiver it will be downloaded completely then
21:28 🔗 DFJustin keep an eye on it, it may miss the mp3 embeds
21:29 🔗 arkiver hmm yes
21:29 🔗 arkiver it will miss them
21:29 🔗 chfoo i think wget won't get it since the url is in a "data-mp3-url" tag
21:30 🔗 creature Is the archivebot a one-time thing, or will it keep its copy up to date?
21:31 🔗 arkiver archivebot just creates a copy as it downloads the pages
21:31 🔗 arkiver and that's it
21:31 🔗 arkiver it won't keep it up to date
21:31 🔗 arkiver but we can add it multiple times
21:31 🔗 DFJustin it can be re-run later
21:31 🔗 ersi unless you re-run it
21:31 🔗 arkiver yes
21:31 🔗 arkiver lol
21:31 🔗 arkiver 3x same
21:31 🔗 creature Okay. Thanks. :)
21:35 🔗 arkiver chfoo: I can create a list of links from that websites, open the links in my browser autoatically one by one, get the link to the audio file
21:35 🔗 arkiver and download the audio files
21:35 🔗 arkiver that should work I think
21:36 🔗 chfoo arkiver: sounds good
21:36 🔗 arkiver shall I do that? :)
21:37 🔗 arkiver creature: do you have an exact date when the event ends?
21:37 🔗 arkiver and do you have an indication when the websites goes down again?
21:38 🔗 creature arkiver: I don't. I think it stays open for comments throughout March, and then... maybe goes read-only? Or maybe stays up, but essentially falls dormant?
21:39 🔗 arkiver hmm
21:39 🔗 arkiver I will do all the urls fro the songs next week
21:39 🔗 arkiver I'll keep you informed
21:41 🔗 creature Yeah, the songs are now static. They're not going to change.
21:41 🔗 creature Hmm. Although they *might* get added to; people can update pages they've already uploaded.
21:41 🔗 creature So if you upload a song without a demo, you can still edit it.
21:42 🔗 arkiver so a demo can be added later too?
21:42 🔗 creature Yep.
21:42 🔗 arkiver hmm
21:42 🔗 arkiver maybe I'll doo a weekly audio file rip
21:43 🔗 arkiver to be sure to have everything
21:44 🔗 creature It's hard for me to say what's most important - the songs or the comments/pages around it.
21:46 🔗 arkiver the comments/pages will be downloaded by archivebot
21:46 🔗 arkiver I'll do the songs
21:54 🔗 creature https://web.archive.org/web/20130603145229/http://fawm.org/forums/ is a snapshot of the website from June last year, suggesting that the site will stay up past March.
21:59 🔗 arkiver creature: thank you, I leaving now
21:59 🔗 arkiver I'm*
21:59 🔗 creature Thank *you* for taking a snapshot. :)
22:00 🔗 arkiver will keep you informed about the progress... ;)
22:00 🔗 creature http://fawm.org/forums/topic/4223/ is a forum post where I asked about this; Burr (the guy who runs FAWM) basically said "We have vague plans to restore some old content, but nothing concrete."
22:13 🔗 arkiver oh haha
22:13 🔗 arkiver look:
22:13 🔗 arkiver http://files.fawmers.com/FAWM_14/
22:13 🔗 arkiver :P
22:14 🔗 arkiver or hmpf
22:15 🔗 arkiver doesn't work for every song
22:15 🔗 arkiver http://matthewdivito.com/music/2014_fawm/14_feel_heartbeat.mp3
22:15 🔗 arkiver http://fawm.org/songs/43106/
22:15 🔗 arkiver however really going now, later everyone!

irclogger-viewer