[00:17] https://archive.org/details/MacrovisionAnalysisOfRecentDVDDeCSSHackNovember151999 is pure gold.
[00:25] "standard 56kb modem"
[00:27] it's pretty accurate as far as the time goes, just woefully non-forward-looking
[00:27] in 1999 I had a 4.2gb hard drive and dial-up, in a major us city
[00:29] when divx stuff first started coming out I had to use this super optimized DOS player to play at full framerate
[00:31] http://www.multimediaware.com/qv/
[00:32] Yeah, very true! I knew people in similar positions. Their wording of "2-3 years" as an eternity really makes it seem they had no plan though
[00:32] " No Windows required!"
[00:32] Wait, they take PayPal? This site is a going concern, not just a 90s relic?
[00:34] Last updated 8th July 2010 - why?
[00:35] and I say that as a relic myself!
[00:35] Most recent Pro version for DOS released 7 years ago.
[00:36] "QuickView Pro is a 32 bit protected mode program and therefore needs at least a 80386 processor and a VGA card."
[00:36] well, at least it's honest. I was not impressed a couple of years ago to discover that memtest86 needs at least a 386
[00:37] I was trying to run it on an 8086, I figured the '86' meant I was onto a winner!
[00:37] developing countries, impoverished school districts, you'd be surprised
[00:39] http://www.macworld.com.au/features/if-it-aint-broke-dont-fix-it-ancient-computers-still-in-use-45073/
[05:09] so i think my glenn beck archiving may have to be put on hold
[05:09] i'm not getting any of today's or yesterday's video
[05:09] rtmpdump keeps erroring
[05:18] finally getting wilkow
[05:33] Letting you know, my archivebot computer overheated and died prematurely. :(
[05:46] Bill yahoo!
[08:23] wow, so archivebot now officially killed a server? heh
[08:41] ?
[08:42] lol :p
[08:42] can someone archive that? don't use the bot, it might burn down ;-)
[10:57] so i should be able to get most of night line 2013 now
[10:57] i think they had the encoding or time to file name
[11:04] i got a good afternoon america episode
[11:18] midas: archive what?
[11:19] i'm currently upgrading my kernel so I can run an archivebot in a lxc
[11:19] going to try and build on gentoo, if no worky then I'll build on debian
[11:22] Smiley: "06:33:15 kyan | Letting you know, my archivebot computer overheated and died prematurely. :( "
[11:22] archiveteam, breaking all your servers
[11:23] :D
[11:23] if you melt mine, I'll cry.
[11:23] he joined, said it broke down, and quit :p
[11:25] :D
[13:47] Archiving USENET
[13:47] dashcloud: Early... pre Deja USENET can possibly be recovered from CDs produced in the mid 1990s
[13:48] Deja material is still technically on Google, albeit buried
[13:48] Post Google (i.e. post 2002 material) is trickier...
[13:49] The first issue is technical, it's not as if you can just pull a torrent of USENET from Google....
[13:50] I would strongly suggest requesting someone with access to a high volume NNTP feed and 'cheap' storage begin to archive 'new' content on USENET in parallel with Google...
[13:50] there was speculation earlier this morning that Google Groups uses a custom JavaScript protocol
[13:50] Internally yes...
[13:50] Content posted to USENET more widely will be over NNTP and UUCP as it always has been
[13:51] I.e. USENET groups of alt.*, comp.* etc. design will still appear on NNTP style feeds with NNTP style headers
[13:52] Proprietary groups are not something I am concerned with myself..)
[13:53] create a page on the archiveteam wiki with all the details and your ideas, and then point people to it
[13:53] I don't have an account there yet
[13:53] And because I'm UK based, what i can request differs...
[13:55] you might find this tool interesting: https://github.com/ZoeB/arcmesg
[14:01] dashcloud: Thanks
[14:02] dashcloud: USENET content on Google could also be retrieved by brute force
[14:03] Although you'd need a rather sophisticated generator/scraper to do that
[14:03] brute force and google isn't usually a hot idea- you certainly can't scrape search results like that
[14:03] dashcloud: The URLs used to find original postings are at least stable...
[14:04] Is it worth starting a petition, calling on Google to liberate its USENET archive to an appropriate archivist?
[14:04] There are times when I wonder if Google's forgotten it has groups
[14:09] petition no- too easy for Google to say no
[14:09] and you've already stated a 'manual' extraction would get picked up by their load balancing schemes
[14:10] This leaves parallel archiving.... but the loss of pre 2014 material potentially :(
[14:11] (Of course it could be argued that EVERY Usenet posting Google is holding is a potential copyright timebomb...)
[14:11] (Or could be in respect of the UK... as you don't give up copyright in letters you write here..)
[14:50] S[h])[r]T: can you look at abcnews.go.com for me?
[14:51] i'm looking for good xml data
[14:51] the current method will need 19 million urls pulled to see which ones have a video link and description
[14:58] http://www.archiveteam.org/index.php?title=User:Sfan00_IMG/USENET
[15:08] i'm uploading 4 xakep (hacker) russian dvds from 2004
[15:09] Hi - I have a request...
[15:09] Would it be possible to have a webform for requesting archivals?
[15:10] (Such as for sites that appear as Wikipedia citations?)
[15:10] Webcite isn't considered as stable...
[15:19] OK thanks people
[15:20] EpsilonRe: https://archive.org/web/ has a "save page now" form for individual pages
[15:21] whole sites can be crawled by our ArchiveBot on request
[15:21] OK...
[15:21] Do you have contacts within Wikipedia/
[15:21] ?
[15:21] yes, nemo_bis backs up material from the wikimedia foundation
[15:22] And finally this may be of use in respect of software archiving....
[15:22] ftp://bbc.nvg.org/pub/bbc/
[15:22] The supporting website died about a year ago, but the FTP is still up
[15:22] Most of its 'grey' status abandonware sadly :(
[15:22] e-mail it to jason@textfiles.com
[15:23] he's doing most of our ftp grabs
[16:01] but it may be oSent
[16:01] Sent him a note
[16:01] I've also noted some that should be actively excluded
[16:02] And noted that efforts should be made to trace original publishers if possible
[16:02] Sometimes in approaching the original publishers more material has been uncovered ;)
[16:04] Another site worth archiving - http://www.6581-8580.com/soasc_copyright.php - :)
[16:04] I like the note at the end ;)
[16:04] Also http://www.paula8364.com/
[16:52] Hi
[16:52] I'd like an explanation
[16:52] http://www.archiveteam.org/index.php?title=Special:Contributions/Sfan00_IMG
[16:53] Why is this blocked?
[17:18] why the hasty ban? or is that something not my business?
[17:30] hm could be rather harsh, but there might be a reason
[17:30] one does not know if the facts aren't there
[17:43] If someone has an issue, msg me.
[17:49] Did the wiki switch away from spam account filtering during registration because it just wasn't working?
[18:00] mistym: I think that filtering's still in place
[18:00] the FORSOOTH, PRITHEE TELL ME THE SECRET WORD bit
[18:00] were you referring to something else?
[18:02] yipdw: Before that - there were attempts to filter spam accounts
[18:02] I'm helping someone else who hasn't been able to open public registrations, was wondering what decent options there are. Seems like PRITHEE solution works okay though
[18:03] Ah
[18:03] On another wiki?
[18:03] We had a spam thing. Captcha.
[18:03] that worked for between 40 and 80 milliseconds
[18:04] We had to clean out, oh lord, a couple thousand spam accounts.
[18:04] Ha
[18:04] GLaDOS and my little australian buddy worked on that, hard.
[18:04] So, the PRITHEE solution is very helpful.
[18:04] (Although with the new guy we may need to change it.)
[18:05] 1. It makes them come to IRC and say hello
[18:05] 2. It makes them do something in public, i.e. a form of light hazing
[18:05] 3. it starts a conversation
[18:23] what's up with the new guy?
[18:23] (if we should be sizing people up before giving out the secret word, oops, sorry! although I didn't give it to that chap, I have a couple of others...)
[18:46] You don't have to size people up.
[18:47] Someone who jumps the hurdle of coming to IRC to get the secret word is quite good enough.
[18:47] We're just getting rid of bots and spammers. We can deal with specific people issues down the road.
[18:48] SketchCow: Yeah, makes sense. No IRC channel for this other wiki, but maybe email is good enough
[18:48] Maybe.
[18:48] See, remember, the wikicultists are very much against any friction for signing on
[18:48] So they might not like something like "mail us"
[18:49] But you could have them text a number. :)
[18:57] The site admin is not a wikicultist :b She's *already* closed registration altogether because of spam problems. She wants to open it, just in a way that doesn't also open the spam floodgates
[19:01] meh, lostprophets website is gone it seems, i know the singer was wrong but i'd love to grab that site anyway
[19:48] Can I get help with a python script I'm trying to do? (non archive purpose)
[19:50] ask the question and I
[19:50] oh
[19:50] damn
[19:50] I was going to say that someone might answer it, but ok
[19:56] Can I still ask?
[19:58] I want to scrape http://bit.ly/1c44Of2 , but I keep getting a 404 error when I try to submit data. The code I am using is http://pastebin.com/PCMLyb1e
[20:26] Anyone :3
[20:27] I hope you stick around while I look at it
[20:29] I don't plan on going anywhere (and if I do I look at the logs) :)
[20:35] I didn't see anything obviously wrong
[20:35] did you try copying and sending the exact POST data that a browser sends?
[20:38] Yes, all I am getting in the temp.htm file is "404 - File or directory not found."
[20:40] I don't see you sending __EVENTTARGET __EVENTARGUMENT and maybe more
[20:57] Hmm k, http://pastebin.com/diff.php?i=8YRKUHYD that's updated now but the same
[21:08] I fixed it, I was submitting __VIEWSTATEENCRYPTED, but that isn't on the page at all. It works now.
[21:08] Thanks for your help :0
[21:08] * :)
[21:12] ah, fucking asp viewstate
[22:52] https://www.youtube.com/watch?v=DgeYCCOWfjQ
[22:52] BIORHYTHM IS JUST A THEORY.....
[22:52] ....FOR NOW
[22:54] Oh my goodness that music is charming as hell
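
Editor's note on the 19:58 to 21:12 scraping thread: the fix reported at 21:08 amounts to posting back only the hidden fields that the page actually contains (plus __EVENTTARGET and __EVENTARGUMENT), rather than a hand-written list. The pastebin script is not reproduced in this log, so the following is only a minimal sketch of that idea and not the poster's code. The names `HiddenFieldParser` and `build_post_body` and the form field `q` are hypothetical and chosen for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urlencode


class HiddenFieldParser(HTMLParser):
    """Collect name/value pairs from <input type="hidden"> tags."""

    def __init__(self):
        super().__init__()
        self.fields = {}

    def handle_starttag(self, tag, attrs):
        if tag != "input":
            return
        attr = dict(attrs)
        if (attr.get("type") or "").lower() == "hidden" and attr.get("name"):
            self.fields[attr["name"]] = attr.get("value") or ""


def build_post_body(page_html, form_values, event_target=""):
    """Build a URL-encoded POST body from the page's own hidden fields.

    Only hidden fields present in page_html are echoed back, so a field
    such as __VIEWSTATEENCRYPTED is sent only if the page contains it.
    """
    parser = HiddenFieldParser()
    parser.feed(page_html)
    data = dict(parser.fields)
    # Assumption: the postback fields are wanted even when the page omits them.
    data["__EVENTTARGET"] = event_target
    data.setdefault("__EVENTARGUMENT", "")
    # Hypothetical visible form fields supplied by the caller.
    data.update(form_values)
    return urlencode(data).encode("ascii")


# Usage with a made-up page fragment (no network access involved):
sample_page = (
    '<form><input type="hidden" name="__VIEWSTATE" value="abc=" />'
    '<input type="text" name="q"></form>'
)
body = build_post_body(sample_page, {"q": "test"})
```

The resulting bytes could then be passed as the `data` argument of `urllib.request.urlopen` against the form's action URL, after first fetching the page with a GET to obtain fresh hidden-field values.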