[00:19] arkhive: Sky is a television company so they're not likely to be giving the programme away for free - that website is really designed for other TV comapnies who want to buy the programme to show on their own channel. But they might be able to help you track down an episode, it's hard to say. [00:20] I don't actually remember there being any children on the show (it was mostly actors and puppets), but you never know. [00:23] Sesame Street is mostly actors and puppets, but they still have (had? not sure if they still do) a few children in each episode. [00:23] like one of the puppets talking to a kid about a concept. [00:25] Yes, but Tiny and Crew was not Sesame Street and had nothing like as much of a budget. It wasn't an expensive show. :) [00:28] arkhive: you're probably SOL unless someone broadcasts it and someone records it [00:32] https://en.wikipedia.org/wiki/Tiny_and_Crew oh hey wikipedia says there's a VHS and DVD [00:33] I am skeptical [00:36] I don't think anyone will be broadcasting this, heh [01:25] i cannot find the dvd or vhs [01:26] i bet TTOKTV would have had it. I tried to buy that torrent site database and domain and stuff from the admin after he closed it down. I emailed him asked about it. he emailed back. then i emailed him again asking about the price blah blah. and he never emailed me back. [01:27] The Tao of Kids Television [01:28] antomatic: ya i know. it sucks [01:29] Hi, [01:29] Feel free to make an offer. [01:29] I'd strip out a few private bits of the database - the private messages etc. The userlist should be able to remain intact - you having access to it would be no different than if I'd promoted you to a mod or admin. [01:29] Cheers, [01:29] iyatoni [01:30] that was the email. last i heard of him was 12/5/09.. i guess to the rest of the world it's 5/12/09 lol but december.. anyway.. [01:36] it Our only requirement at the moment is that anything uploaded isn't available to buy commercially anywhere in the world. [01:37] anyway that was it's deal. [01:37] i'll stop since it's turning to -bs [01:41] Okay topic changed. Appologies for tons of posts. I have a question. I was reading a ArsTechnica article(lol didn't finish it) and it was talking about bit rot and file systems that help fight against it. Does Archive.org have some measure in place to protect against bit rot? Link to article http://arstechnica.com/information-technology/2014/01/bitrot-and-atomic-cows-inside-next-gen-filesystems/ If this is more -bs let me know and i [01:41] will be happy to move discussion. [02:35] So from a five minute cursory glance at how Google Groups works. (No point in really doing more than that, I've basically never worked with HTML/HTTP/JS) [02:35] It looks like the interface is JS but the actual content is served using standard HTTP. [02:36] Gonna see if I can find a way to grab things with wget. [02:42] Nope, not gonna be that straightforward. [04:52] namespace: if you want to grab Google Groups via the Web, you'll want to use something that interprets Javascript [04:52] phantomjs is one such tool [04:52] phantomjs, however, doesn't have built-in WARC support [04:53] so you will want to use a proxy server that saves WARCs or add it to phantomjs on your own [04:53] grabbing Javascript-heavy sites is so hit-and-miss, though, because the availability of content depends on simulating user actions [04:53] and that is a pain in the fucking ass, to put it lightly [05:12] so some help grab akamai.com stream files [05:12] i'm looking for the abcondemand ones [05:38] yipdw: I wouldn't bother, except that the content involved is pretty historically significant. [05:53] hyves was javascript heavy but that didn't stop us from grabbing everything. the flash music player was even disassembled to get the music urls. [05:54] i decide to start mirroring more of microsoft research stuff [05:55] SketchCow: can i get full access to this collection: https://archive.org/details/microsoft_research_audio [06:04] for now i'm going to try to do a more complete mirror of microsoft research video [10:50] hello, I'm wondering if anyone has archived fully the old http://unreal.epicgames.com there used to be a file called "Index.unr" which got deleted later on, this was the original server lobby before the later patches came with the browser integrated in the game [10:51] it used to be here http://unreal.epicgames.com/Files/Index.unr [10:52] that was way back in 1998, many years later the link became dead and later on the whole website shut down in favor of epic new stuff [10:53] it was referenced on this page http://unreal.epicgames.com/ServerTips.htm [11:08] also, i'm looking for the "first beta" leak from way back to 1996, this used to spread in some bbs circles mostly but also found its way on few cds like VooDoo 13 and Numbers Fate 029 [11:08] it was leaked by RoR = Release on Rampage [11:09] it was called "The Unreal Editor! / Unreal Beta!" [1/3], just three zip files, because it had just one or two levels but it had unseen content in there too, and for game "historicians" this is important build and has been missing for ages [11:10] it was like a beta demo with editor in it [11:10] but none of the content survived later, hence its importance [11:11] let me know if either of the two things any of you guys might have, or have idea who has it, etc [11:13] the "first beta" was referenced here fr example: web.textfiles.com/ezines/TGR/review10.txt [11:14] and no the "reviewer" doesn't have it anymore either, i tried to ask way ago [12:21] they called it index.unr? [12:27] yeah [12:29] index.unr is always the default feedback of the server, except it later just served as kind of a redirection but originally index.unr was the lobby map that had to be downloaded by server admins if they wanted to run a lobby [12:30] but all links to it were dead basically [12:31] and it only seemed to be on epic's site there for download at the server tips [12:31] it coul have been on some other lost pages but in general its not avaiable anywhere now it seems [12:31] makes it really hard to find something called index.X, but hey we are grabbing all the FTP servers worldwide so it might be there somewhere [12:33] yea, the unreal map/teleporter system was inspired by url and html stuf to begin with, so i guess its not surprising that default map was always supposed to be index.unr for servers back then [12:33] lol [12:35] either way, as for the other thing, the beta, the problem is exact filename is unknown to me and the guys who knew it disapeared or where it was once mentioned on the web [12:35] but it had to be roru**01.zip to roru**03.zip [12:35] i think [12:36] because from that group the filenames from the same month were named in this style [12:36] it could as well have been rorunr01 fr example [12:36] probably all uppercase though [12:36] the letters [13:23] Added Flickr to http://archiveteam.org/index.php?title=Alive..._OR_ARE_THEY#Sites [13:34] Nemo_bis: I maintain that we need to have some kind of 'live include' of the "yahoo properties" table on wikipedia [13:36] joepie91: it's not hard [13:38] there are at least tree options 1) enable scary transclusion (config, needs SketchCow ) and transclude a wiki list / table on a page of ours, 2) use an RSS feed of said list, 3) use the RSS feed of the category, 4) use Wikidata queries [13:39] (3) https://en.wikipedia.org/w/index.php?title=Special:RecentChangesLinked/Category:Yahoo!_acquisitions&feed=atom&target=Category%3AYahoo%21_acquisitions [13:40] (2) https://en.wikipedia.org/w/index.php?title=List_of_mergers_and_acquisitions_by_Yahoo!&feed=atom&action=history [13:41] wikidata queries I still have to learn :) [13:42] that list isnt up2date btw [13:43] {{sofixit}} [13:43] maybe we need a google alert when the words yahoo + acquisition is found [13:43] Hm https://jira.toolserver.org/browse/MAGNUS-129 [13:43] yep, that too would be helpful, can one make a feed out of it? [13:43] think so, lemme check [13:44] email [13:44] then it's easy to use a feed2twitter service to have a nice tweet for every likely news of Yahoo! deeds [13:44] http://www.google.com/alerts [13:44] usually it's email, yes [13:45] it does give you a feed [13:45] I have http://www.google.com/alerts/feeds/03733117766037168292/5703209084241006175 [13:46] maybe archiveteam's twitter can send a message to @yahoo "hey yahoo, can you send us a message when you decide to destroy a part of the internet? thnx!" [13:46] midas: with a daily reminder? [13:46] yeah [13:46] ;-) [13:46] Good morning @Yahoo, what part of the internet are you going to kill today? [13:47] sounds like pinky and brain that way though [13:48] takes 15 min to create such a Twitter feed, anyone? [13:52] So... some time today I'll be ready to concatenate a few hundred warcs using Megawarc for the first time. Are there any gotchas I should know about? [13:55] yeah, make sure they upload and no errors [13:59] * SadDM isn't sure if midas was sarcastic or serious... or maybe a combination of both. |:-| [14:01] combination of both, i had a couple not uploading, well it uploads but then IA reports it not allowed. so keep an eye on your uploads [14:03] I was just going to make a big warc locally and then upload it via ftp... which has been my goto uploading process in the past. [14:03] are you alking about using the megawarc factory? [14:12] yeah [14:26] Hmm I wonder what "occasional" means, no other option for feed-only alerts http://www.google.com/alerts/feeds/03733117766037168292/11115209096644139952 [14:33] Oh http://searchengineland.com/google-quietly-brings-back-rss-feed-option-to-google-alerts-171645 [14:39] For now with Wikipedia only https://twitter.com/YahooVictims [15:15] brb [15:15] nice Nemo_bis ;) [17:06] hi, I have finally finished going through the list of FTP sites here, and marking what's alive and dead: https://www.piratepad.ca/p/old-ftp-list [17:07] fascinating to see what's still around from 1996 [17:19] Ah, I thought you were talking of the early acquisitions by Yahoo :P [17:21] dashcloud: is that the master list or are there more people are working on? [17:21] like, where is the list from the internetz spidering [17:25] from 1996, well that's what im looking for, let me look at the list but i wonder if any o those files im searching for are there [17:41] i cant really acess it though, so yeah if someone could search those ftp sites for the ror files, if there's lots of 96 stuff [18:19] Nemo_bis: the list is from the ebook version located here: https://archive.org/details/cdrom-internetgamesdirectorycd [18:19] more details at the top of the sheet itself [18:19] yes I saw that [18:22] if you've got an easy way to separate out the URLs from each chapter for archiving, that would be great [19:13] little late now (applications by the 14th), but I think it would be great to have IA or ArchiveTeam in Google's Summer of Code sometime [20:11] dashcloud: and how would that be possible, what's FLOSS in IA? heritrix only perhaps? [20:12] if there's some software development project you would like a student to do (and which would find a mentor) I suggest to start writing it down :) [20:13] we have a wiki for a reason ;) [20:35] I don't have any particular ideas in mind, just thought it might be interesting [20:51] Sure. Just remember to write down ideas whenever one comes to your mind [20:52] We do so year-long for MediaWiki and it's been very helpful [20:57] Leo_TCK: http://web.archive.org/web/*/http://unreal.epicgames.com/Files/Index.unr [21:09] doesnt work [21:09] says got 302 at crawl time [21:09] and redirects to unrealengine.com [21:09] the first snapshot seems to be from 2011 which is after the site didnt exist anyway [21:21] actually the first is from 2000 but it looks like it still didn't exist at that point [21:32] i can't seem to follow that one [21:32] it for sure was there in 98 though [21:51] I might have it in my ut-files.com grab but I don't know how to search all the zips [21:57] okay I'm searching it [22:00] i doubt its there though ut-files is recent and mostly for ut i think, this was the server lobby for the original unreal