#archiveteam 2014-02-09,Sun


Time Nickname Message
00:19 🔗 antomatic arkhive: Sky is a television company so they're not likely to be giving the programme away for free - that website is really designed for other TV companies who want to buy the programme to show on their own channel. But they might be able to help you track down an episode, it's hard to say.
00:20 🔗 antomatic I don't actually remember there being any children on the show (it was mostly actors and puppets), but you never know.
00:23 🔗 Coderjoe Sesame Street is mostly actors and puppets, but they still have (had? not sure if they still do) a few children in each episode.
00:23 🔗 Coderjoe like one of the puppets talking to a kid about a concept.
00:25 🔗 antomatic Yes, but Tiny and Crew was not Sesame Street and had nothing like as much of a budget. It wasn't an expensive show. :)
00:28 🔗 ivan` arkhive: you're probably SOL unless someone broadcasts it and someone records it
00:32 🔗 ivan` https://en.wikipedia.org/wiki/Tiny_and_Crew oh hey wikipedia says there's a VHS and DVD
00:33 🔗 ivan` I am skeptical
00:36 🔗 ivan` I don't think anyone will be broadcasting this, heh
01:25 🔗 arkhive i cannot find the dvd or vhs
01:26 🔗 arkhive i bet TTOKTV would have had it. I tried to buy that torrent site database and domain and stuff from the admin after he closed it down. I emailed him asked about it. he emailed back. then i emailed him again asking about the price blah blah. and he never emailed me back.
01:27 🔗 arkhive The Tao of Kids Television
01:28 🔗 arkhive antomatic: ya i know. it sucks
01:29 🔗 arkhive Hi,
01:29 🔗 arkhive Feel free to make an offer.
01:29 🔗 arkhive I'd strip out a few private bits of the database - the private messages etc. The userlist should be able to remain intact - you having access to it would be no different than if I'd promoted you to a mod or admin.
01:29 🔗 arkhive Cheers,
01:29 🔗 arkhive iyatoni
01:30 🔗 arkhive that was the email. last i heard of him was 12/5/09.. i guess to the rest of the world it's 5/12/09 lol but december.. anyway..
01:36 🔗 arkhive it Our only requirement at the moment is that anything uploaded isn't available to buy commercially anywhere in the world.
01:37 🔗 arkhive anyway that was its deal.
01:37 🔗 arkhive i'll stop since it's turning to -bs
01:41 🔗 arkhive Okay topic changed. Apologies for tons of posts. I have a question. I was reading an ArsTechnica article (lol didn't finish it) and it was talking about bit rot and file systems that help fight against it. Does Archive.org have some measure in place to protect against bit rot? Link to article http://arstechnica.com/information-technology/2014/01/bitrot-and-atomic-cows-inside-next-gen-filesystems/ If this is more -bs let me know and i
01:41 🔗 arkhive will be happy to move discussion.
02:35 🔗 namespace So from a five minute cursory glance at how Google Groups works. (No point in really doing more than that, I've basically never worked with HTML/HTTP/JS)
02:35 🔗 namespace It looks like the interface is JS but the actual content is served using standard HTTP.
02:36 🔗 namespace Gonna see if I can find a way to grab things with wget.
02:42 🔗 namespace Nope, not gonna be that straightforward.
04:52 🔗 yipdw namespace: if you want to grab Google Groups via the Web, you'll want to use something that interprets Javascript
04:52 🔗 yipdw phantomjs is one such tool
04:52 🔗 yipdw phantomjs, however, doesn't have built-in WARC support
04:53 🔗 yipdw so you will want to use a proxy server that saves WARCs or add it to phantomjs on your own
04:53 🔗 yipdw grabbing Javascript-heavy sites is so hit-and-miss, though, because the availability of content depends on simulating user actions
04:53 🔗 yipdw and that is a pain in the fucking ass, to put it lightly
05:12 🔗 godane so some help grab akamai.com stream files
05:12 🔗 godane i'm looking for the abcondemand ones
05:38 🔗 namespace yipdw: I wouldn't bother, except that the content involved is pretty historically significant.
05:53 🔗 chfoo hyves was javascript heavy but that didn't stop us from grabbing everything. the flash music player was even disassembled to get the music urls.
05:54 🔗 godane i decided to start mirroring more of microsoft research stuff
05:55 🔗 godane SketchCow: can i get full access to this collection: https://archive.org/details/microsoft_research_audio
06:04 🔗 godane for now i'm going to try to do a more complete mirror of microsoft research video
10:50 🔗 Leo_TCK hello, I'm wondering if anyone has archived fully the old http://unreal.epicgames.com there used to be a file called "Index.unr" which got deleted later on, this was the original server lobby before the later patches came with the browser integrated in the game
10:51 🔗 Leo_TCK it used to be here http://unreal.epicgames.com/Files/Index.unr
10:52 🔗 Leo_TCK that was way back in 1998, many years later the link became dead and later on the whole website shut down in favor of epic new stuff
10:53 🔗 Leo_TCK it was referenced on this page http://unreal.epicgames.com/ServerTips.htm
11:08 🔗 Leo_TCK also, i'm looking for the "first beta" leak from way back to 1996, this used to spread in some bbs circles mostly but also found its way on few cds like VooDoo 13 and Numbers Fate 029
11:08 🔗 Leo_TCK it was leaked by RoR = Release on Rampage
11:09 🔗 Leo_TCK it was called "The Unreal Editor! / Unreal Beta!" [1/3], just three zip files, because it had just one or two levels but it had unseen content in there too, and for game "historicians" this is important build and has been missing for ages
11:10 🔗 Leo_TCK it was like a beta demo with editor in it
11:10 🔗 Leo_TCK but none of the content survived later, hence its importance
11:11 🔗 Leo_TCK let me know if any of you guys might have either of the two things, or have an idea who has them, etc
11:13 🔗 Leo_TCK the "first beta" was referenced here for example: web.textfiles.com/ezines/TGR/review10.txt
11:14 🔗 Leo_TCK and no the "reviewer" doesn't have it anymore either, i tried to ask way ago
12:21 🔗 midas they called it index.unr?
12:27 🔗 Leo_TCK yeah
12:29 🔗 Leo_TCK index.unr is always the default feedback of the server, except it later just served as kind of a redirection but originally index.unr was the lobby map that had to be downloaded by server admins if they wanted to run a lobby
12:30 🔗 Leo_TCK but all links to it were dead basically
12:31 🔗 Leo_TCK and it only seemed to be on epic's site there for download at the server tips
12:31 🔗 Leo_TCK it could have been on some other lost pages but in general it's not available anywhere now it seems
12:31 🔗 midas makes it really hard to find something called index.X, but hey we are grabbing all the FTP servers worldwide so it might be there somewhere
12:33 🔗 Leo_TCK yea, the unreal map/teleporter system was inspired by url and html stuff to begin with, so i guess it's not surprising that default map was always supposed to be index.unr for servers back then
12:33 🔗 Leo_TCK lol
12:35 🔗 Leo_TCK either way, as for the other thing, the beta, the problem is exact filename is unknown to me and the guys who knew it disappeared or where it was once mentioned on the web
12:35 🔗 Leo_TCK but it had to be roru**01.zip to roru**03.zip
12:35 🔗 Leo_TCK i think
12:36 🔗 Leo_TCK because from that group the filenames from the same month were named in this style
12:36 🔗 Leo_TCK it could as well have been rorunr01 for example
12:36 🔗 Leo_TCK probably all uppercase though
12:36 🔗 Leo_TCK the letters
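The filename guess above can be turned into a rough filter for directory listings. This is only a sketch: the pattern (`roru`, two unknown characters, a part number 01 to 03, `.zip`, any letter case) is Leo_TCK's guess, not a confirmed naming scheme, and `ROR_PATTERN` and `is_candidate` are hypothetical names.

```python
import re

# Assumption: "roru" + two unknown characters + part number 01..03 + ".zip".
# Letter case is ignored because the real names were "probably all uppercase".
ROR_PATTERN = re.compile(r"^roru.{2}0[1-3]\.zip$", re.IGNORECASE)


def is_candidate(filename: str) -> bool:
    """Return True if filename fits the guessed RoR naming pattern."""
    return ROR_PATTERN.match(filename) is not None
```

Something like `[name for name in listing if is_candidate(name)]` would then narrow an FTP or CD file listing down to names worth checking by hand.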
13:23 🔗 Nemo_bis Added Flickr to http://archiveteam.org/index.php?title=Alive..._OR_ARE_THEY#Sites
13:34 🔗 joepie91 Nemo_bis: I maintain that we need to have some kind of 'live include' of the "yahoo properties" table on wikipedia
13:36 🔗 Nemo_bis joepie91: it's not hard
13:38 🔗 Nemo_bis there are at least three options 1) enable scary transclusion (config, needs SketchCow ) and transclude a wiki list / table on a page of ours, 2) use an RSS feed of said list, 3) use the RSS feed of the category, 4) use Wikidata queries
13:39 🔗 Nemo_bis (3) https://en.wikipedia.org/w/index.php?title=Special:RecentChangesLinked/Category:Yahoo!_acquisitions&feed=atom&target=Category%3AYahoo%21_acquisitions
13:40 🔗 Nemo_bis (2) https://en.wikipedia.org/w/index.php?title=List_of_mergers_and_acquisitions_by_Yahoo!&feed=atom&action=history
13:41 🔗 Nemo_bis wikidata queries I still have to learn :)
13:42 🔗 midas that list isnt up2date btw
13:43 🔗 Nemo_bis {{sofixit}}
13:43 🔗 midas maybe we need a google alert when the words yahoo + acquisition is found
13:43 🔗 Nemo_bis Hm https://jira.toolserver.org/browse/MAGNUS-129
13:43 🔗 Nemo_bis yep, that too would be helpful, can one make a feed out of it?
13:43 🔗 midas think so, lemme check
13:44 🔗 midas email
13:44 🔗 Nemo_bis then it's easy to use a feed2twitter service to have a nice tweet for every likely news of Yahoo! deeds
13:44 🔗 midas http://www.google.com/alerts
13:44 🔗 Nemo_bis usually it's email, yes
13:45 🔗 Nemo_bis it does give you a feed
13:45 🔗 Nemo_bis I have http://www.google.com/alerts/feeds/03733117766037168292/5703209084241006175
13:46 🔗 midas maybe archiveteam's twitter can send a message to @yahoo "hey yahoo, can you send us a message when you decide to destroy a part of the internet? thnx!"
13:46 🔗 Nemo_bis midas: with a daily reminder?
13:46 🔗 midas yeah
13:46 🔗 midas ;-)
13:46 🔗 Nemo_bis Good morning @Yahoo, what part of the internet are you going to kill today?
13:47 🔗 Nemo_bis sounds like pinky and brain that way though
13:48 🔗 Nemo_bis takes 15 min to create such a Twitter feed, anyone?
13:52 🔗 SadDM So... some time today I'll be ready to concatenate a few hundred warcs using Megawarc for the first time. Are there any gotchas I should know about?
13:55 🔗 midas yeah, make sure they upload and no errors
13:59 🔗 * SadDM isn't sure if midas was sarcastic or serious... or maybe a combination of both. |:-|
14:01 🔗 midas combination of both, i had a couple not uploading, well it uploads but then IA reports it not allowed. so keep an eye on your uploads
14:03 🔗 SadDM I was just going to make a big warc locally and then upload it via ftp... which has been my goto uploading process in the past.
14:03 🔗 SadDM are you talking about using the megawarc factory?
14:12 🔗 midas yeah
14:26 🔗 Nemo_bis Hmm I wonder what "occasional" means, no other option for feed-only alerts http://www.google.com/alerts/feeds/03733117766037168292/11115209096644139952
14:33 🔗 Nemo_bis Oh http://searchengineland.com/google-quietly-brings-back-rss-feed-option-to-google-alerts-171645
14:39 🔗 Nemo_bis For now with Wikipedia only https://twitter.com/YahooVictims
15:15 🔗 midas brb
15:15 🔗 midas nice Nemo_bis ;)
17:06 🔗 dashcloud hi, I have finally finished going through the list of FTP sites here, and marking what's alive and dead: https://www.piratepad.ca/p/old-ftp-list
17:07 🔗 dashcloud fascinating to see what's still around from 1996
17:19 🔗 Nemo_bis Ah, I thought you were talking of the early acquisitions by Yahoo :P
17:21 🔗 Nemo_bis dashcloud: is that the master list or are there more that people are working on?
17:21 🔗 Nemo_bis like, where is the list from the internetz spidering
17:25 🔗 Leo_TCK from 1996, well that's what im looking for, let me look at the list but i wonder if any of those files im searching for are there
17:41 🔗 Leo_TCK i cant really access it though, so yeah if someone could search those ftp sites for the ror files, if there's lots of 96 stuff
18:19 🔗 dashcloud Nemo_bis: the list is from the ebook version located here: https://archive.org/details/cdrom-internetgamesdirectorycd
18:19 🔗 dashcloud more details at the top of the sheet itself
18:19 🔗 Nemo_bis yes I saw that
18:22 🔗 dashcloud if you've got an easy way to separate out the URLs from each chapter for archiving, that would be great
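One possible way to pull URLs out of the ebook text is sketched here. It assumes the chapters are available as plain text, which the log does not confirm, and `extract_urls` is a hypothetical helper name. It only finds `ftp://`, `http://` and `https://` links and does not try to recover bare hostnames.

```python
import re

# Match ftp/http/https URLs up to the next whitespace, quote, or bracket.
URL_PATTERN = re.compile(r"\b(?:ftp|https?)://[^\s<>\"')\]]+")


def extract_urls(text: str) -> list[str]:
    """Return the unique URLs found in text, in order of first appearance."""
    seen = set()
    urls = []
    for match in URL_PATTERN.findall(text):
        url = match.rstrip(".,;:")  # drop sentence punctuation stuck to the URL
        if url not in seen:
            seen.add(url)
            urls.append(url)
    return urls
```

Running it once per chapter file would give one URL list per chapter to feed to an archiving tool.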
19:13 🔗 dashcloud little late now (applications by the 14th), but I think it would be great to have IA or ArchiveTeam in Google's Summer of Code sometime
20:11 🔗 Nemo_bis dashcloud: and how would that be possible, what's FLOSS in IA? heritrix only perhaps?
20:12 🔗 Nemo_bis if there's some software development project you would like a student to do (and which would find a mentor) I suggest to start writing it down :)
20:13 🔗 Nemo_bis we have a wiki for a reason ;)
20:35 🔗 dashcloud I don't have any particular ideas in mind, just thought it might be interesting
20:51 🔗 Nemo_bis Sure. Just remember to write down ideas whenever one comes to your mind
20:52 🔗 Nemo_bis We do so year-long for MediaWiki and it's been very helpful
20:57 🔗 ivan` Leo_TCK: http://web.archive.org/web/*/http://unreal.epicgames.com/Files/Index.unr
21:09 🔗 Leo_TCK doesnt work
21:09 🔗 Leo_TCK says got 302 at crawl time
21:09 🔗 Leo_TCK and redirects to unrealengine.com
21:09 🔗 Leo_TCK the first snapshot seems to be from 2011 which is after the site didnt exist anyway
21:21 🔗 DFJustin actually the first is from 2000 but it looks like it still didn't exist at that point
21:32 🔗 Leo_TCK i can't seem to follow that one
21:32 🔗 Leo_TCK it for sure was there in 98 though
21:51 🔗 ivan` I might have it in my ut-files.com grab but I don't know how to search all the zips
21:57 🔗 ivan` okay I'm searching it
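A search across many zips like the one mentioned above could look roughly like this. It is a sketch using Python's standard `zipfile` module, not necessarily what ivan` ran. The function name `find_in_zips` and the directory layout are assumptions, and it only looks at member names, not at zips nested inside other zips.

```python
import zipfile
from pathlib import Path


def find_in_zips(root, wanted="index.unr"):
    """Yield (zip_path, member_name) for members whose base name matches wanted."""
    wanted = wanted.lower()
    # Note: the "*.zip" glob is case-sensitive on most non-Windows systems.
    for zip_path in sorted(Path(root).rglob("*.zip")):
        try:
            with zipfile.ZipFile(zip_path) as archive:
                for name in archive.namelist():
                    # Compare only the file name, ignoring folders and letter case.
                    if name.rsplit("/", 1)[-1].lower() == wanted:
                        yield zip_path, name
        except zipfile.BadZipFile:
            # Skip corrupt or mislabeled files instead of aborting the scan.
            continue
```

Usage would be along the lines of `for path, member in find_in_zips("ut-files-grab"): print(path, member)`, where `ut-files-grab` stands in for wherever the grab is stored.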
22:00 🔗 Leo_TCK i doubt its there though ut-files is recent and mostly for ut i think, this was the server lobby for the original unreal
