[00:36] hi folks, what's the correct way to create a listing of the files inside a tar file without including owner or group information? (see this page: http://archiveteam.org/index.php?title=FTP for current command line) [00:48] I do tar vtf and then if I hate the owner being in there I awk it out. [00:49] so, should the commands on that page be changed? (I do believe the three commands listed came from you originally) [01:02] I don't have an issue with the user/group [01:02] I assume this is a privacy issue [01:07] you could probably sed your name to 1000 or whatever [01:17] there is currently a small manual grab script test run in #rawdogster right now [01:32] okay- thanks [01:53] there's an interesting FTP site here: http://www.gaby.de/ftp/ which can only be accessed through the web- is it possible to connect and download the stuff through FTP, or just use wget to grab it all? [02:42] dashcloud: wget is probably your best option if it's only accessible by http [11:32] FTP site accessible via HTTP only? Seems contradictory, like WWW HTTP site accessible only via FTP [11:43] is there a wayback machine channel? [11:44] there was briefly a robots.txt Disallow: / on my work's domain, and I'm hoping I can stop the history being flushed [11:44] there is #internetarchive [11:44] cheers [12:05] trs80: nothing is flushed afaik, it's just hidden [12:05] once the disallow goes away, the pages reappear ( THis is what I understand to happen, I maybe wrong). [12:09] after recrawl of course [16:31] It'll never go away [16:31] if the robots goes away, it comes back [17:02] They just wiped all the 1UP podcasts [17:02] That little community is going insane [17:11] Open Diary disappeared last night, one week warning. [17:11] ONLY FEELING A LITTLE ANXIOUS TODAY [17:11] Also, Apple iTunes now deleting podcasts [17:17] WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD [17:17] http://archiveteam.org/index.php?title=AOL_Music and http://archiveteam.org/index.php?title=Ispygames are in main page but have to repo or anything to run [17:17] Unbeholde: yahoosucks [17:18] Agreed, those need fixing. [17:18] Maybe we need an assessment day [17:18] What's on the plate, what's done, etc. [17:19] I ask thee, how I can, ah screw medieval language. Can anyone tell me how I can get hold of all the Unreal Tournament 3 files that where archived from fileplanet? In my moderator position on modDB, I am in the position to upload all the mods over to there. [17:19] https://archive.org/details/archiveteam-fileplanet [17:19] Root around a tad. [17:20] Others might help you, but that's where the result of the work is stored. [17:21] http://archiveteam.org/index.php?title=Fileplanet is our war-room stuff. [17:23] I think someone was working on a nicer interface to it too [17:23] I think a cleanup's definitely in order. [17:24] Obviously I'm still acquiring items very quickly, but one of the goals this year is improved sorting of Internet Archive stuff, so it's easier to find and better listed. [17:24] computer magazines was just the first shot across the bow. [17:25] https://www.quaddicted.com/stuff/temp/fileplanet.php?directory=/ftp1/action/unrealtournament/ [17:25] Unbeholde: That one's for you [17:25] so looks like ut stuff is in here https://archive.org/details/Fileplanet_ftp1_action [17:26] hmm that's all too old to be ut3 though [17:28] schbirid would know when he gets back [17:30] looks like ut3 stuff is in the datestamped folders [17:30] http://archiveteam.org/index.php?title=Ispygames is a smiley-only project, in practice... no reason to keep it on main page [17:31] Knock it down! [17:31] https://www.quaddicted.com/stuff/temp/fileplanet.php?filename=ut3 [17:32] "details" links have the descriptions [17:34] What really needs fixing is URLteam... it's our only constant warrior project, but the tracker has been down for months. [17:36] so how do I download all of the ut3 at once.. the ftp1 tar file. [17:37] no [17:37] there isn't an easy way right now by the looks of it [17:38] blasphemy! forsooth I say this will take some time to get through. [17:38] actually you can just batch-download from the quaddicted filename search thing, iirc he put that warning on because he wasn't sure but the stuff is all hosted on archive.org which is plenty capable of handling the load [17:40] SketchCow: I doubt it's a very useful one, but this should be moved to archiveteam collection https://archive.org/details/aol_music_sites [18:01] https://archive.org/details/aol_music_sites looks better. [18:01] And is in the archive now. [18:08] SketchCow: i have a list of things needed to be moved into the archiveteam collection as well: http://pastebin.com/HqAA0jYf [18:08] oh right, I forgot that we uploaded ptch at the same time as wretch and yahooblog [18:13] godane: some criticism for you http://archiveteam.org/index.php?title=Talk:Wallbase [18:14] who is that, and why are they not in here [18:15] Ah, ftp-ftp.hp.com_pub-2013-10 [18:15] The one that caused code changes to archive.org. [18:19] oh someone else uploaded electrickery, yay I don't have to [18:21] Genius! Preserve a wiki on mediafire! http://archiveteam.org/index.php?title=Insurgency_Wiki [18:21] And of course the dump was deleted, surprise surprise [18:23] SketchCow: here are some more items http://pastebin.com/g6w3DZFL [18:23] some of them were uploaded by people outside the "team" [18:24] so I dunno if we are implicitly trusting everyone's warcs [18:25] This is really, really inefficient use of my time, not sure how to make it better. [18:25] I guess I can do a global swap [18:27] the problem being, that at one point, mediafire was permanent [18:28] with big fat quotes around the word permanent [18:31] SketchCow: i don't think all 1UP podcasts are gone [18:34] OK, I THINK I just swapped everything over, both Nemo_bis and chfoo and DFJustin mentioned sites. [18:34] I think a larger look out for them is in order, but not this second. [18:34] Obviously I will be sorting through archiveteam at some point. [18:41] I made some cleanup of http://archiveteam.org/index.php?title=Deathwatch , please everyone help. It's a wiki. :) [18:45] looks like a good chuck of the 1up show is in wayback: https://web.archive.org/web/*/http://zdmedia.vo.llnwd.net/o1/Podcasts/* [18:50] Oh, I'm sure it is. [18:50] It's just gone from the core original site, which is what put the fear into the community around it. [19:09] recruit NeoGAF for Archive Team [19:22] uploading 2014.01.ftp.peliplaneetta.net.tar: [ ] 14567/573924 - 08:51:23 [19:22] fuck yeah 9 hours [19:22] http://teamarchive0.fnf.archive.org:8088/mrtg/networkv2.html [19:22] Watch it devastate that line [19:27] I can't even see you on http://s3.us.archive.org:8088/mrtg/networkv2.html :P [20:57] bwaaaa I'm exhausting space on another disk :( the other 30 were fine, why so unlucky always grr https://ia600300.us.archive.org/host_stats.php