#archiveteam 2014-02-07,Fri

↑back Search

Time Nickname Message
00:36 🔗 dashcloud hi folks, what's the correct way to create a listing of the files inside a tar file without including owner or group information? (see this page: http://archiveteam.org/index.php?title=FTP for current command line)
00:48 🔗 SketchCow I do tar vtf and then if I hate the owner being in there I awk it out.
00:49 🔗 dashcloud so, should the commands on that page be changed? (I do believe the three commands listed came from you originally)
01:02 🔗 SketchCow I don't have an issue with the user/group
01:02 🔗 SketchCow I assume this is a privacy issue
01:07 🔗 xmc you could probably sed your name to 1000 or whatever
01:17 🔗 chfoo there is currently a small manual grab script test run in #rawdogster right now
01:32 🔗 dashcloud okay- thanks
01:53 🔗 dashcloud there's an interesting FTP site here: http://www.gaby.de/ftp/ which can only be accessed through the web- is it possible to connect and download the stuff through FTP, or just use wget to grab it all?
02:42 🔗 xmc dashcloud: wget is probably your best option if it's only accessible by http
11:32 🔗 Nemo_bis FTP site accessible via HTTP only? Seems contradictory, like WWW HTTP site accessible only via FTP
11:43 🔗 trs80 is there a wayback machine channel?
11:44 🔗 trs80 there was briefly a robots.txt Disallow: / on my work's domain, and I'm hoping I can stop the history being flushed
11:44 🔗 Schbirid there is #internetarchive
11:44 🔗 trs80 cheers
12:05 🔗 Smiley trs80: nothing is flushed afaik, it's just hidden
12:05 🔗 Smiley once the disallow goes away, the pages reappear ( THis is what I understand to happen, I maybe wrong).
12:09 🔗 Nemo_bis after recrawl of course
16:31 🔗 SketchCow It'll never go away
16:31 🔗 SketchCow if the robots goes away, it comes back
17:02 🔗 SketchCow They just wiped all the 1UP podcasts
17:02 🔗 SketchCow That little community is going insane
17:11 🔗 SketchCow Open Diary disappeared last night, one week warning.
17:11 🔗 SketchCow ONLY FEELING A LITTLE ANXIOUS TODAY
17:11 🔗 SketchCow Also, Apple iTunes now deleting podcasts
17:17 🔗 Unbeholde WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD
17:17 🔗 Nemo_bis http://archiveteam.org/index.php?title=AOL_Music and http://archiveteam.org/index.php?title=Ispygames are in main page but have to repo or anything to run
17:17 🔗 Nemo_bis Unbeholde: yahoosucks
17:18 🔗 SketchCow Agreed, those need fixing.
17:18 🔗 SketchCow Maybe we need an assessment day
17:18 🔗 SketchCow What's on the plate, what's done, etc.
17:19 🔗 Unbeholde I ask thee, how I can, ah screw medieval language. Can anyone tell me how I can get hold of all the Unreal Tournament 3 files that where archived from fileplanet? In my moderator position on modDB, I am in the position to upload all the mods over to there.
17:19 🔗 SketchCow https://archive.org/details/archiveteam-fileplanet
17:19 🔗 SketchCow Root around a tad.
17:20 🔗 SketchCow Others might help you, but that's where the result of the work is stored.
17:21 🔗 SketchCow http://archiveteam.org/index.php?title=Fileplanet is our war-room stuff.
17:23 🔗 DFJustin I think someone was working on a nicer interface to it too
17:23 🔗 SketchCow I think a cleanup's definitely in order.
17:24 🔗 SketchCow Obviously I'm still acquiring items very quickly, but one of the goals this year is improved sorting of Internet Archive stuff, so it's easier to find and better listed.
17:24 🔗 SketchCow computer magazines was just the first shot across the bow.
17:25 🔗 DFJustin https://www.quaddicted.com/stuff/temp/fileplanet.php?directory=/ftp1/action/unrealtournament/
17:25 🔗 SketchCow Unbeholde: That one's for you
17:25 🔗 DFJustin so looks like ut stuff is in here https://archive.org/details/Fileplanet_ftp1_action
17:26 🔗 DFJustin hmm that's all too old to be ut3 though
17:28 🔗 DFJustin schbirid would know when he gets back
17:30 🔗 DFJustin looks like ut3 stuff is in the datestamped folders
17:30 🔗 Nemo_bis http://archiveteam.org/index.php?title=Ispygames is a smiley-only project, in practice... no reason to keep it on main page
17:31 🔗 SketchCow Knock it down!
17:31 🔗 DFJustin https://www.quaddicted.com/stuff/temp/fileplanet.php?filename=ut3
17:32 🔗 DFJustin "details" links have the descriptions
17:34 🔗 Nemo_bis What really needs fixing is URLteam... it's our only constant warrior project, but the tracker has been down for months.
17:36 🔗 Unbeholde so how do I download all of the ut3 at once.. the ftp1 tar file.
17:37 🔗 DFJustin no
17:37 🔗 DFJustin there isn't an easy way right now by the looks of it
17:38 🔗 Unbeholde blasphemy! forsooth I say this will take some time to get through.
17:38 🔗 DFJustin actually you can just batch-download from the quaddicted filename search thing, iirc he put that warning on because he wasn't sure but the stuff is all hosted on archive.org which is plenty capable of handling the load
17:40 🔗 Nemo_bis SketchCow: I doubt it's a very useful one, but this should be moved to archiveteam collection https://archive.org/details/aol_music_sites
18:01 🔗 SketchCow https://archive.org/details/aol_music_sites looks better.
18:01 🔗 SketchCow And is in the archive now.
18:08 🔗 chfoo SketchCow: i have a list of things needed to be moved into the archiveteam collection as well: http://pastebin.com/HqAA0jYf
18:08 🔗 yipdw oh right, I forgot that we uploaded ptch at the same time as wretch and yahooblog
18:13 🔗 Nemo_bis godane: some criticism for you http://archiveteam.org/index.php?title=Talk:Wallbase
18:14 🔗 yipdw who is that, and why are they not in here
18:15 🔗 SketchCow Ah, ftp-ftp.hp.com_pub-2013-10
18:15 🔗 SketchCow The one that caused code changes to archive.org.
18:19 🔗 DFJustin oh someone else uploaded electrickery, yay I don't have to
18:21 🔗 Nemo_bis Genius! Preserve a wiki on mediafire! http://archiveteam.org/index.php?title=Insurgency_Wiki
18:21 🔗 Nemo_bis And of course the dump was deleted, surprise surprise
18:23 🔗 DFJustin SketchCow: here are some more items http://pastebin.com/g6w3DZFL
18:23 🔗 DFJustin some of them were uploaded by people outside the "team"
18:24 🔗 DFJustin so I dunno if we are implicitly trusting everyone's warcs
18:25 🔗 SketchCow This is really, really inefficient use of my time, not sure how to make it better.
18:25 🔗 SketchCow I guess I can do a global swap
18:27 🔗 RedType the problem being, that at one point, mediafire was permanent
18:28 🔗 RedType with big fat quotes around the word permanent
18:31 🔗 godane SketchCow: i don't think all 1UP podcasts are gone
18:34 🔗 SketchCow OK, I THINK I just swapped everything over, both Nemo_bis and chfoo and DFJustin mentioned sites.
18:34 🔗 SketchCow I think a larger look out for them is in order, but not this second.
18:34 🔗 SketchCow Obviously I will be sorting through archiveteam at some point.
18:41 🔗 Nemo_bis I made some cleanup of http://archiveteam.org/index.php?title=Deathwatch , please everyone help. It's a wiki. :)
18:45 🔗 godane looks like a good chuck of the 1up show is in wayback: https://web.archive.org/web/*/http://zdmedia.vo.llnwd.net/o1/Podcasts/*
18:50 🔗 SketchCow Oh, I'm sure it is.
18:50 🔗 SketchCow It's just gone from the core original site, which is what put the fear into the community around it.
19:09 🔗 yipdw recruit NeoGAF for Archive Team
19:22 🔗 SketchCow uploading 2014.01.ftp.peliplaneetta.net.tar: [ ] 14567/573924 - 08:51:23
19:22 🔗 SketchCow fuck yeah 9 hours
19:22 🔗 SketchCow http://teamarchive0.fnf.archive.org:8088/mrtg/networkv2.html
19:22 🔗 SketchCow Watch it devastate that line
19:27 🔗 Nemo_bis I can't even see you on http://s3.us.archive.org:8088/mrtg/networkv2.html :P
20:57 🔗 Nemo_bis bwaaaa I'm exhausting space on another disk :( the other 30 were fine, why so unlucky always grr https://ia600300.us.archive.org/host_stats.php

irclogger-viewer