#archiveteam 2014-06-10,Tue

↑back Search

Time Nickname Message
05:29 🔗 Nemo_bis That's useful information. :)
05:30 🔗 Nemo_bis I think most FTP servers in the world are probably unwatched by now
07:09 🔗 SketchCow http://devslovebacon.com/conferences/bacon-2014/talks/from-colo-to-yolo-confessions-of-the-angriest-archivist
07:09 🔗 SketchCow This one..... this one's pretty out there
07:10 🔗 lukeman sigh: http://www.thejerrysite.com
07:11 🔗 lukeman managed to find the original site still running for now: http://165.225.131.245
07:14 🔗 nico archivebot job launched against
07:17 🔗 lukeman thanks, assuming that's at me. machine froze right after i pasted.
07:31 🔗 godane SketchCow: i can grab the devslovebacon videos from there site using youtube-dl
07:34 🔗 godane i couldn't get the link to vimeo.com to work so i went with your link
07:36 🔗 godane i will see about downloading all of the devslovebacon talks
09:47 🔗 ohhdemgir SN4T14, SketchCow the host pissed it's pants around 80% into the second stage of ftp-nab, I couldn't reach anything on port 21 for the following 24 hours so the output was never processed, it also generated a stupid amount of abuse claims at my other host which resulted in them forwarding 140+ emails to me about it, mostly from edu sites
09:48 🔗 ersi classy
09:50 🔗 schbirid #yolo
09:50 🔗 ohhdemgir SN4T14, no complaints they just (or automatically) stopped us from reaching anything on port 21, letting them know what we were doing first would of been a good idea, generally isn't my style though ..
09:55 🔗 aggrosk I'm 20 minutes into that talk at devslovebacon; lovin' it so far.
10:03 🔗 nico new CCC
10:03 🔗 nico masscan port 21 + massive ftp grab :)
10:04 🔗 midas new list? :D
10:04 🔗 midas i wonder what happens if i run this on one of my hosted boxes
10:08 🔗 Nemo_bis so the script just trashed the output from the first 80 % ?
10:12 🔗 midas just a small tip to not break the internet, limit the bandwidth a tad :p
10:12 🔗 midas ok it might take a bit longer but thats fine with me
10:13 🔗 Nemo_bis yes, make it 10 or 100 times slower than what ohhdemgir did and they won't notice (or 10 times slower from 10 machines?)
10:14 🔗 ohhdemgir Nemo_bis, I dumped it, not sure what it did, I still have the file from the first stage, should run again, I'm shocked no one else has but after running it I can see why.. midas yeahhh, limiting, that's a bad word!!
10:14 🔗 Nemo_bis :D
10:14 🔗 midas im limiting it to 200Mbit now
10:14 🔗 midas 4 hours
10:14 🔗 midas probably longer anyway
10:14 🔗 Nemo_bis After that, IPv6!
10:14 🔗 midas \o/
10:15 🔗 midas great idea Nemo_bis
10:15 🔗 ohhdemgir it wasn't a bandwidth issue, I just think hitting every ftp site in the world ruffled a few feathers
10:15 🔗 midas probably yeah
10:15 🔗 midas the box im abusing isnt a livebox anyway
10:16 🔗 midas oops
10:16 🔗 midas something just broke
10:17 🔗 midas box is offline :X
10:17 🔗 Nemo_bis lol
10:18 🔗 midas that was pritty quick
10:18 🔗 midas IPMI IS DOWN
10:18 🔗 midas ...
10:18 🔗 aggrosk Now you actually have to call somebody!
10:19 🔗 midas remote reboot
10:19 🔗 aggrosk I'll cross my fingers for you
10:19 🔗 midas thanks
10:20 🔗 midas maybe they think zmap is abusive?
10:21 🔗 Nemo_bis what's scans.io https://github.com/zmap/zmap/issues/84
10:21 🔗 Nemo_bis midas: what a weird thought that would be
10:22 🔗 Nemo_bis https://github.com/zmap/zmap/issues/129
10:22 🔗 midas pussies
10:24 🔗 schbirid "stop looking at my house"
10:24 🔗 midas schbirid: you have a box at oneprovider, try zmap :D
10:24 🔗 schbirid :P
10:25 🔗 midas it kinda killed my dual xeon already
10:25 🔗 midas :P
10:25 🔗 Nemo_bis aka https://github.com/zmap/zmap/pull/156 ?
10:28 🔗 aggrosk "On a typical desktop computer with a gigabit Ethernet connection, ZMap is capable scanning the entire public IPv4 address space in under 45 minutes." ... That. Sounds. Fucking. Awesome.
10:28 🔗 schbirid -bs
10:29 🔗 Nemo_bis is the -R really necessary? https://github.com/ArchiveTeam/ftp-nab/blob/master/check-ftp.sh#L8
10:36 🔗 ohhdemgir not really
10:37 🔗 ohhdemgir listing all directories is nice though
10:37 🔗 ohhdemgir midas, like I said, is causes some havok XD
10:48 🔗 nico 12:15 ohhdemgir> it wasn't a bandwidth issue, I just think hitting every ftp site in the world ruffled a few feathers
10:48 🔗 nico that's why i want to do it from CCC's network
10:48 🔗 ohhdemgir yes, that, do that
10:48 🔗 nico last time a few people have done that from 30c3
10:56 🔗 godane i'm grabbing another news paper: http://www.colebrookchronicle.com/
11:04 🔗 midas LOL ohhdemgir
11:04 🔗 midas Hello,
11:04 🔗 midas your server is suspended for flood.
11:04 🔗 midas Do you explain that?
11:04 🔗 midas reply: wait, isnt that allowed?
11:04 🔗 ohhdemgir yiss
11:08 🔗 midas it was rate limited, what a pussy :p
11:08 🔗 ohhdemgir exactly, I tells you XD
11:09 🔗 Cameron_D I should try run masscan from school again, I crashed their firewall last time
11:09 🔗 ohhdemgir heh
11:09 🔗 ohhdemgir SketchCow, I'm enjoying this talk - http://i.imgur.com/pqmawVr.png
11:33 🔗 nico Cameron_D: the event network team i was in crashed the edge firewall off a whole belgian campus
11:34 🔗 nico s/off/of/
11:34 🔗 nico with 100 mbps in a gre tunnel
11:34 🔗 nico next time they will give us the f***** dark fiber we requested every year
11:40 🔗 midas Hello,
11:40 🔗 midas you must not use ZMap.
11:40 🔗 midas well yeah, i understand that. now give me access to my box
12:31 🔗 ersi Is there a channel for the ftp-nab project thing? If not, maybe time to create one :)
12:39 🔗 ohhdemgir it's part of the ftpsite project which I don't think there is a chan for
12:39 🔗 ohhdemgir ersi, make it :)
12:39 🔗 ohhdemgir I've ripping a few sites a day
12:39 🔗 ohhdemgir been*
12:47 🔗 midas create all the chans!
12:48 🔗 midas and let me know where to dump the new list :p
17:59 🔗 SketchCow For the record, STILL downloading the crap from games.mirrors.tds.net/
20:04 🔗 SadDM SketchCow: That was an *epic* twitter smackdown... love it.
20:07 🔗 ohhdemgir ersi, SketchCow channel for ftp site project?
20:13 🔗 SketchCow #effteepee
21:42 🔗 etesp Sorry, new here and don't know where to put this, but I'm quite worried about http://forums.spacebattles.com/
21:43 🔗 etesp there's a large number of fanfics, many of which may not be archived
21:44 🔗 dashcloud that's fine- thanks for telling us about it
21:44 🔗 etesp it's an old forum, but there's a new competitor (http://forums.sufficientvelocity.com/ ) which popped up after some site politics, and most of the staff moved over there
21:45 🔗 etesp apparently it's been in technical decay for some time
21:45 🔗 etesp the forums are still readable, but the forum index is hidden
21:45 🔗 etesp how practical is it to save fairly large forums?
21:47 🔗 etesp should I submit it somewhere in particular?
21:47 🔗 etesp especially concerning is the possibility that one solution to memory issues is pruning the database
21:48 🔗 dashcloud here is fine- someone will look at it and figure out how to archive it
21:48 🔗 etesp I /really hope/ they at least avoid the most valuable forums, but I'd rather not leave it up to chance
21:49 🔗 etesp okay, thank you :)
21:49 🔗 etesp you guys are awesome
21:49 🔗 yipdw that site isn't publicly navigable
21:49 🔗 SN4T14 etesp, it'd be kind of hard to archive that, since you say the index is hidden
21:49 🔗 etesp hang on
21:49 🔗 etesp http://forums.spacebattles.com/forums/vs-debates.4/
21:50 🔗 etesp you can still view individual forums like that
21:50 🔗 SN4T14 Actually, hang on, I have an idea
21:51 🔗 etesp the text before the number is irrelevant, I think, so you should be able to find each forum by testing numbers sequentially?
21:52 🔗 etesp e.g. http://forums.spacebattles.com/forums/testing.5/ http://forums.spacebattles.com/forums/testing.4/
21:52 🔗 SN4T14 Huh, yeah
21:53 🔗 SN4T14 Yeah, it should be pretty easy to archive it, then
21:53 🔗 etesp excellent :D
21:54 🔗 SN4T14 Looks like a pretty big forum, though
21:54 🔗 etesp yea.
21:54 🔗 etesp It's been going for a long time, and pretty high activity
21:55 🔗 etesp if it's going to take some time and your tools are set up for it
21:55 🔗 etesp prioritising the creative writing and roleplay archives would be good
21:57 🔗 etesp the competitor forum, sufficient velocity, apparently has db access, but will only pull over posts if each author gives permission
21:57 🔗 etesp which would miss out a huge amount of content just from people's who're no longer active so can't agree to it
21:58 🔗 SN4T14 Pfft, consent is overrated. :p
21:58 🔗 yipdw as a first step I've thrown the aforementioned links into archivebot
21:59 🔗 etesp shall I grab the creative writing forum for you?
22:03 🔗 etesp http://forums.spacebattles.com/forums/creative-writing.18/ http://forums.spacebattles.com/forums/creative-writing-archive.40/
22:04 🔗 etesp also, unrelated, but I'm an op on wikiapiary and am expecting to have some free time to play around with projects in a couple of months.
22:05 🔗 etesp one of the things I'm considering is making a few more bots and adapting wikiapiary to collect basic info from foums
22:06 🔗 etesp would be a whole lot less fancy than wikiapiary because more diverse software and less good statistics api
22:07 🔗 etesp but i'd like to help you guys have a good auto-updating index of forums
22:15 🔗 DFJustin hmm there used to be big-boards but it seems to be gone
22:15 🔗 DFJustin but there's http://www.thebiggestboards.com/
22:19 🔗 etesp there is that, but the apiary framework is kinda awesome. SMW queries, time series data, presented nicely, other fun things.
22:24 🔗 etesp http://forums.sufficientvelocity.com/threads/sb-server-errors.3321/page-27#post-474520 posted on SV, maybe we'll get a few more people running archive warrior :)
22:27 🔗 yipdw rm
22:27 🔗 yipdw oops
22:28 🔗 SN4T14 Silly yipdw, this isn't your shell. :p
22:28 🔗 yipdw touchpads suck
22:29 🔗 yipdw etesp: the links you added have been added to archivebot's queue
22:29 🔗 yipdw #archivebot for monitoring
22:29 🔗 etesp thank you :)
22:33 🔗 SketchCow https://archive.org/download/Mattel_Intellivision_TOSEC_2012_04_23/Mattel_Intellivision_TOSEC_2012_04_23.zip
22:33 🔗 SketchCow and....
22:33 🔗 SketchCow FINISHED --2014-06-10 21:59:04--
22:34 🔗 SketchCow Downloaded: 11188 files, 684G in 1d 2h 1m 44s (7.48 MB/s)
22:34 🔗 SketchCow Total wall clock time: 1d 2h 18m 43s
22:34 🔗 SketchCow (Planetquake 3)
22:34 🔗 etesp slightly lower priority forums (RP): http://forums.spacebattles.com/forums/a-brob-is-for-you-for-all-your-roleplaying-need.60/ http://forums.spacebattles.com/forums/story-debates-play-by-post-games.10/ http://forums.spacebattles.com/forums/story-debate-archives.15/
22:40 🔗 DFJustin why intellivision
22:41 🔗 SN4T14 All the links on that forum: http://pastebin.com/wdkhuN3Q
22:41 🔗 SN4T14 subforums, not all the links
22:42 🔗 etesp thanks SN4T14 :)
22:43 🔗 SN4T14 Although I only did 1-99, going to check if there are any >99
22:46 🔗 etesp unlikely that there's 35 they deleted/hid in a row
22:46 🔗 SketchCow ha ha
22:46 🔗 SketchCow Intellivision was a mispaste
22:46 🔗 SketchCow I'm on a mac, I'm all in crazy land
23:01 🔗 etesp i'm out for now. thanks for putting SB on the list.

irclogger-viewer