#archiveteam-bs 2015-07-18,Sat


Time Nickname Message
00:40 🔗 Jonimus has quit IRC (ircd.shaw.ca irc.shaw.ca)
00:40 🔗 rduser has quit IRC (ircd.shaw.ca irc.shaw.ca)
00:52 🔗 mistym has quit IRC (Remote host closed the connection)
00:57 🔗 oldcad has quit IRC (Quit: Leaving.)
01:00 🔗 Asparagir has quit IRC (Asparagir)
01:02 🔗 Asparagir has joined #archiveteam-bs
01:09 🔗 rduser has joined #archiveteam-bs
01:26 🔗 pikhq has joined #archiveteam-bs
01:33 🔗 Asparagir has quit IRC (Asparagir)
01:35 🔗 rduser has quit IRC (ircd.shaw.ca irc.shaw.ca)
01:59 🔗 xtr-201 has quit IRC (Read error: Operation timed out)
02:04 🔗 rduser has joined #archiveteam-bs
02:07 🔗 schbirid2 has joined #archiveteam-bs
02:08 🔗 Stiletto has joined #archiveteam-bs
02:08 🔗 rduser has quit IRC (ircd.shaw.ca irc.shaw.ca)
02:08 🔗 schbirid has quit IRC (Read error: Operation timed out)
02:36 🔗 rduser has joined #archiveteam-bs
02:38 🔗 kyan has joined #archiveteam-bs
02:44 🔗 mistym has joined #archiveteam-bs
02:53 🔗 kniffy has quit IRC (Ping timeout: 252 seconds)
02:54 🔗 ripvanwin has quit IRC (Read error: Operation timed out)
02:56 🔗 dashcloud has quit IRC (Read error: Operation timed out)
03:02 🔗 dashcloud has joined #archiveteam-bs
03:07 🔗 primus104 has quit IRC (Leaving.)
03:34 🔗 xf2e has quit IRC (Remote host closed the connection)
03:59 🔗 xtr-201 has joined #archiveteam-bs
04:23 🔗 aaaaaaaaa has quit IRC (Leaving)
04:44 🔗 ripvanwin has joined #archiveteam-bs
04:47 🔗 mistym has quit IRC (Remote host closed the connection)
04:54 🔗 john1 has joined #archiveteam-bs
04:55 🔗 john1 I'd like to be able to scrape youtube links off of a web page. Does a script for this already exist, that you know of?
05:04 🔗 vitzli has joined #archiveteam-bs
05:10 🔗 mistym has joined #archiveteam-bs
05:18 🔗 dashcloud has quit IRC (Read error: Operation timed out)
05:19 🔗 rduser has quit IRC (ircd.shaw.ca irc.shaw.ca)
05:23 🔗 dashcloud has joined #archiveteam-bs
05:27 🔗 rduser` has joined #archiveteam-bs
05:35 🔗 rduser` is now known as rduser
06:03 🔗 dashcloud has quit IRC (Read error: Connection reset by peer)
06:05 🔗 john1 For large amounts of storage, do you recommend cheaper drives with more duplication and maintenance, or more reliable ones?
06:11 🔗 dashcloud has joined #archiveteam-bs
06:13 🔗 BlueMaxim has quit IRC (Ping timeout: 306 seconds)
06:14 🔗 BlueMaxim has joined #archiveteam-bs
06:24 🔗 superkuh_ has quit IRC (Quit: the neuronal action potential is an electrical manipulation of reversible abrupt phase changes in the lipid bilaye)
07:18 🔗 schbirid2 http://www.the-master-list.com/
07:23 🔗 john1 schbirid2: Is that from the nineties? Has it been updated recently?
07:23 🔗 schbirid2 updated 2014 :D
07:23 🔗 john1 Nice.
07:23 🔗 john1 You want it archived?
07:23 🔗 BlueMaxim has quit IRC (Quit: Leaving)
07:27 🔗 schbirid2 dont care, just thought it was funny
07:31 🔗 john1 By the way, you know there's a modern geocities clone?
07:32 🔗 john1 https://neocities.org/browse
07:33 🔗 xmc yeah
07:36 🔗 john1 I'm not sure the old geocities had bandwidth limits though. :/
07:39 🔗 xmc oh it did
07:40 🔗 john1 All right.
07:41 🔗 john1 Is the Archive Team prepared to face legal challenges? Have they done so before?
07:46 🔗 schbirid2 that can mean many things, no one could answer that
07:47 🔗 john1 On issues of intellectual property, to be more specific.
07:47 🔗 xmc what is your threat model
07:49 🔗 john1 xmc: Basically, copyright trolls and assholes. For example, one might claim that their website is their intellectual property, and that you have no right to redistribute it. Though, I do believe this argument is no longer up for judicial debate.
07:49 🔗 xmc it's never come up and i doubt it will
07:50 🔗 john1 I'm also wondering how DMCA requests will be handled.
07:50 🔗 schbirid2 archive.org's job, not ours
07:50 🔗 schbirid2 we dont care about copyright etc
07:51 🔗 john1 All right.
07:52 🔗 schbirid2 be aware that if you upload to IA, your mail address is available for the world to see though
07:53 🔗 john1 An email address doesn't necessarily identify a person though.
07:54 🔗 john1 Of course, not using an email address you read may also deprive you of chances to defend yourself.
07:54 🔗 john1 But I really don't care that much.
07:55 🔗 john1 I'm a copyright antagonist myself, but I ask these questions because I don't want to give pirated data to organizations that avoid it.
08:00 🔗 kniffy has joined #archiveteam-bs
08:01 🔗 schbirid2 look at recent uploads at IA 8)
08:02 🔗 john1 Yeah, I know.
08:02 🔗 mistym has quit IRC (Remote host closed the connection)
08:03 🔗 john1 The best part is, when they receive a request, they don't even delete it. They just store it indefinitely, which I believe they are allowed to do due to their legal library status.
08:03 🔗 john1 Though, I am curious how that stuff doesn't get lost, but that's a question for them I suppose.
08:05 🔗 mistym has joined #archiveteam-bs
08:06 🔗 arkiver What do you want to upload?
08:06 🔗 john1 Nothing right now. Just thinking ahead of myself.
08:09 🔗 john1 Though, I wouldn't upload anything there specifically for piracy.
08:10 🔗 mistym has quit IRC (Remote host closed the connection)
08:45 🔗 ersi Hehehe, Legal challenges.
08:45 🔗 primus104 has joined #archiveteam-bs
08:48 🔗 BlueMaxim has joined #archiveteam-bs
08:49 🔗 dashcloud has quit IRC (Read error: Operation timed out)
09:03 🔗 dashcloud has joined #archiveteam-bs
09:06 🔗 john1 Anyone else facing a similar problem? https://pastee.org/5uxhq
09:09 🔗 primus104 has quit IRC (Leaving.)
09:10 🔗 mistym has joined #archiveteam-bs
09:10 🔗 ohhdemgir has quit IRC (Read error: Operation timed out)
09:25 🔗 mistym has quit IRC (Read error: Operation timed out)
09:37 🔗 primus104 has joined #archiveteam-bs
09:37 🔗 dashcloud has quit IRC (Read error: Operation timed out)
09:40 🔗 dashcloud has joined #archiveteam-bs
09:59 🔗 Sanqui john1: looks like wpull is python2, so use pip2 :P
10:00 🔗 Sanqui hm, never mind
10:00 🔗 Sanqui weird exception then lol
10:00 🔗 Sanqui (wpull is python 3, ignore what I had said)
10:06 🔗 ohhdemgir has joined #archiveteam-bs
10:21 🔗 superkuh has joined #archiveteam-bs
10:50 🔗 BlueMaxim has quit IRC (Quit: Leaving)
11:02 🔗 godane can anyone download this: https://drive.google.com/file/d/0BxrjMy713etLcmhrWU1MVGNOT3M/view?pli=1
11:03 🔗 godane its a 1994 film called Otaku and looks like its hard to find
11:03 🔗 godane download button doesn't work for me
11:06 🔗 oldcad has joined #archiveteam-bs
11:09 🔗 schbirid2 "Not Found
11:09 🔗 schbirid2 Error 404" for me
11:09 🔗 arkiver for me too
11:09 🔗 godane i figured it out
11:09 🔗 arkiver yeah, I guess just downloading the converted video
11:10 🔗 arkiver or did you find a working link to the wmv?
11:10 🔗 godane its just the converted webm
11:12 🔗 godane i got the link from here: http://www.reddit.com/r/Documentaries/comments/3dlcrt/otaku_1994_this_classic_documentary_focuses_on_a/
11:13 🔗 dashcloud has quit IRC (Read error: Operation timed out)
11:13 🔗 schbirid2 godane: are you into torrents? it is available at a tracker i am member of
11:13 🔗 schbirid2 i mean generally, if you would use it well, i could invite you
11:14 🔗 godane ok
11:14 🔗 godane which torrent tracker?
11:14 🔗 schbirid2 pm me your mail address if you want
11:14 🔗 schbirid2 cinemageddon
11:14 🔗 godane YES
11:14 🔗 schbirid2 :D
11:16 🔗 dashcloud has joined #archiveteam-bs
11:33 🔗 Coderjoe_ is now known as Coderjoe
11:42 🔗 dashcloud has quit IRC (Read error: Operation timed out)
11:46 🔗 dashcloud has joined #archiveteam-bs
11:53 🔗 garyrh heh https://github.com/avinassh/rockstar
12:00 🔗 ivan` has joined #archiveteam-bs
12:00 🔗 ivan` john1: old Python 3, perhaps?
12:10 🔗 godane so i'm looking at cinemageddon forums
12:11 🔗 godane it's a lot easier to grab than underground-gamer
12:11 🔗 godane that's because all pages of a topic are on the index pages
12:15 🔗 primus104 has quit IRC (Leaving.)
12:15 🔗 schbirid2 oi, dont get me banned
12:15 🔗 schbirid2 that was not the intention
12:15 🔗 schbirid2 not to mention that those forums are private
12:17 🔗 godane ok
12:42 🔗 szalwia what's the status of imageshack?
12:42 🔗 szalwia it seems they deleted a lot of old images and relaunched as a mobile app?
12:42 🔗 szalwia did we ever grab anything from them?
13:02 🔗 Muad-Dib has quit IRC (Ping timeout: 252 seconds)
13:14 🔗 mistym has joined #archiveteam-bs
13:22 🔗 mistym has quit IRC (Ping timeout: 492 seconds)
14:22 🔗 kyan has quit IRC (Quit: Leaving)
14:28 🔗 vitzli has quit IRC (Quit: Leaving)
14:43 🔗 Stiletto has quit IRC ()
14:44 🔗 zhongfu godane: i managed to get only the reencoded mp4 for streaming, 200MB
14:44 🔗 zhongfu I did find a torrent for that though, I'll go see if it's similar
14:53 🔗 godane i got the torrent
14:56 🔗 zhongfu ah alright
15:04 🔗 Stiletto has joined #archiveteam-bs
15:09 🔗 primus104 has joined #archiveteam-bs
15:10 🔗 Stiletto has quit IRC ()
15:26 🔗 Stiletto has joined #archiveteam-bs
15:35 🔗 Coderjoe has quit IRC (Ping timeout: 186 seconds)
15:58 🔗 john1 Sanqui: No, the setup.py told me to use python3.
15:58 🔗 john1 ivan`: Python 3.4 is still the current version, isn't it?
16:19 🔗 Coderjoe has joined #archiveteam-bs
16:45 🔗 godane i'm starting to upload these: https://archive.org/details/koreanet-2_cheongju_tvpro_chung-20030107
16:46 🔗 godane the tvpro part will be dropped from 2009 on
16:56 🔗 aaaaaaaaa has joined #archiveteam-bs
16:56 🔗 swebb sets mode: +o aaaaaaaaa
17:00 🔗 mistym has joined #archiveteam-bs
17:35 🔗 dashcloud has quit IRC (Read error: Operation timed out)
17:39 🔗 dashcloud has joined #archiveteam-bs
17:40 🔗 Asparagir has joined #archiveteam-bs
17:48 🔗 dashcloud has quit IRC (Read error: Operation timed out)
17:51 🔗 dashcloud has joined #archiveteam-bs
17:55 🔗 godane has quit IRC (Ping timeout: 252 seconds)
18:10 🔗 godane has joined #archiveteam-bs
18:17 🔗 dashcloud has quit IRC (Read error: Operation timed out)
18:21 🔗 dashcloud has joined #archiveteam-bs
18:27 🔗 primus104 has quit IRC (Leaving.)
18:35 🔗 godane has quit IRC (Quit: Leaving.)
18:37 🔗 godane has joined #archiveteam-bs
19:00 🔗 dashcloud has quit IRC (Read error: Operation timed out)
19:11 🔗 dashcloud has joined #archiveteam-bs
19:34 🔗 dashcloud has quit IRC (Read error: Operation timed out)
19:37 🔗 dashcloud has joined #archiveteam-bs
19:51 🔗 Kazzy has quit IRC (Quit: ZNC - http://znc.in)
19:51 🔗 Kazzy has joined #archiveteam-bs
20:00 🔗 Kazzy has quit IRC (Quit: ZNC - http://znc.in)
20:03 🔗 Kazzy has joined #archiveteam-bs
20:32 🔗 HCross has quit IRC (Ping timeout: 265 seconds)
20:35 🔗 wp494 has quit IRC (Read error: Connection reset by peer)
20:37 🔗 wp494 has joined #archiveteam-bs
20:37 🔗 wp494 has quit IRC (Excess Flood)
20:37 🔗 HCross has joined #archiveteam-bs
20:37 🔗 wp494 has joined #archiveteam-bs
21:01 🔗 ivan` john1: it is, I guess it's not that
21:26 🔗 dashcloud has quit IRC (Read error: Operation timed out)
21:29 🔗 dashcloud has joined #archiveteam-bs
21:53 🔗 Ravenloft has joined #archiveteam-bs
22:07 🔗 DopefishJ has joined #archiveteam-bs
22:07 🔗 swebb sets mode: +o DopefishJ
22:08 🔗 DFJustin has quit IRC (Read error: Operation timed out)
22:10 🔗 primus104 has joined #archiveteam-bs
22:14 🔗 espes___ has quit IRC (Ping timeout: 240 seconds)
22:23 🔗 Asparagir has quit IRC (Asparagir)
22:40 🔗 espes__ has joined #archiveteam-bs
22:46 🔗 chazchaz_ has quit IRC (Remote host closed the connection)
22:49 🔗 chazchaz_ has joined #archiveteam-bs
22:59 🔗 joepie91 well, I guess sourceforge may be really dead...
23:01 🔗 aaaaaaaaa They posted https://twitter.com/sourceforge/status/622237830186577920 a while ago
23:02 🔗 aaaaaaaaa .title https://twitter.com/sourceforge/status/622237830186577920
23:02 🔗 botpie91 aaaaaaaaa: sourceforge on Twitter: "#SourceForge directory, download and project summary pages are back online; dev services (SCM, uploads, ML's, project web) pending restoral"
23:02 🔗 joepie91 aaaaaaaaa: yes, I know
23:02 🔗 joepie91 aaaaaaaaa: tbh, it feels like they're on borrowed time right now
23:02 🔗 joepie91 what is the status of SF archival?
23:03 🔗 phiren paused
23:03 🔗 joepie91 phiren: any particular reason?
23:03 🔗 aaaaaaaaa others might not. But I like how everything they list as back is available from the mirrors.
23:04 🔗 phiren the archival efforts started and a sourceforge admin showed up a few hours later and said "woah, you aren't following robots.txt"
23:04 🔗 aaaaaaaaa They noticed the effort, panicked, stopped the party and started banning the user agent and rsync boxes
23:05 🔗 aaaaaaaaa so SketchCow was trying to negotiate something with them.
23:06 🔗 aaaaaaaaa arkiver probably has the best picture
23:06 🔗 phiren there was a great post on reddit
23:06 🔗 phiren https://www.reddit.com/r/sysadmin/comments/3do9k0/sourceforge_is_down_due_to_storage_problems_no_eta/ct77o49
23:06 🔗 arkiver I'll write a mail to them tomorrow
23:06 🔗 arkiver will let you all check it first
23:06 🔗 arkiver SketchCow 6
23:06 🔗 arkiver ^*
23:07 🔗 phiren so the binaries aren't lost, I guess that's something
23:08 🔗 arkiver I think they'll all be restored
23:10 🔗 aaaaaaaaa may have to overnight a box, and then figure out how the hell to reverse engineer their kludge.
23:12 🔗 joepie91 I'd like an update on the SF negotiations
23:12 🔗 joepie91 because this storage issue is a giant red flag to me
23:12 🔗 joepie91 and I feel like this is going to be the next geocities, but without advance notice
23:12 🔗 arkiver joepie91: there's not much of an update yet
23:12 🔗 joepie91 it all smells of neglect
23:13 🔗 arkiver they're being cooperative
23:13 🔗 joepie91 arkiver: but the project has paused
23:13 🔗 arkiver but they don't fully understand what we exactly want, so we need to explain that better
23:13 🔗 joepie91 arkiver: okay, what is unclear about their understanding?
23:13 🔗 joepie91 in*
23:14 🔗 arkiver They asked us if we want to be a mirror.
23:15 🔗 arkiver And they want to talk with us "to ensure you can obtain a copy of Open Source content we host in a manner that uses a fair share of our delivery capacity and without impact to our other community members"
23:16 🔗 arkiver We do not want mirrors, we want a full web grab
23:16 🔗 joepie91 arkiver: right. yeah
23:16 🔗 arkiver Now we need to explain that well in a mail
23:16 🔗 joepie91 arkiver: it would perhaps be useful to point them at the wayback machine as an example of what you want
23:16 🔗 arkiver yeah
23:17 🔗 joepie91 or say 'static dump' or whatever
23:17 🔗 phiren there is no point trying to negotiate right now, they will be running around with their hair on fire
23:17 🔗 arkiver Though, I think it'll be hard for them to really understand why we want a webgrab and not just all files as a mirror
23:17 🔗 joepie91 arkiver: let me know if you need my help explaining that
23:18 🔗 * joepie91 puts on "talking to non-knowledgeable people" hat
23:18 🔗 aaaaaaaaa heh, depending on what you want, it's the best time
23:18 🔗 phiren though, we don't just want a webgrab
23:18 🔗 phiren we want all the SCM data as well
23:18 🔗 aaaaaaaaa also, don't forget the code
23:18 🔗 arkiver joepie91: I might need that, are you available tomorrow?
23:18 🔗 joepie91 arkiver: it's slightly disturbing to me that they don't understand the requirements, though - makes me feel you're not talking to a technical person
23:18 🔗 joepie91 arkiver: depends on the time
23:19 🔗 arkiver joepie91: ok, I'll just ping you
23:19 🔗 joepie91 arkiver: if you give me an approx time (in NL timezone), I can try to schedule it in with sleep and such
23:19 🔗 arkiver Mail won't be sent before you have read it
23:19 🔗 joepie91 :p
23:19 🔗 arkiver I'm sorry, I can't give some exact time
23:20 🔗 joepie91 arkiver: just ballpark. my normal sleep pattern is to go to sleep around 07:00 NL time, and wake up 7:30 - 8:00 hours later
23:20 🔗 joepie91 at this point
23:20 🔗 joepie91 but I can adjust that if need be
23:21 🔗 joepie91 so I'd be back around 16:00 or so
23:21 🔗 arkiver that's ok
23:21 🔗 joepie91 alright
23:22 🔗 aaaaaaaaa they are not technical, the contact person is in pr
23:22 🔗 joepie91 arkiver: just highlight me here in channel then, I'll probably miss a PM if I'm not actively looking for it (because my client doesn't reliably show new PM tabs to me)
23:22 🔗 joepie91 I'll make sure this tab is in view :P
23:22 🔗 arkiver OK, I'll do that
23:23 🔗 arkiver I did just send a PM, you got it?
23:23 🔗 joepie91 arkiver: yeah, but the tab is somewhere three kilometers off screen :P
23:23 🔗 arkiver haha
23:42 🔗 dashcloud has quit IRC (Read error: Operation timed out)
23:47 🔗 dashcloud has joined #archiveteam-bs
