[00:10] *** BlueMaxim has joined #archiveteam-bs
[00:12] *** tomwsmf-a has joined #archiveteam-bs
[00:18] *** anjacks0n has joined #archiveteam-bs
[00:21] *** anjacks0n has quit IRC (Ping timeout: 190 seconds)
[00:29] *** j08nY has quit IRC (Ping timeout: 633 seconds)
[00:31] *** JesseW has joined #archiveteam-bs
[00:57] *** DoomTay has joined #archiveteam-bs
[01:03] *** JesseW has quit IRC (Ping timeout: 370 seconds)
[01:27] *** kristian_ has quit IRC (Leaving)
[01:31] *** Stiletto has quit IRC (Ping timeout: 246 seconds)
[01:57] *** VADemon has quit IRC (Quit: left4dead)
[02:00] *** schbirid has quit IRC (Read error: Connection refused)
[02:02] *** Stiletto has joined #archiveteam-bs
[02:07] someone is starting to upload coursera to kat.cr : https://kat.cr/user/eru_melkor/uploads/
[02:09] I'm getting hammered on every comm channel about coursera
[02:09] i'm seeing people join this channel every three hours and ask about it
[02:11] *** BlueMaxim has quit IRC (Read error: Operation timed out)
[02:13] *** schbirid has joined #archiveteam-bs
[02:53] "mpv is a fork of mplayer2 and MPlayer"
[02:53] goddamn that codebase
[02:54] it gets forked as often as a salad
[03:08] *** Aranje has quit IRC (Quit: Three sheets to the wind)
[03:12] looks like i can log in to coursera using facebook
[04:17] yipdw: mpv also tossed out like half of it. :P
[04:17] pikhq: that explains why it built so fast
[04:17] (... mpv takes the frankly reasonable standpoint that if ffmpeg/libav have good or superior implementations of a codec, it deserves to get removed from mpv.)
[04:25] *** Coderjoe has quit IRC (Read error: Operation timed out)
[04:34] *** Coderjoe has joined #archiveteam-bs
[04:57] *** DoomTay has quit IRC (Quit: Page closed)
[05:08] *** DoomTay has joined #archiveteam-bs
[05:16] pikhq: makes sense to me.
I really only stumbled upon it because my local mplayer2 build didn't handle h265
[05:20] *** BlueMaxim has joined #archiveteam-bs
[05:23] *** anjacks0n has joined #archiveteam-bs
[05:26] *** anjacks0n has quit IRC (Ping timeout: 190 seconds)
[05:39] *** JesseW has joined #archiveteam-bs
[05:58] SketchCow: i'm starting to upload some of the Coursera i have downloaded
[05:58] i'm putting time onto your FOS
[06:07] *** DoomTay has quit IRC (Quit: Page closed)
[06:36] *** JesseW has quit IRC (Ping timeout: 370 seconds)
[06:45] *** anjacks0n has joined #archiveteam-bs
[06:48] *** anjacks0n has quit IRC (Ping timeout: 190 seconds)
[07:05] *** anjacks0n has joined #archiveteam-bs
[07:08] *** Honno_ has joined #archiveteam-bs
[07:33] *** anjacks0n has quit IRC (anjacks0n)
[07:54] *** tomwsmf-a has quit IRC (Read error: Operation timed out)
[08:27] i love mpv
[08:39] *** tomwsmf-a has joined #archiveteam-bs
[08:53] *** tomwsmf-a has quit IRC (Read error: Operation timed out)
[09:56] *** SDragon has joined #archiveteam-bs
[11:00] *** Honno__ has joined #archiveteam-bs
[11:03] *** Honno_ has quit IRC (Ping timeout: 492 seconds)
[11:06] *** metalcamp has joined #archiveteam-bs
[11:36] *** BlueMaxim has quit IRC (Quit: Leaving)
[12:07] *** j08nY has joined #archiveteam-bs
[12:17] *** j08nY has quit IRC (Quit: Leaving)
[12:35] *** j08nY has joined #archiveteam-bs
[12:51] *** metalcamp has quit IRC (Read error: Connection reset by peer)
[13:03] *** dashcloud has quit IRC (Read error: Operation timed out)
[13:06] *** dashcloud has joined #archiveteam-bs
[15:19] *** RichardG has quit IRC (Quit: Keyboard not found, press F1 to continue)
[15:21] *** RichardG has joined #archiveteam-bs
[16:00] *** Legionof7 has joined #archiveteam-bs
[16:00] I think I've seen you on anonops joe
[16:00] Legionof7: was an AnonOps oper for a short while back in... 2010? then acted as a 'backseat oper' on-and-off for a few years(?)
[16:01] ah that's why
[16:01] was also involved in voxanon after anonops went to shit, etc
[16:01] what's voxanon?
[16:01] and ran anonnews.org (which is still due to come back at some point), and so on
[16:01] Ah I see
[16:02] Well, I've been away from Anonymous for a year?
[16:02] 2?
[16:02] Legionof7: spin-off network, trying to avoid the pitfalls of anonops and its crappy 'leadership'
[16:02] eventually went to crap for $longStory reasons
[16:02] and merged back into anonops
[16:02] Legionof7: and well, I haven't really done anything on anonops for a long while now
[16:02] :P
[16:03] yeah, neither have I. No more Anon for me
[16:03] not really worth the servers on which it's hosted anymore, thanks to said leadership
[16:03] mmm
[16:03] don't confuse 'anon' and 'anonops'
[16:05] yeah
[16:06] but i'm involved in a bunch of other things now
[16:06] Bernie campaign, now Jill
[16:06] Liquid Democracy project
[16:06] Ethereum
[16:06] school lol
[16:11] ethereum lol
[16:14] https://blog.archive.org/2016/06/16/geez-now-internet-insurance/
[16:15] "One change we could make would be to send our traffic through CloudFlare, or similar, to filter out unwelcome packets as an “Insurance against Internet attackers.”"
[16:16] "would mean that our traffic would go through someone else’s hosts, which means $, privacy loss, and general loss of the end-to-end Internet."
[16:17] <3
[16:26] *** Coderjoe has quit IRC (Read error: Connection reset by peer)
[16:26] *** Coderjoe has joined #archiveteam-bs
[16:27] *** Legionof7 has quit IRC (Quit: Page closed)
[16:43] *** kristian_ has joined #archiveteam-bs
[16:51] *** mr-b has quit IRC (Ping timeout: 246 seconds)
[16:58] *** mr-b has joined #archiveteam-bs
[17:11] *** VADemon has joined #archiveteam-bs
[17:14] *** JesseW has joined #archiveteam-bs
[17:22] *** Igloo has joined #archiveteam-bs
[17:27] *** anjacks0n has joined #archiveteam-bs
[17:28] *** anjacks0n has quit IRC (Client Quit)
[17:29] *** DoomTay has joined #archiveteam-bs
[17:45] *** JesseW has quit IRC (Ping timeout: 370 seconds)
[17:47] *** dashcloud has quit IRC (Ping timeout: 244 seconds)
[17:47] *** dashcloud has joined #archiveteam-bs
[18:23] *** Honno__ has quit IRC (Ping timeout: 492 seconds)
[18:40] *** JesseW has joined #archiveteam-bs
[18:56] yeah, that IA blog post
[18:56] I don't really know how they do it, I largely gave up trying to run any Internet infrastructure at any scale
[18:56] it's impossible if you want to do other things, like breathing
[18:57] too many assholes with too many weapons
[18:57] the fact that some jerk hasn't DoSed archivebot is kind of amazing to me; it has basically no defense against that whatsoever (although I guess sometimes it DoSes itself ohohoho)
[18:58] yeah, the cost of attack vs cost of defence ratio is way way WAY unbalanced
[18:58] *** Atluxity sets mode: +oo yipdw schbirid
[18:59] oh yeah
[18:59] Not a lot you can do at huge scale
[18:59] Except hide as much as you possibly can
[18:59] and the defense cost is an ongoing one
[19:00] I can't help but wonder why this problem isn't more widespread and known about
[19:01] It's known -- I suspect it's one of the motives behind the [tinfoil hat on] effort to lock down the 'net [tinfoil hat off].
[19:02] maybe IA will turn to some distributed thing
[19:02] for the public items
[19:03] They are certainly actively interested and working on such efforts, yeah
[19:09] *** ris has joined #archiveteam-bs
[19:11] I'm sure that is part of the plan for Wayback Machine 2.0
[19:12] ris: mapillary currently claims to have 66,859,731 photos, and the downloads seem to be 2048x1536px, or about 300 KB per photo
[19:12] :)
[19:12] Which would give a total of about 20TB
[19:13] which isn't bad *at all*
[19:13] manageable
[19:13] Wat?
[19:13] I think the only way to do it if they want to run their own systems is to deploy something like OVH have done for their estate
[19:13] managable
[19:13] Storage space must be expensive though
[19:13] either way it's spelt
[19:14] the reason i thought of it is lots of openstreetmappers use it as their "source of truth" for mapping
[19:15] and it's nice to think that you can rely on it being there "forever"
[19:15] 20TB is big enough that it'd probably be courteous to confirm that IA is willing/interested in hosting a copy first. I expect they probably *are*, but it's good to check.
[19:17] mapillary seems to be hosting the images on https://d1cuyjsrcm0gby.cloudfront.net/8jzhTr1gPjvx9nuvCZkXSA/thumb-2048.jpg
[19:17] IDK what cloudfront's charges are for mass-exports.
[19:18] do we need to get mapillary?
[19:19] *** anjacks0n has joined #archiveteam-bs
[19:20] arkiver: I don't think there's any active hurry, no.
[19:20] This is more of a general thinking-out process.
[19:20] *** Aranje has joined #archiveteam-bs
[19:21] just ask them
[19:21] they are highly likely to be very supportive
[19:21] i asked them in the past
[19:21] *hopefully* we can negotiate a regular mail delivery of a hard drive with images on it from mapillary to IA.
[19:21] schbirid: what did they say when you asked them?
[19:22] it was on twitter iirc and impossible to find, i think it was "contact us" which i then forgot about
[19:22] they are super pro open and free
[19:22] *** bwn has quit IRC (Ping timeout: 244 seconds)
[19:23] it would be a kick in the nuts if people started scraping them like mad even if well-intended
[19:23] yeah, scraping seems like a bad idea at this point, indeed
[19:24] Who, IA or mapillary?
[19:25] DoomTay: mapillary
[19:30] *** anjacks0n has quit IRC (anjacks0n)
[19:32] *** bwn has joined #archiveteam-bs
[19:34] yeah, i don't know of any specific threat to mapillary, just precaution
[19:37] ris: If you want, it'd probably be good to make an entry for Mapillary on the archiveteam wiki, just to provide a central place to record the status of efforts
[19:37] that's an idea
[19:37] the secret word is yahoosucks
[19:37] ok
[19:38] I was just about to ask that
[19:38] I am about to get together a page for Coursera
[19:39] Wait, if the channels are logged, then wouldn't one be able to get the secret word from there?
[19:39] Unless it changes?
[19:40] yep, one can very easily get the secret word from the logs :-)
[19:40] it's not that secret
[19:40] *** tomwsmf-a has joined #archiveteam-bs
[19:42] if a spammer bothers to write a bot (or hire a human) persistent enough to do a websearch for the wording of the "what's the secret word" question, then, well -- they are welcome to do so (we'll still revert the spamming promptly, though)
[19:42] it's still quite a bit of a speedbump for your average wikispammer, at least for now
[19:48] *** DoomTay has quit IRC (Quit: Page closed)
[19:54] *** kristian_ has quit IRC (Leaving)
[19:57] *** DoomTay has joined #archiveteam-bs
[20:04] JesseW: a start http://www.archiveteam.org/index.php?title=Mapillary
[20:06] *** metalcamp has joined #archiveteam-bs
[20:09] ris, DoomTay: looks good
[20:13] Now that I think about it, I might need clarification on project statuses.
I'm starting to think Coursera would be a "special case" rather than closing
[20:15] And I presume a "lost" site means it wasn't saved at all?
[20:21] yeah, special case seems good
[20:23] I noticed that a lot of project channels have "witty" names. I wonder what Coursera's would be if it came to that
[20:35] *** whydomain has joined #archiveteam-bs
[20:36] What's the best way to batch rotate a large number of TIFF images without losing quality?
[20:36] *** JesseW has quit IRC (Read error: Operation timed out)
[20:36] (Majority of online answers suggest ImageMagick, but that is lossy)
[20:42] whydomain: is there a format it'll rotate non lossy? if so, convert to that format, run it, convert back to tiff.
[20:45] The warchive chunker is doing something interesting. 20% CPU util, <10% disk util. I wonder where the heck it is spending time...
[20:46] It's ping-pong-ish between cores. What the heck.
[20:47] *** JesseW has joined #archiveteam-bs
[20:47] *** DoomTay has quit IRC (Quit: Page closed)
[20:49] @Smiley: I don't think so. (It's for magazines I've been scanning to upload to IA).
[20:50] In fact, ImageMagick doesn't even rotate properly, there are errors in the image. (Maybe the ~300mb image at 1200dpi is too much for it?)
[20:52] So pinning the chunker to one core made it more than twice as quick. Why is a problem for next week's zino. Today's zino is going to grab food.
[20:52] http://sylvana.net/jpegcrop/jpegtran/
[20:52] Maybe this :/
[20:53] I read some of the 'why we can't rotate jpeg losslessly' on the IM forums and got confused.
[20:56] I'm guessing that's only for JPEG and not TIFF.
[20:57] convert to jpeg, do translation, convert back
[20:58] tbh i'd expect most translations to be converting to bmp or something simple anyways
[20:58] maybe that's why it errors, OOM?
[21:08] *** JesseW has quit IRC (Ping timeout: 370 seconds)
[21:10] Hmm, I thought this would be simpler. Converting to JPEG (lossy) will lose quality.
[21:11] I've just tried rotation in LXDE 'Image Viewer'. Before translation: ~240mb, after translation: ~550mb
[21:24] *** Rye has quit IRC (Quit: ZNC - http://znc.in)
[21:25] *** metalcamp has quit IRC (Ping timeout: 244 seconds)
[21:25] *** DoomTay has joined #archiveteam-bs
[21:27] *** Rye has joined #archiveteam-bs
[21:30] here is my incomplete list of coursera class videos : http://pastebin.com/MBf87bJi
[21:30] i have found the names but i figure someone here could download it
[21:31] i only uploaded audio-002, datasci-001 and mythology-002 to FOS
[21:41] godane, I'm working on a small project to grab it all
[21:41] cool
[21:41] right now i don't have space
[21:41] graphicsmagick is a nice im fork
[21:42] but there is another, can't remember the name right now
[21:42] also, some courses, like nlp are not available
[21:42] or cannot be signed up for easily
[21:42] the drive i do have space is NTFS i'm trying to slowly upload and reformat to ext4
[21:43] *and to
[21:43] NTFS-3G froze my system twice for some reason
[21:44] VIPS / nip2 it is http://www.vips.ecs.soton.ac.uk/index.php?title=VIPS
[21:44] whydomain: ^
[21:45] but maybe you could use some tif tags for rotation?
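The "tif tags for rotation" idea can be sketched roughly as follows. This is a minimal illustration, not something anyone in the channel ran: it assumes classic (non-BigTIFF) files whose first image directory already carries the standard Orientation tag (274), and the function name `set_tiff_orientation` is made up for the example. It only overwrites the two-byte tag value and leaves the pixel data untouched, so nothing is re-encoded. Whether a given viewer or the IA pipeline honours the tag is a separate question that would need checking.

```python
import struct

ORIENTATION_TAG = 274  # "Orientation" in the TIFF 6.0 baseline tag set
TYPE_SHORT = 3         # 16-bit unsigned integer field type


def set_tiff_orientation(data: bytes, orientation: int) -> bytes:
    """Return a copy of a classic TIFF with the first IFD's Orientation changed.

    Hypothetical helper for illustration. Orientation 1 is the normal layout;
    6 asks a viewer to rotate 90 degrees clockwise, 3 by 180 degrees, and
    8 by 90 degrees counter-clockwise.
    """
    if not 1 <= orientation <= 8:
        raise ValueError("orientation must be between 1 and 8")

    # The first two bytes give the byte order used by the rest of the file.
    if data[:2] == b"II":
        endian = "<"
    elif data[:2] == b"MM":
        endian = ">"
    else:
        raise ValueError("not a TIFF file")

    magic, ifd_offset = struct.unpack_from(endian + "HI", data, 2)
    if magic != 42:
        raise ValueError("only classic TIFF (magic 42) is handled here")

    # An IFD is a 2-byte entry count followed by 12-byte entries:
    # tag (2), type (2), count (4), value-or-offset (4).
    (entry_count,) = struct.unpack_from(endian + "H", data, ifd_offset)
    patched = bytearray(data)
    for i in range(entry_count):
        pos = ifd_offset + 2 + 12 * i
        tag, field_type, count = struct.unpack_from(endian + "HHI", data, pos)
        if tag == ORIENTATION_TAG:
            if field_type != TYPE_SHORT or count != 1:
                raise ValueError("unexpected Orientation encoding")
            # A single SHORT is stored left-justified in the value field.
            struct.pack_into(endian + "H", patched, pos + 8, orientation)
            return bytes(patched)

    # Adding a missing tag means rewriting the IFD, which this sketch skips.
    raise ValueError("no Orientation tag in the first IFD")
```

For a batch of scans, something like `Path(p).write_bytes(set_tiff_orientation(Path(p).read_bytes(), 6))` over each file would be the obvious loop, ideally on copies first.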
[21:55] i'm uploading 7 missing g4tv video in the 42xxx area
[21:55] *videos
[21:56] also jezebel.com is mostly done
[21:56] i'm uploading 2015 web archives right now
[21:56] i'm also downloading first 5 months of 2016
[22:00] i'm also uploading io9.gizmodo.com for 2015
[22:04] *** tomwsmf-a has quit IRC (Read error: Connection reset by peer)
[22:11] *** DoomTay has quit IRC (Quit: Page closed)
[22:18] *** DoomTay has joined #archiveteam-bs
[22:21] *** tomwsmf-a has joined #archiveteam-bs
[22:24] *** dashcloud has quit IRC (Remote host closed the connection)
[22:34] *** dashcloud has joined #archiveteam-bs
[22:48] *** ohhdemgir has joined #archiveteam-bs
[23:17] *** mutoso_ has joined #archiveteam-bs
[23:19] *** mutoso has quit IRC (Read error: Operation timed out)
[23:47] *** Whopper has joined #archiveteam-bs
[23:50] *** ohhdemgir has quit IRC (Read error: Operation timed out)
[23:58] *** BlueMaxim has joined #archiveteam-bs
[23:59] *** ohhdemgir has joined #archiveteam-bs