[00:20] http://news.softpedia.com/news/popular-wordpress-plugin-comes-with-a-backdoor-steals-site-admin-credentials-501383.shtml
[00:20] if you run a wordpress site and use CCTM, and have updated it >= 0.9.8.8, you are now backdoored and all passwords and credentials are leaked
[00:22] Hmm... apparently the IA CLI tool doesn't support files larger than 2 GB
[00:23] http://pastebin.com/pKMgb6XD
[00:23] Yep, no good for WARCs.
[00:23] Or for large files
[00:24] It also doesn't handle Unicode at all either
[00:24] I just ended up writing my own ghetto script to do it using CURL
[00:24] At least when using the bulk upload method
[00:24] Good news is only one of those files is > 2 GB so I can do that one manually
[01:21] Just for the record, I upload 40-80gb files every day. By the truckload.
[01:22] That tool handles it just fine.
[01:22] Maybe you're the tool
[01:22] First culprit: Installation of Python that was made for an Apple II
[01:22] Second culprit: Filesystem has goofiness (unlikely)
[01:22] yeah I was gonna say, I've ia uploaded warrior projects before
[01:23] Third culprit: That error isn't related to filesize, and is misleading in some way. I have pinged Jake about this but it's friggin' saturday night and he's a young man with a life.
[01:23] Me, this is my life
[01:24] stuck in an IRC channel with a bunch of maniacs
[01:24] I spent 20 minutes writing something to remix metadata from a bunch of classic rock cds so that you get the full descriptions and names of all the tracks as searchable index, so what do I know from Saturday nights
[01:24] Oh, I was out in the fresh air in my cube setting up stuff.
[01:24] I'm doing huzzah
[01:24] https://github.com/jjjake/internetarchive/issues/57
[01:24] I guess you're on 64bit systems
[01:25] oh yeah
[01:25] which is all well and good unless you're not :/
[01:25] Well yes, my big boy hair came in
[01:25] And 2006 came and went
[01:25] Anyway, glad it's a noted issue, and it's not some new problem.
[01:26] Ha ha, that issue is posted by two other archive team members
[01:26] This is why Jeff, when something fucko happens in the queue or the system, he goes "....one of yours?"
[01:26] And the answer is 99.95% "yeah"
[01:26] heh
[01:26] * JesseW bows
[01:26] does he talk to you like someone showing what damage their kids did
[01:27] I mean part of that, is if anyone shows any ambition in uploading more than a handful of objects/items, we end up absorbing them. So it's a bit of a cheat.
[01:27] Yes, he does
[01:27] Takes a real "one of your moonbats got out of the belfry"
[01:27] It's mostly because otherwise he goes through a whole procedure, and dumping it on me means he can get back to spam and dizzery
[01:28] Like right now he's uploading the contents of a 6tb drive we've had for 3 years.
[01:28] It's going right into restricted access, but it's off a single drive that's been on his desk for 3 years, so in the realm of the world, it's good.
[01:28] nice
[01:28] (It has 80% of all mainstream comics published since 1943)
[01:28] Which is a thing
[01:29] I'm half thinking we upload it, then jettison it to one of you maniacs to make a torrent and ruin the world
[01:29] Grant Morrison will hunt you down with a peeling knife
[01:30] that sounds like it could be a comic storyline
[01:38] That's deeply awesome.
[02:36] how can i edit a collection on the internet archive or add files to it like: https://archive.org/details/chaosradio_express&tab=about
[02:38] SketchCow: ^ the information on this collection is wrong as this describes chaosradio and not chaosradio express
[02:39] and the language is german not english
[03:11] *** vitzli has joined #archiveteam-bs
[03:14] SketchCow: Here is the error I got trying to upload the 2.1 GB file: http://pastebin.com/pKMgb6XD
[03:14] This system is running Python 2.7.9 which is the latest Python 2 version
[03:15] Ah yes, that linked GitHub issue is exactly the problem
[03:15] (Though for some reason my stack trace is different, and probably wrong)
[03:30] *** bwn has quit IRC (Quit: Leaving)
[03:42] *** tomwsmf-a has joined #archiveteam-bs
[04:24] *** dashcloud has quit IRC (Read error: Operation timed out)
[04:29] *** dashcloud has joined #archiveteam-bs
[05:13] *** guest has joined #archiveteam-bs
[05:13] *** guest has left
[05:15] *** Sk1d has quit IRC (Ping timeout: 250 seconds)
[05:22] *** Sk1d has joined #archiveteam-bs
[06:09] *** tomwsmf-a has quit IRC (Read error: Operation timed out)
[06:12] *** mismatch_ has joined #archiveteam-bs
[06:14] *** mismatch has quit IRC (Read error: Connection reset by peer)
[06:15] *** goekesmi has quit IRC (Read error: Connection reset by peer)
[06:16] *** xXx_ndidd has joined #archiveteam-bs
[06:19] *** JesseW has quit IRC (Quit: Leaving.)
[06:23] *** ndiddy has quit IRC (Read error: Operation timed out)
[06:24] *** ndizzle has joined #archiveteam-bs
[06:26] *** kvieta has quit IRC (Read error: Operation timed out)
[06:27] *** JesseW has joined #archiveteam-bs
[06:30] *** acridAxid has quit IRC (Ping timeout: 633 seconds)
[06:30] *** acridAxid has joined #archiveteam-bs
[06:37] *** xXx_ndidd has quit IRC (Read error: Operation timed out)
[06:39] *** kvieta has joined #archiveteam-bs
[06:43] *** goekesmi has joined #archiveteam-bs
[07:31] botpie91: tell ivan` that https://www.youtube.com/watch?v=DydIK14AvXI must be archived
[07:31] yipdw: I'll pass that on when ivan` is around.
[07:32] *** VADemon has quit IRC (Quit: left4dead)
[07:38] *** bzc6p has joined #archiveteam-bs
[07:38] *** swebb sets mode: +o bzc6p
[07:40] yipdw, Fletcher: do you have an idea about the problem with shallow archivebot crawls that no WARCs appear on IA?
[07:40] https://ia801502.us.archive.org/16/items/archiveteam_archivebot_go_20160304010001/urls-vt.idiota.hu-kepfeltoltes_hu_images_2014_12-shallow-20160226-220819-21vu6.json
[07:40] *** JesseW has quit IRC (Quit: Leaving.)
[07:40] E.g. the "kepfeltoltes" one here
[07:42] See also the #archivebot channel, I came up with this 10 hours ago. I write it here too because I can't stay on IRC now, but please reply here and I'll read back the public logs. (I can't read the #archivebot logs)
[07:42] Thank you!
[07:43] the json only got posted yesterday, I'd give it a few more days before panicking (they can end up on different items)
[07:43] Erm, sorry, the item is https://archive.org/details/archiveteam_archivebot_go_20160304010001
[07:45] I waited for the next item, nor does it appear there. It had at least two days to upload to FOS. It might need more for 60 GB to upload, though.
[07:49] BUT, I see many other jsons with no corresponding WARCs. Is that okay?
[07:49] (Small ones)
[07:57] So, for me, many WARCs seem to be lost on the way (even if the 60 GB kepfeltoltes one is still uploading), but it might be me not knowing the system.
[08:00] *** JesseW has joined #archiveteam-bs
[08:01] Thanks in advance, I must leave now, I'll read back.
[08:01] *** bzc6p has quit IRC (Quit: bzc6p)
[08:11] there's an automatic view at http://archive.fart.website/archivebot/viewer/audit to flag such cases
[08:30] *** JesseW has quit IRC (Quit: Leaving.)
[08:32] *** vitzli has quit IRC (Read error: Connection reset by peer)
[10:11] updating the results for livejournal
[10:11] not updated it in 5 days :o
[10:16] *** Sk2d has joined #archiveteam-bs
[10:17] *** Sk1d has quit IRC (hub.se irc.du.se)
[10:17] *** Boppen has quit IRC (hub.se irc.du.se)
[10:19] i'm uploading more cbsnews.com videos
[10:19] from july 2007
[10:21] *** schbirid has joined #archiveteam-bs
[10:23] there pushed
[10:29] *** kvieta has quit IRC (Read error: Operation timed out)
[10:30] *** SN4T14 has quit IRC (Read error: Operation timed out)
[10:30] *** beardicus has quit IRC (Read error: Operation timed out)
[10:31] *** botpie91 has quit IRC (Read error: Operation timed out)
[10:31] *** SN4T14 has joined #archiveteam-bs
[10:32] *** closure has quit IRC (Read error: Operation timed out)
[10:32] *** Sk2d is now known as Sk1d
[10:32] *** toad2 has quit IRC (Read error: Operation timed out)
[10:35] *** godane has quit IRC (Read error: Operation timed out)
[10:36] *** godane has joined #archiveteam-bs
[10:58] *** botpie91 has joined #archiveteam-bs
[10:58] *** toad1 has joined #archiveteam-bs
[10:59] *** beardicus has joined #archiveteam-bs
[11:00] *** kvieta has joined #archiveteam-bs
[11:00] *** closure has joined #archiveteam-bs
[11:07] *** Boppen has joined #archiveteam-bs
[11:09] *** beardicus has quit IRC (Read error: Operation timed out)
[11:09] *** kvieta has quit IRC (Read error: Operation timed out)
[11:10] *** closure has quit IRC (Read error: Operation timed out)
[11:11] *** botpie91 has quit IRC (Read error: Operation timed out)
[11:12] *** toad1 has quit IRC (Read error: Operation timed out)
[11:18] *** toad1 has joined #archiveteam-bs
[11:37] *** closure has joined #archiveteam-bs
[11:38] *** botpie91 has joined #archiveteam-bs
[11:38] *** kvieta has joined #archiveteam-bs
[11:39] *** Boppen has quit IRC (Ping timeout: 194 seconds)
[11:39] *** Boppen has joined #archiveteam-bs
[11:47] *** Boppen has quit IRC (Ping timeout: 194 seconds)
[11:58] *** beardicus has joined #archiveteam-bs
[12:23] *** beardicus has quit IRC (Read error: Operation timed out)
[12:29] *** beardicus has joined #archiveteam-bs
[12:35] *** closure has quit IRC (Ping timeout: 633 seconds)
[12:52] *** Stiletto is now known as Stilett0
[12:57] *** closure has joined #archiveteam-bs
[13:16] NewsBuddy is running again :)
[13:33] *** jut has joined #archiveteam-bs
[13:54] emfcamp tickets are available now: https://www.emfcamp.org/tickets/choose
[16:05] *** VADemon has joined #archiveteam-bs
[16:29] *** bzc6p has joined #archiveteam-bs
[16:29] *** swebb sets mode: +o bzc6p
[16:47] *** bzc6p has quit IRC (Read error: Operation timed out)
[16:59] *** chfoo has quit IRC (Quit: chfoo)
[17:07] *** bzc6p has joined #archiveteam-bs
[17:07] *** swebb sets mode: +o bzc6p
[17:08] *** chfoo has joined #archiveteam-bs
[17:14] *** kvieta has quit IRC (Read error: Operation timed out)
[17:18] *** beardicus has quit IRC (Read error: Operation timed out)
[17:19] *** closure has quit IRC (Ping timeout: 633 seconds)
[17:30] VADemon: please post the log on some site and give me the URL
[17:31] *** botpie91 has quit IRC (Ping timeout: 633 seconds)
[17:35] arkiver: that's the part I copy-pasted: http://paste.nerds.io/gufezilizi.pl
[17:36] unfortunately I have already closed the file with the earlier part of the log
[17:38] scripts are updated.
[17:42] *** kvieta has joined #archiveteam-bs
[17:42] *** closure has joined #archiveteam-bs
[17:44] *** botpie91 has joined #archiveteam-bs
[17:49] *** ErkDog has joined #archiveteam-bs
[18:07] *** JesseW has joined #archiveteam-bs
[18:13] *** beardicus has joined #archiveteam-bs
[18:18] *** tomwsmf-a has joined #archiveteam-bs
[18:19] *** ErkDog has quit IRC ()
[18:20] *** ErkDog_ has joined #archiveteam-bs
[18:20] *** ErkDog_ is now known as ErkDog
[18:27] arkiver: did we get everything from Thingiverse?
[18:32] *** bzc6p sets mode: +o achip
[18:32] *** bzc6p sets mode: +oooo achip chfoo chfoo- godane
[18:32] *** bzc6p sets mode: +oooo HCross JesseW joepie91 Kazzy
[18:32] *** bzc6p sets mode: +oooo Kenshin JW_work midas ohhdemgir
[18:33] *** bzc6p sets mode: +oooo schbirid Simpbrai_ SimpBrain Start
[18:33] *** bzc6p sets mode: +o wp494
[18:34] *** ndiddy has joined #archiveteam-bs
[18:35] *** botpie91 has quit IRC (Read error: Operation timed out)
[18:35] *** beardicus has quit IRC (Read error: Operation timed out)
[18:35] *** JesseW has quit IRC (Quit: Leaving.)
[18:37] *** kvieta has quit IRC (Ping timeout: 633 seconds)
[18:39] *** closure has quit IRC (Read error: Operation timed out)
[18:47] *** ndizzle has quit IRC (Read error: Operation timed out)
[19:00] *** botpie91 has joined #archiveteam-bs
[19:00] *** beardicus has joined #archiveteam-bs
[19:01] *** closure has joined #archiveteam-bs
[19:01] *** midas sets mode: +o closure
[19:01] *** kvieta has joined #archiveteam-bs
[19:29] *** beardicus has quit IRC (Read error: Operation timed out)
[19:29] *** kvieta has quit IRC (Read error: Operation timed out)
[19:32] *** botpie91 has quit IRC (Read error: Operation timed out)
[19:34] *** closure has quit IRC (Ping timeout: 633 seconds)
[19:35] *** bzc6p has quit IRC (Ping timeout: 260 seconds)
[19:44] *** jut has quit IRC (Read error: Connection reset by peer)
[19:51] *** bzc6p has joined #archiveteam-bs
[19:51] *** swebb sets mode: +o bzc6p
[19:55] *** kvieta has joined #archiveteam-bs
[19:55] *** beardicus has joined #archiveteam-bs
[19:56] *** botpie91 has joined #archiveteam-bs
[19:58] *** JesseW has joined #archiveteam-bs
[19:59] *** closure has joined #archiveteam-bs
[19:59] *** midas sets mode: +o closure
[20:03] *** VADemon has quit IRC (Ping timeout: 258 seconds)
[20:16] *** VADemon has joined #archiveteam-bs
[20:35] *** Boppen has joined #archiveteam-bs
[20:39] *** metalcamp has joined #archiveteam-bs
[21:19] starting to do a grab of radio4all.com
[21:19] going through the first 10000 programs first
[21:31] *** bzc6p has quit IRC (Ping timeout: 250 seconds)
[21:38] *** schbirid has quit IRC (Quit: Leaving)
[21:43] *** bzc6p has joined #archiveteam-bs
[21:43] *** swebb sets mode: +o bzc6p
[21:47] *** JesseW has quit IRC (Quit: Leaving.)
[22:17] *** metalcamp has quit IRC (Ping timeout: 258 seconds)
[22:23] *** Boppen has quit IRC (hub.se irc.du.se)
[22:39] SimpBrain: where can I find your livejournal uploads?
[23:34] *** bwn has joined #archiveteam-bs
[23:38] *** xXx_ndidd has joined #archiveteam-bs
[23:51] *** ndiddy has quit IRC (Read error: Operation timed out)