[00:00] *** SchroSct has joined #archiveteam
[00:00] Nyany: sup
[00:16] *** _sagitair has quit IRC (Read error: Operation timed out)
[00:23] *** Dispenser has joined #archiveteam
[00:25] *** odemg has joined #archiveteam
[00:28] *** GreenObse has quit IRC (Read error: Operation timed out)
[00:29] *** yan has joined #archiveteam
[00:32] .help
[00:36] Hello, there's interest to get CC-BY-SA ipernity photos on Wikimedia Commons https://commons.wikimedia.org/wiki/Commons:Bots/Work_requests#Upload_of_Ipernity_files
[00:38] Dispenser, #ipfinite
[00:44] *** raiden_0x has joined #archiveteam
[00:48] *** Kelt has quit IRC (Ping timeout: 250 seconds)
[01:06] *** odemg has quit IRC (Remote host closed the connection)
[01:12] *** MarcTheYo has joined #archiveteam
[01:26] *** User405 has joined #archiveteam
[01:26] *** User404 has quit IRC (Read error: Connection reset by peer)
[01:29] *** vitzli has joined #archiveteam
[01:32] *** odemg has joined #archiveteam
[01:35] *** icedice has quit IRC (Quit: Leaving)
[01:39] *** odemg has quit IRC (Remote host closed the connection)
[01:59] what's better, 1 concurrent or 6? need to settle a bet
[02:00] ...Under what circumstances?
[02:01] Presumably 6 concurrent would make your download fastest.
[02:01] But it's easy to overload a host that way, or get your IP banned.
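The trade-off raised at [02:01] (more concurrent downloads are faster but risk overloading the host or getting an IP banned) can be illustrated with a minimal Python sketch. This is not how the Archive Team warrior or its scripts are implemented; `fetch_all`, its `concurrency` and `delay` parameters, and the `fetch` callable are all hypothetical names chosen for illustration.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor


def fetch_all(items, fetch, concurrency=2, delay=0.0):
    """Call fetch(item) for every item with at most `concurrency` calls in flight.

    `fetch` is a caller-supplied function (hypothetical here); `delay` is an
    optional pause, in seconds, that each worker takes after finishing an item.
    """
    def polite_fetch(item):
        result = fetch(item)
        time.sleep(delay)  # back off before this worker takes its next item
        return result

    # The pool size is the concurrency cap: no more than `concurrency`
    # fetches can run at the same moment.
    with ThreadPoolExecutor(max_workers=concurrency) as pool:
        return list(pool.map(polite_fetch, items))
```

Raising `concurrency` shortens wall-clock time only until the remote side starts throttling; against a host that rate limits per IP, a low value plus a small `delay` is the more conservative setting.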
[02:03] currently with the IMDB
[02:03] *** kris33 has joined #archiveteam
[02:16] *** odemg has joined #archiveteam
[02:27] *** Nyany has quit IRC (Quit: Page closed)
[02:39] *** aschmitz has joined #archiveteam
[02:45] SchroSct: Use 1 or 2 concurrent since IMDB rate limits by IP
[02:46] *** Nyany has joined #archiveteam
[03:01] *** Ravenloft has joined #archiveteam
[03:18] *** Nyany has quit IRC (Quit: Page closed)
[03:28] *** ndiddy has quit IRC (Read error: Connection reset by peer)
[03:48] *** Coderjoe is now known as Coderjo
[03:51] *** pizzaiolo has quit IRC (Remote host closed the connection)
[03:58] *** odemg has quit IRC (Ping timeout: 260 seconds)
[04:07] *** Ravenloft has quit IRC (Ping timeout: 633 seconds)
[04:37] *** _sagitair has joined #archiveteam
[04:39] *** raiden_0x has quit IRC (Read error: Connection refused)
[04:40] *** maelstrom has quit IRC (Ping timeout: 250 seconds)
[04:43] *** maelstrom has joined #archiveteam
[05:00] *** odemg has joined #archiveteam
[05:02] *** deetwelve has quit IRC (Ping timeout: 260 seconds)
[05:08] *** odemg2 has joined #archiveteam
[05:15] *** zyphlar has joined #archiveteam
[05:18] *** odemg has quit IRC (Read error: Operation timed out)
[05:29] *** Sk1d has quit IRC (Ping timeout: 194 seconds)
[05:31] *** maelstrom has quit IRC (Quit: Leaving)
[05:38] *** Sk1d has joined #archiveteam
[05:38] *** Sk1d has quit IRC (Connection Closed)
[05:42] *** GreenObse has joined #archiveteam
[05:48] *** odemg2 has quit IRC (Read error: Operation timed out)
[06:20] is there anything in particular causing archive.org / videobot / etc to not archive facebook videos / streams correctly?
[06:21] it seems like most breaking news is happening there and our archives of it are just abysmal
[06:36] *** odemg2 has joined #archiveteam
[06:45] *** GreenObse has quit IRC (Read error: Operation timed out)
[07:02] *** Honno has joined #archiveteam
[07:07] *** GreenObse has joined #archiveteam
[07:08] *** MMovie has quit IRC (Read error: Operation timed out)
[07:09] *** MMovie has joined #archiveteam
[07:15] *** odemg2 has quit IRC (Read error: Operation timed out)
[07:18] I find it interesting how the IMDB grab is very evenly distributed in terms of contributors.
[07:18] But the ftp-gov grab is definitely operating on a power rule.
[07:19] *** odemg2 has joined #archiveteam
[07:22] That's what happens when strict limits are in place with limited resources.
[07:22] *** GreenObse has quit IRC (Read error: Operation timed out)
[07:24] Ah, so the tracker is doing it on purpose to maximally distribute the number of IP's grabbing?
[07:25] Also who is HCross and where do they get 40 terabytes of bandwidth from? O_o
[07:25] *** schbirid has joined #archiveteam
[07:26] namespace: also, the ftp-gov grab doesn't work with the virtual machine warrior, which is easier to install
[07:26] Yeah. And it doesn't work on any of my VPS's.
[07:26] so only particularly devoted people will be able to work on it, as opposed to the wider base of people who can contribute to the imdb grab
[07:27] Otherwise I'd put in somewhere like 2tb.
[07:27] (Well, as much as I could saturate my connection with really.)
[07:28] Right now me and my friends contribute to ftp-gov as TeamAndrew and we only managed 330gb, which is a lot less than I wanted.
[07:28] *** GreenObse has joined #archiveteam
[07:28] * namespace was aiming for 1tb, at least
[07:29] imdb is limited, they ratelimit you even on low concurrency
[07:33] Yeah. Though the good news is that at a rate of 470~ grabs a minute we should get the whole thing with a slight margin for retries on broken ones or whatever.
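The estimate at [07:33] can be checked with a little arithmetic. The sketch below uses the ~470 grabs per minute quoted there and the nine-day window mentioned later in the log at [19:58]; the total number of IMDB items is not stated anywhere in the log, so the function names and the item count passed to `days_needed` are purely illustrative assumptions.

```python
MINUTES_PER_DAY = 24 * 60


def items_grabbed(rate_per_minute, days):
    """Items completed in `days` at a steady `rate_per_minute`."""
    return rate_per_minute * MINUTES_PER_DAY * days


def days_needed(total_items, rate_per_minute):
    """Days required to finish `total_items` at a steady `rate_per_minute`."""
    return total_items / (rate_per_minute * MINUTES_PER_DAY)


# About 470 grabs a minute sustained for nine days:
print(items_grabbed(470, 9))  # 6091200
```

So the quoted rate covers roughly six million items in nine days, assuming it holds steady and ignoring retries.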
[07:33] (The tracker should really include grabs per minute or second as part of the metrics.)
[07:34] Or is not including them an intentional design choice?
[07:38] *** odemg2 has quit IRC (Read error: Operation timed out)
[07:39] *** odemg2 has joined #archiveteam
[07:42] *** GreenObse has quit IRC (Read error: Operation timed out)
[07:51] *** GreenObse has joined #archiveteam
[07:59] *** odemg2 has quit IRC (Read error: Operation timed out)
[08:03] *** Kelt has joined #archiveteam
[08:06] *** GreenObse has quit IRC (Read error: Operation timed out)
[08:19] *** GreenObse has joined #archiveteam
[08:32] *** MarcTheYo has quit IRC (Ping timeout: 190 seconds)
[08:40] *** MMovie2 has joined #archiveteam
[08:41] *** MMovie has quit IRC (Read error: Operation timed out)
[09:44] *** VADemon has joined #archiveteam
[09:45] *** schbirid2 has joined #archiveteam
[09:46] *** atomotic has joined #archiveteam
[09:46] *** schbirid has quit IRC (Read error: Operation timed out)
[09:59] *** schbirid2 has quit IRC (Excess Flood)
[10:00] *** schbirid2 has joined #archiveteam
[10:02] *** BlueMaxim has quit IRC (Read error: Operation timed out)
[10:02] *** BlueMaxim has joined #archiveteam
[10:39] *** username1 has joined #archiveteam
[10:42] *** zyphlar has quit IRC (Quit: Connection closed for inactivity)
[10:42] *** schbirid2 has quit IRC (Read error: Operation timed out)
[10:45] *** icedice has joined #archiveteam
[10:49] Is anyone here good with MySQL?
[11:03] *** Sveklan has joined #archiveteam
[11:03] *** Svekla has quit IRC (Read error: Connection reset by peer)
[11:20] 0 and interactive_timeout = 0 should have MySQL wait forever and never time out, right?
[11:41] looks like 1 is the minimum limit
[11:41] I'll set it to 99999 then
[12:25] *** icedice has quit IRC (http://www.kiwiirc.com/ - A hand crafted IRC client)
[12:31] *** BlueMaxim has quit IRC (Leaving)
[12:33] *** pizzaiolo has joined #archiveteam
[12:40] *** GreenObse has quit IRC (Remote host closed the connection)
[12:54] *** MarcTheYo has joined #archiveteam
[13:21] *** odemg has joined #archiveteam
[13:23] *** schbirid2 has joined #archiveteam
[13:25] *** username1 has quit IRC (Read error: Operation timed out)
[13:27] *** odemg has quit IRC (Remote host closed the connection)
[13:30] *** atomotic has quit IRC (Quit: My MacBook has gone to sleep. ZZZzzz…)
[13:34] *** atomotic has joined #archiveteam
[13:34] *** i336_ has joined #archiveteam
[13:48] *** odemg has joined #archiveteam
[13:56] *** Honno has quit IRC (Ping timeout: 506 seconds)
[14:11] *** kris33 has quit IRC (Textual IRC Client: www.textualapp.com)
[14:19] *** i336_ has quit IRC (Ping timeout: 260 seconds)
[14:31] *** odemg has quit IRC (Read error: Operation timed out)
[14:41] *** User405 has quit IRC (Remote host closed the connection)
[14:41] *** User405 has joined #archiveteam
[14:42] *** odemg has joined #archiveteam
[14:49] *** Nyany has joined #archiveteam
[14:50] *** atomotic has quit IRC (Quit: My MacBook has gone to sleep. ZZZzzz…)
[15:07] *** Honno has joined #archiveteam
[15:38] *** atomotic has joined #archiveteam
[15:42] *** kris33 has joined #archiveteam
[15:52] *** odemg has quit IRC (Remote host closed the connection)
[15:55] *** odemg has joined #archiveteam
[15:57] *** maelstrom has joined #archiveteam
[16:00] *** icedice has joined #archiveteam
[16:09] *** mona has quit IRC (Ping timeout: 260 seconds)
[16:11] *** mona has joined #archiveteam
[16:14] *** vitzli has quit IRC (Leaving)
[16:28] *** odemg has quit IRC (Remote host closed the connection)
[16:28] *** odemg has joined #archiveteam
[16:45] *** Kelt has quit IRC (Ping timeout: 250 seconds)
[16:50] *** signius has quit IRC (Quit: Leaving)
[17:05] *** fie_ has quit IRC (Ping timeout: 506 seconds)
[17:09] http://de.engadget.com/2017/02/07/imdb-stellt-foren-ein-archiveteam-springt-ein/
[17:13] *** fie_ has joined #archiveteam
[17:18] *** icedice has quit IRC (Quit: Leaving)
[17:19] Our wiki article is linked on the page. I'm going to update it
[17:19] kk. Probably useful to include warrior information.
[17:21] *** odemg has quit IRC (Remote host closed the connection)
[17:22] *** odemg has joined #archiveteam
[17:24] Hi! Can AT archive Kenai.com? "The Kenai.com site will be closing permanently on April 28, 2017.", it's a collaborative hosting site for free and open source projects, launched by Sun Microsystems and now owned by Oracle.
[17:25] Sveklan, we have an archivebot job going on it. http://dashboard.at.ninjawedding.org/3?showNicks=1
[17:25] Filter: b71yhng51njwlys4k7kso6m25
[17:26] rocode, thank you
[17:29] Is anyone backing up the repositories, rocode ?
[17:33] PurpleSym, looks like we are getting the files: https://kenai.com/projects/request/sources/source-code-repository/content/build.xml?rev=62
[17:38] *** sagitaire has joined #archiveteam
[17:44] *** sagitaire has quit IRC (Read error: Operation timed out)
[17:51] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com)
[18:25] *** VADemon has quit IRC (Ping timeout: 255 seconds)
[18:27] *** VADemon has joined #archiveteam
[19:06] *** jrwr has quit IRC (Read error: Connection reset by peer)
[19:10] *** VADemon has quit IRC (Quit: left4dead)
[19:10] *** _sagitair is now known as raiden_0x
[19:13] *** icedice has joined #archiveteam
[19:14] Is there some important reason why wget-lua is distributed as source?
[19:15] Ask in #archiveteam-bs
[19:16] Blah, that's where I thought I clicked.
[19:17] *** VADemon has joined #archiveteam
[19:19] *** atomotic has joined #archiveteam
[19:24] *** Nyany has quit IRC (Ping timeout: 268 seconds)
[19:35] *** Stiletto has quit IRC ()
[19:49] not related to IMDB, but I recently read that MIT Video will be discontinued as of February 28th (http://video.mit.edu/). They don't host the videos, but links with metadata (tags, etc.). Would it be worth starting a project for this?
[19:49] *** marvinw is now known as ivan
[19:52] *** Stil3tt0 has joined #archiveteam
[19:53] *** MIT has joined #archiveteam
[19:53] *** Honno has quit IRC (Ping timeout: 506 seconds)
[19:58] Jeeez, by my rough estimate it was going to take all nine days to grab IMDB, but assuming the current batch is the whole thing, we grabbed almost the whole site in one night. XD
[20:14] *** SmileyG has quit IRC (http://www.milkme.co.uk - You'll never understand.)
[20:17] *** odemg has quit IRC (Remote host closed the connection)
[20:17] *** odemg has joined #archiveteam
[20:27] yeah, is going fast now
[20:58] *** pizzaiolo has quit IRC (Read error: Connection reset by peer)
[20:59] *** pizzaiolo has joined #archiveteam
[21:26] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com)
[21:30] *** schbirid2 has quit IRC (Quit: Leaving)
[21:36] *** deetwelve has joined #archiveteam
[21:38] *** ndiddy has joined #archiveteam
[21:39] *** wp494 has quit IRC (Ping timeout: 260 seconds)
[21:44] *** balrog has quit IRC (Read error: Operation timed out)
[21:44] *** Mayonaise has quit IRC (Read error: Operation timed out)
[21:44] *** yakfish has quit IRC (Read error: Operation timed out)
[21:45] *** yakfish has joined #archiveteam
[21:45] *** btfo has quit IRC (Read error: Operation timed out)
[21:45] *** ivan has quit IRC (Read error: Operation timed out)
[21:45] *** marvinw has joined #archiveteam
[21:45] *** sep332 has quit IRC (Read error: Operation timed out)
[21:45] *** Zialus has quit IRC (Read error: Operation timed out)
[21:45] *** MMovie2 has quit IRC (Read error: Operation timed out)
[21:45] *** aschmitz has quit IRC (Read error: Operation timed out)
[21:46] *** REiN^ has quit IRC (Read error: Operation timed out)
[21:46] *** SmileyG has joined #archiveteam
[21:46] *** remsen has quit IRC (Ping timeout: 365 seconds)
[21:46] *** remsen has joined #archiveteam
[21:47] *** beardicus has quit IRC (Read error: Operation timed out)
[21:48] *** XPEric has quit IRC (Read error: Operation timed out)
[21:49] *** XPEric has joined #archiveteam
[21:50] *** Svekla has joined #archiveteam
[21:51] *** remsen has quit IRC (Read error: Operation timed out)
[21:51] *** TC01 has quit IRC (Read error: Operation timed out)
[21:51] *** SadDM has quit IRC (Read error: Operation timed out)
[21:51] *** yakfish has quit IRC (Read error: Operation timed out)
[21:52] *** jspiros has quit IRC (Read error: Operation timed out)
[21:52] *** marvinw has quit IRC (Read error: Operation timed out)
[21:52] *** mhazinsk has quit IRC (Ping timeout: 615 seconds)
[21:52] *** phuzion has quit IRC (Ping timeout: 615 seconds)
[21:53] *** Sveklan has quit IRC (Ping timeout: 246 seconds)
[21:53] *** TC01 has joined #archiveteam
[21:54] *** aMunster has quit IRC (Write error: Broken pipe)
[21:56] *** aMunster has joined #archiveteam
[21:57] *** balrog has joined #archiveteam
[21:57] *** Smiley has quit IRC (Write error: Broken pipe)
[21:58] *** sep332 has joined #archiveteam
[21:58] *** beardicus has joined #archiveteam
[21:58] *** aschmitz has joined #archiveteam
[22:00] *** BiggieJon has joined #archiveteam
[22:00] *** remsen has joined #archiveteam
[22:01] *** MMovie has joined #archiveteam
[22:15] *** marvinw has joined #archiveteam
[22:15] *** yakfish has joined #archiveteam
[22:15] *** Zialus has joined #archiveteam
[22:17] *** phuzion has joined #archiveteam
[22:21] *** btfo has joined #archiveteam
[22:21] *** BlueMaxim has joined #archiveteam
[22:26] *** mhazinsk has joined #archiveteam
[22:28] *** odemg has quit IRC (Remote host closed the connection)
[22:34] *** balrog has quit IRC (Read error: Operation timed out)
[22:45] *** _sagitair has joined #archiveteam
[22:45] *** Honno has joined #archiveteam
[22:47] *** Kelt has joined #archiveteam
[22:48] *** odemg has joined #archiveteam
[22:49] *** VADemon has quit IRC (Quit: left4dead)
[22:51] *** balrog has joined #archiveteam
[22:52] *** SadDM has joined #archiveteam
[22:55] *** Honno has quit IRC (Ping timeout: 600 seconds)
[22:56] *** jspiros has joined #archiveteam
[22:57] *** wp494 has joined #archiveteam
[22:57] *** raiden_0x has quit IRC (Read error: Operation timed out)
[23:05] *** Mayonaise has joined #archiveteam
[23:19] *** RichardG_ has joined #archiveteam
[23:20] *** MIT has quit IRC (Ping timeout: 271 seconds)
[23:22] *** RichardG has quit IRC (Read error: Operation timed out)
[23:34] *** RichardG_ is now known as RichardG
[23:34] *** Nyany has joined #archiveteam
[23:42] boom. https://twitter.com/RoguePOTUSStaff/ is dead. http://www.zdnet.com/article/white-house-chief-information-security-officer-departs/ may be related.
[23:46] sad
[23:52] *** i336_ has joined #archiveteam
[23:54] *** Nyany has quit IRC (Ping timeout: 268 seconds)
[23:54] *** compu_85 has joined #archiveteam
[23:57] *** Nyany has joined #archiveteam
[23:58] *** icedice has quit IRC (Quit: Leaving)