[00:01] *** RichardG_ has joined #archiveteam-bs [00:02] *** RichardG has quit IRC (Ping timeout: 250 seconds) [00:52] *** kristian_ has quit IRC (Quit: Leaving) [01:31] *** RichardG_ has quit IRC (west.us.hub irc.mzima.net) [01:31] *** Start has quit IRC (west.us.hub irc.mzima.net) [01:31] *** SmileyG has quit IRC (west.us.hub irc.mzima.net) [01:31] *** signius has quit IRC (west.us.hub irc.mzima.net) [01:31] *** Jordan has quit IRC (west.us.hub irc.mzima.net) [01:31] *** tapedrive has quit IRC (west.us.hub irc.mzima.net) [01:39] *** RichardG_ has joined #archiveteam-bs [01:39] *** Start has joined #archiveteam-bs [01:39] *** SmileyG has joined #archiveteam-bs [01:39] *** signius has joined #archiveteam-bs [01:39] *** Jordan has joined #archiveteam-bs [01:39] *** tapedrive has joined #archiveteam-bs [02:23] *** RichardG_ is now known as RichardG [02:35] *** vitzli has joined #archiveteam-bs [02:52] *** VADemon has quit IRC (Read error: Operation timed out) [03:00] *** TheKiwi has joined #archiveteam-bs [03:30] *** ravetcofx has joined #archiveteam-bs [03:53] *** jrwr has quit IRC (Remote host closed the connection) [04:08] *** Igloo^_^ has quit IRC (Ping timeout: 250 seconds) [04:14] *** Igloo^_^ has joined #archiveteam-bs [04:27] *** vitzli has quit IRC (Quit: Leaving) [05:24] *** Aranje has joined #archiveteam-bs [05:50] *** Sk1d has quit IRC (Ping timeout: 250 seconds) [05:57] *** Sk1d has joined #archiveteam-bs [06:18] *** Start has quit IRC (Remote host closed the connection) [06:22] *** Start has joined #archiveteam-bs [06:23] *** jrwr has joined #archiveteam-bs [06:54] *** Start has quit IRC (Quit: Disconnected.) [07:16] *** Aranje has quit IRC (Quit: Three sheets to the wind) [08:21] *** tfgbd_znc has quit IRC (Read error: Connection reset by peer) [08:44] *** BlueMaxim has quit IRC (Quit: Leaving) [08:46] *** ravetcofx has quit IRC (Read error: Operation timed out) [10:43] *** GE has joined #archiveteam-bs [10:57] *** Stilett0 has joined #archiveteam-bs [11:01] *** Stiletto has quit IRC (Ping timeout: 376 seconds) [11:43] *** GE has quit IRC (Quit: zzz) [12:21] *** Yoshimura has joined #archiveteam-bs [13:45] *** krazedkat has joined #archiveteam-bs [13:54] *** GE has joined #archiveteam-bs [14:03] *** VADemon has joined #archiveteam-bs [14:17] *** VADemon has quit IRC (Read error: Operation timed out) [14:30] *** sep332_ has joined #archiveteam-bs [15:02] *** Start has joined #archiveteam-bs [15:49] *** Start has quit IRC (Quit: Disconnected.) [15:51] *** RichardG_ has joined #archiveteam-bs [15:52] *** RichardG has quit IRC (Read error: Operation timed out) [15:54] *** RichardG_ is now known as RichardG [16:17] *** Aranje has joined #archiveteam-bs [18:11] *** ravetcofx has joined #archiveteam-bs [18:13] *** Aranje has quit IRC (Ping timeout: 260 seconds) [18:35] *** VADemon has joined #archiveteam-bs [19:13] *** RichardG has quit IRC (Quit: Keyboard not found, press F1 to continue) [20:22] *** ndiddy has joined #archiveteam-bs [20:41] is this in IA yet? http://www.collection.archivist.info/ [20:47] *** Yoshimura has quit IRC (Read error: Operation timed out) [21:32] *** kristian_ has joined #archiveteam-bs [22:00] *** ZizzyDizz has joined #archiveteam-bs [22:00] Oever here now. [22:01] ZizzyDizz: quick glance at weathermap suggests that archive.org itself has a 50gbps link to ~the internet~ [22:01] Ayy [22:01] http://theponyarchive.com/ [22:01] whee, I'm 2% of IA's max outgoing [22:01] my site [22:01] I don't know how much the rsync targets can handle though, for archiveteam projects [22:02] My main archive only has 100mbps [22:02] For video streaming we drop active files onto a secondary domain and then use the 1gbps connection [22:02] any given rsync target can probably take in between 100mbps and 10gbps depending on where it's hosted... but for the purpose of uploading your entire archive to IA you'd probably best upload to archive.org directly [22:02] paging SketchCow [22:03] can you upload to archive.org directly with rsync? I didn't know that you could. [22:03] ZizzyDizz: no, not with rsync [22:03] ZizzyDizz: the rsync bit refers to contributing to archiveteam projects [22:03] :p [22:03] since those projects use rsync for uploading data before it gets packed up and shipped off to IA [22:04] my web server is only an atom and uhh, it takes a hit when scrapers are used, it's been being scraped by google and other search engines for about 8 months now [22:04] Had to actually block google and stuff from it due to slow downs [22:04] i thought you had a massive cdn [22:04] The main site is 100mbps, bounce that over to 1gbps and if needed it goes over to a 10gbps network or google drive [22:04] this is a very confusing conversation [22:05] how does one accumulate 12tb of ponies [22:05] I got tired of seeing random videos and channels disappear [22:05] so I grabbed everything [22:06] I know that feeling [22:06] I was searching for more archives when I found archiveteam [22:06] *** dashcloud has quit IRC (Quit: No Ping reply in 180 seconds.) [22:06] I now grab things other than ponies if I feel that their channel may be at risk [22:07] Youtube Heroes is putting my own team on edge [22:07] Considering youtube follows their own rules and not those of pure fair use. [22:07] *** dashcloud has joined #archiveteam-bs [22:14] joepie91: is there a version of warrior that works from linux shell? [22:15] the 'warrior' is purely the VM that automates a few things to make life easy for people [22:15] otherwise, you can run project scripts manually [22:15] depending on the project, everything is usually on https://github.com/archiveteam [22:16] Kaz is correct, and if you don't like vm, there is a docker container too [22:18] Will check it out thanks. [22:19] That's a lot of scripts. [22:19] I'll wait and see which one I should run. [22:24] *** jrwr has quit IRC (Leaving) [22:44] current live Projects: livejournal-discovery. Low bandwith, max 2 instances per IP. yahoo answers. Medium bandwith, max 4 instances per IP. lower recommended. urlteam. Permanent Project. low bandwith, high concurrent possible. newsbuddy. Very high bandwith, medium manual work for script-updates, not a tracker project. Very CPU and bw intensive. IA. [22:44] BAK. Backing up Internet Archive. low to mid bandwith, high permanent storage. [22:57] I'd go with backing up IA but I need to get new drives in a proper array. [22:58] I'll try to set up the yahoo answers as a single thread later to help out. [23:09] *** BlueMaxim has joined #archiveteam-bs [23:13] *** Start has joined #archiveteam-bs [23:23] *** GE has quit IRC (Quit: zzz) [23:46] *** godane has quit IRC (Leaving.) [23:48] chfoo: any chance you're around? I was trying to run wpull through a socks proxy but can't figure it out