[00:10] *** nox_ is now known as nox
[00:23] *** wacky has quit IRC (Read error: Connection reset by peer)
[00:38] ok, I've written up what I meant. Hopefully it makes more sense now.
[00:38] PR: https://github.com/ArchiveTeam/NewsGrabber/pull/25
[00:42] *** Ravenloft has joined #archiveteam
[00:45] *** Start has quit IRC (Read error: Connection reset by peer)
[00:45] *** Start has joined #archiveteam
[01:07] *** Elegance has quit IRC (Connection closed)
[01:07] *** Elegance has joined #archiveteam
[01:10] *** Emcy_ has joined #archiveteam
[01:12] *** Emcy has quit IRC (Ping timeout: 250 seconds)
[01:21] *** BlueMaxim has joined #archiveteam
[01:56] arkiver: what's the status of the ftp warrior project? It isn't mentioned on http://archiveteam.org/index.php?title=FTP ...
[01:57] specifically, has ftp.netbsd.org been grabbed?
[02:13] *** wp494_ is now known as wp494
[02:18] *** philpem has quit IRC (Ping timeout: 260 seconds)
[02:24] *** acridAxid has quit IRC (Ping timeout: 260 seconds)
[03:07] *** nertzy2 has joined #archiveteam
[03:18] *** ploopkazo has quit IRC (Read error: Operation timed out)
[03:19] *** ploopkazo has joined #archiveteam
[03:50] *** nertzy2 has quit IRC (Quit: This computer has gone to sleep)
[04:07] *** Elegance has quit IRC (Read error: Operation timed out)
[04:11] *** Elegance has joined #archiveteam
[04:43] *** Ghost_of_ has joined #archiveteam
[04:57] *** VADemon_ has quit IRC (left4dead)
[06:01] *** zhongfu has joined #archiveteam
[06:07] *** acridAxid has joined #archiveteam
[06:55] *** Elegance has quit IRC (Read error: Connection reset by peer)
[06:59] *** Elegance has joined #archiveteam
[07:08] *** GLaDOS has quit IRC (Ping timeout: 260 seconds)
[07:09] *** GLaDOS has joined #archiveteam
[07:13] *** FAMAS has joined #archiveteam
[07:13] this user wishes to know regarding the differences between the warrior system and the archivebot system
[07:16] mainly scale, we set up a warrior project when there are millions of accounts to be archived on a service
[07:17] archivebot runs on a couple of dedicated VPSes whereas the warrior is 100+ volunteers running the virtual machine image
[07:18] this user is requesting to channel operators that voice mode be given and setup for retainment on subsequent logins in channel #archivebot for purposes of site archival
[07:29] *** FAMAS has quit IRC (Ping timeout: 240 seconds)
[07:35] *** JesseW has quit IRC (Leaving.)
[07:46] this user might want to use something other than webchat
[07:59] This user is weird
[08:03] this user laughs a bit
[08:08] Right now, I'm watching way too much Star Wars Clone Wars and scanning CD-ROMs like crazy
[08:08] Well, ISO happened, this is flat-out scanning.
[08:21] *** schbirid has joined #archiveteam
[08:45] *** atomotic has joined #archiveteam
[08:46] *** JesseW has joined #archiveteam
[09:11] *** Sketchcow has quit IRC (Read error: Connection reset by peer)
[09:11] *** Sketchcow has joined #archiveteam
[09:11] *** swebb sets mode: +o Sketchcow
[09:12] *** ohhdemgir has quit IRC (Read error: Operation timed out)
[09:29] *** Start has quit IRC (Ping timeout: 311 seconds)
[09:36] *** Start has joined #archiveteam
[09:58] *** JesseW has quit IRC (Leaving.)
[10:00] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com)
[10:07] *** Start has quit IRC (Ping timeout: 311 seconds)
[10:26] *** Start has joined #archiveteam
[11:02] *** atomotic has joined #archiveteam
[11:07] *** Start has quit IRC (Excess Flood)
[11:08] *** Start has joined #archiveteam
[11:34] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com)
[11:46] *** Ghost_of_ has quit IRC (Quit: Leaving)
[12:00] *** lytv has quit IRC (Ping timeout: 252 seconds)
[12:16] *** nertzy2 has joined #archiveteam
[12:17] *** GLaDOS has quit IRC (Read error: Operation timed out)
[12:18] users should stay connected until people can add them to the autovoice list.
[12:19] *** GLaDOS has joined #archiveteam
[12:25] *** lytv has joined #archiveteam
[12:26] *** nertzy2 has quit IRC (Quit: This computer has gone to sleep)
[12:49] *** GLaDOS has quit IRC (Ping timeout: 260 seconds)
[12:53] *** Start has quit IRC (Ping timeout: 311 seconds)
[13:29] *** ohhdemgir has joined #archiveteam
[13:33] *** atomotic has joined #archiveteam
[13:46] *** slyphic|a is now known as slyphic
[13:49] *** Start has joined #archiveteam
[14:06] *** WinterFox has quit IRC (Remote host closed the connection)
[14:15] *** atomotic has quit IRC (Quit: My Mac has gone to sleep. ZZZzzz…)
[14:25] *** yipdw has quit IRC (Read error: Operation timed out)
[14:25] *** yipdw has joined #archiveteam
[14:26] *** Kenshin has quit IRC (Read error: Operation timed out)
[14:27] *** cadbury has quit IRC (Read error: Operation timed out)
[14:28] *** Kenshin has joined #archiveteam
[14:28] *** [phire] has quit IRC (Read error: Operation timed out)
[14:28] *** wutno has quit IRC (Read error: Operation timed out)
[14:30] *** Ghost_of_ has joined #archiveteam
[14:30] *** brayden has quit IRC (Read error: Operation timed out)
[14:31] *** nertzy has joined #archiveteam
[14:35] *** Lord_Nigh has quit IRC (Ping timeout: 606 seconds)
[14:35] *** Lord_Nigh has joined #archiveteam
[14:35] *** Famicoman has quit IRC (Read error: Operation timed out)
[14:37] *** Start_ has joined #archiveteam
[14:38] *** Start has quit IRC (Read error: Connection reset by peer)
[14:38] *** dan-- has quit IRC (Ping timeout: 606 seconds)
[14:38] *** gibigian1 has joined #archiveteam
[14:41] *** godane has quit IRC (Ping timeout: 606 seconds)
[14:41] *** godane has joined #archiveteam
[14:42] *** gibigiana has quit IRC (Read error: Connection reset by peer)
[14:43] *** atomotic has joined #archiveteam
[14:44] *** ivan` has quit IRC (Ping timeout: 606 seconds)
[14:44] *** Start_ has quit IRC (Quit: Disconnected.)
[14:46] *** nertzy has quit IRC (Quit: This computer has gone to sleep)
[14:48] *** marvinw has joined #archiveteam
[14:50] *** wyatt8740 has quit IRC (Read error: Operation timed out)
[14:51] *** Famicoman has joined #archiveteam
[14:51] *** [phire] has joined #archiveteam
[14:52] *** BlueMaxim has quit IRC (Quit: Leaving)
[14:52] *** wyatt8740 has joined #archiveteam
[14:53] *** nertzy has joined #archiveteam
[14:54] *** superkuh_ has quit IRC (Ping timeout: 606 seconds)
[14:55] *** superkuh_ has joined #archiveteam
[14:59] *** SilSte has joined #archiveteam
[15:03] *** afics has quit IRC (Ping timeout: 606 seconds)
[15:05] *** nertzy has quit IRC (Quit: This computer has gone to sleep)
[15:06] *** megaminxw has quit IRC (Quit: Leaving.)
[15:09] *** cadbury has joined #archiveteam
[15:09] *** afics has joined #archiveteam
[15:09] *** afics has quit IRC (Excess Flood)
[15:09] *** afics has joined #archiveteam
[15:10] *** wyatt8740 has quit IRC (Read error: Operation timed out)
[15:16] *** wyatt8740 has joined #archiveteam
[15:27] *** dan- has joined #archiveteam
[15:51] *** nertzy has joined #archiveteam
[16:00] *** Start has joined #archiveteam
[16:35] *** nertzy has quit IRC (Quit: This computer has gone to sleep)
[16:52] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com)
[17:06] *** Start has quit IRC (Quit: Disconnected.)
[17:06] *** JesseW has joined #archiveteam
[17:13] *** FAMAS has joined #archiveteam
[17:14] *** Start has joined #archiveteam
[17:16] *** FAMAS has quit IRC (Client Quit)
[17:31] *** JesseW has quit IRC (Leaving.)
[17:41] *** Ghost_of_ has quit IRC (Quit: Leaving)
[17:41] *** philpem has joined #archiveteam
[17:59] *** putt has joined #archiveteam
[18:17] *** marvinw is now known as ivan`
[18:28] *** putt has quit IRC (Quit: Page closed)
[18:30] *** acridAxid has quit IRC (Read error: Operation timed out)
[18:38] *** nertzy has joined #archiveteam
[18:38] http://www.gwern.net/Archiving URLs <- should be linked from the wiki somewhere
[18:38] *** Start has quit IRC (Quit: Disconnected.)
[18:48] *** wutno has joined #archiveteam
[18:55] Then add it on the wiki o_o
[18:56] Not sure where it'd fit though. Maybe under "Recommended reading"
[19:11] I should have mentioned my comment was a "Note to self". I don't have access to my wiki account while I'm at work.
[19:14] *** nertzy has quit IRC (Quit: This computer has gone to sleep)
[19:20] *** Akaibu has joined #archiveteam
[19:38] *** pikhq has quit IRC (Read error: Connection reset by peer)
[19:41] *** Start has joined #archiveteam
[19:43] *** pikhq has joined #archiveteam
[19:47] *** Morbus has joined #archiveteam
[19:52] *** mafrasi2_ has joined #archiveteam
[19:52] *** mafrasi2 has quit IRC (Read error: Connection reset by peer)
[19:55] *** scyther has joined #archiveteam
[19:58] *** mafrasi2 has joined #archiveteam
[19:58] *** mafrasi2_ has quit IRC (Read error: Connection reset by peer)
[19:59] ivan`: ping?
[20:00] Have you heard from tree33 about that trucker's youtube channel we were grabbing?
[20:00] I'm sitting in 193GB of video here at work that I'd like to get rid of sooner than later.
[20:05] *** brayden has joined #archiveteam
[20:05] *** swebb sets mode: +o brayden
[20:16] phuzion: I can take it off your hands
[20:16] I only need the ones that aren't on this list https://gist.githubusercontent.com/ivan/c963e2b238a1891e4e34/raw/7d95859dd2cc8970d996d09bbbe5cf865d9face9/gistfile1.txt
[20:16] ivan`: Here's my list of files on my NAS at work https://clbin.com/RjV7S
[20:17] phuzion: are the .part files complete?
[20:18] Actually, those don't even exist anymore. Let me give you a new listing
[20:19] Here's a new list for ya
[20:19] https://clbin.com/7G2Ly
[20:21] ok sec
[20:23] FYI: http://techcrunch.com/2016/01/04/yahoo-shuts-down-yahoo-screen-its-home-for-original-content/
[20:24] phuzion: these are the only ones I need, then: cp -al *sPiN0NSAa1E* *0WMFHn_QB_w* *YwPPUJrgvGU* *d8KSi-ZhAxY* *KzbUfyqh044* *KzKpfRsRv3w* *e5E6W-GdwT0* *ROUEb0Qg1mc* *tnmNimjUjio* *b71Yol2hc_w* *q8x6-fnsiGE* *upY9gNKbFCE* *qfhTfLqyEfE* *wlWwtYWJlcg* *5i9aXZrkbws* ~/dest
[20:24] phuzion: can you rsync them over?
[20:24] or should I grab them from somewhere
[20:25] ivan`: got a box I can rsync these to?
[20:25] sec
[20:26] *** nertzy has joined #archiveteam
[20:27] ivan`: Also, check these, if there's anything there, I have those sets of files at home.
[20:27] https://clbin.com/3dPiW https://clbin.com/1tabk
[20:35] phuzion: EDWKZukmSCQ from https://clbin.com/3dPiW is unique
[20:35] Alright, I'll rsync it over when I get home.
[20:40] ivan`: can you check that 10 Good morning Annawan Illinois-ROUEb0Qg1mc.mp4 came over ok?
[20:40] *** megaminxw has joined #archiveteam
[20:40] And 12 Walkabouts-e5E6W-GdwT0.mp4 for that matter too
[20:42] *** schbirid has quit IRC (Quit: Leaving)
[20:43] phuzion: looks fine to me
[20:44] Cool, just wanted to double check. Thanks.
[20:45] *** Start has quit IRC (Quit: Disconnected.)
[20:45] in the future see the command on http://www.archiveteam.org/index.php?title=YouTube
[20:54] *** Ghost_of_ has joined #archiveteam
[20:56] *** Start has joined #archiveteam
[20:57] *** nertzy has quit IRC (Quit: This computer has gone to sleep)
[20:59] ivan`: which command, the first one or kyan's command?
[21:01] phuzion: was referring to the first one
[21:01] ok cool, thanks
[21:10] *** atomotic has joined #archiveteam
[21:19] *** nertzy has joined #archiveteam
[21:28] *** Morbus has quit IRC (Quit: http://www.disobey.com/)
[21:29] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com)
[21:30] ivan`: You'll have to pardon the wildly varying speeds going out, it seems like either my ISP or someone between me and you is having a fit and dropping speeds to like 10KB/s every so often.
[21:37] phuzion: there is no rush
[21:38] Yeah, I'm cool with whatever. Doesn't matter to me. If I leave, it'll just keep going until it finishes.
[21:40] http://jobs.code4lib.org/job/24676/ webrecorder job
[21:40] https://webrecorder.io/static/__shared/jobs/developer.pdf
[21:54] *** WinterFox has joined #archiveteam
[21:56] *** nertzy has quit IRC (Quit: This computer has gone to sleep)
[21:59] *** scyther has quit IRC (Read error: Connection reset by peer)
[22:00] iirc --title is deprecated, --download-archive is probably worth adding as well. (records a list of downloaded files that makes resuming a playlist download much faster)
[22:01] (I'm not experienced enough with youtube-dl to be comfortable editing the recommended options on the wiki)
[22:02] ivan`: Files sent.
[22:05] also .. "Since the end of April 2015 and version 2015.04.26 youtube-dl uses -f bestvideo+bestaudio/best as default format selection"
[22:09] *** slyphic is now known as slyphic|a
[22:10] Yahoo! is shutting down Yahoo Screen, their video streaming service: https://variety.com/2016/digital/news/yahoo-shutters-video-service-yahoo-screen-exclusive-1201671374/
[22:10] Have we archived that yet?
[22:10] They've got some original content on their
[22:10] *there
[22:10] yahoo screen
[22:11] * MrRadar slaps forehead
[22:11] wait, it's already shut down?
[22:12] Weird, it works for me in Firefox but in Chrome it redirects to their homepage
[22:12] also, that is a freckin sucky news page
[22:13] OK, sometimes it redirects and sometimes it doesn't
[22:13] This link works: https://www.yahoo.com/tv/other-space-episode-1-into-the-great-211801697.html
[22:13] is that from yahoo screen?
[22:14] *** nertzy has joined #archiveteam
[22:14] Maybe not...
[22:15] OK, according to this article: http://arstechnica.com/business/2016/01/yahoo-yanks-yahoo-screen-hub-scatters-original-content-across-sites/
[22:15] Yahoo shutdown their Screen site
[22:16] But put their original content up at this URL: https://www.yahoo.com/tv/tagged/originals
[22:16] *** Start has quit IRC (Quit: Disconnected.)
[22:16] So at least that content still exists (... for now)
[22:19] *** Lord_Nigh has quit IRC (Read error: Operation timed out)
[22:26] *** brayden has quit IRC (Quit: Leaving)
[22:28] *** Ghost_of_ has quit IRC (Read error: Operation timed out)
[22:29] *** Ghost_of_ has joined #archiveteam
[22:43] JW_work: I really like the idea of addin the wikidata IDs to newsgrabber
[22:44] *** brayden has joined #archiveteam
[22:44] *** swebb sets mode: +o brayden
[22:50] arkiver: excellent! glad to hear it
[23:01] arkiver: if you have some spare time, feel free to merge the PR
[23:01] Did you test it?
[23:01] (looks fine, but you never know what pops up)
[23:02] I have tested that it is picked up and displayed correctly by my services.html page. I have not tested whether it breaks the actual functionality of newsgrabber, as I don't know how to test that without disrupting the existing one.
[23:02] ok, then it should be fine
[23:02] I'll merge the update soon, but it won't be in newsgrabber yet
[23:02] cool — no hurry
[23:03] ok
[23:03] OldFriends scripts are updated!
[23:03] ivan`: sending that last video your way now. Should be there in about 1-2 minutes.
[23:03] It'll still be a while to get wikidata entries for all 300+ sources in any case. :-)
[23:07] p.s join #newsgrabber etc etc
[23:07] JW_work: ^
[23:07] *** Lord_Nigh has joined #archiveteam
[23:08] New items added for oldfriends!
[23:10] *** Start has joined #archiveteam
[23:12] *** Lord_Nigh has quit IRC (Ping timeout: 252 seconds)
[23:14] *** RedType has quit IRC (Ping timeout: 258 seconds)
[23:39] *** nertzy has quit IRC (Quit: This computer has gone to sleep)
[23:42] *** Lord_Nigh has joined #archiveteam
[23:50] *** RedType has joined #archiveteam
[23:53] SketchCow: have you seen what I wrote about what makes NewsGrabber different from the other projects?
[23:53] What do you think?
[23:55] I passed it to the TV News guy.
[23:55] I just wanted something, since there's always this floating tension when an Archive Team project appears to overlap IA efforts.
[23:55] Nothing wrong with that Sketchcow
[23:56] SketchCow: I see
[23:56] I hope they're fine with the project and see the advanatages too
[23:56] advantages*, or what makes it different