[00:07] *** bzc6p_ has joined #archiveteam [00:07] *** swebb sets mode: +o bzc6p_ [00:11] *** bzc6p has quit IRC (Ping timeout: 615 seconds) [00:22] *** philpem has quit IRC (Ping timeout: 252 seconds) [00:34] *** JesseW has joined #archiveteam [00:42] *** JesseW has quit IRC (Read error: Operation timed out) [01:07] *** SiBurning has joined #archiveteam [01:10] Is arkiver around? Think we need to go slow with yuku [01:11] *** Atom__ has joined #archiveteam [01:25] *** JesseW has joined #archiveteam [01:45] *** JesseW has quit IRC (Read error: Operation timed out) [01:50] *** VADemon has quit IRC (left4dead) [02:01] *** primus104 has quit IRC (Leaving.) [02:06] *** godane has quit IRC (Ping timeout: 252 seconds) [02:29] *** zenguy_pc has quit IRC (Read error: Connection reset by peer) [02:32] *** xk_id_ has quit IRC (Remote host closed the connection) [02:36] *** godane has joined #archiveteam [02:40] *** xk_id has joined #archiveteam [02:42] *** MMovie1 has joined #archiveteam [02:42] *** xk_id has quit IRC (Read error: Connection reset by peer) [02:43] *** xk_id has joined #archiveteam [02:44] *** MMovie has quit IRC (Ping timeout: 310 seconds) [02:44] *** zenguy_pc has joined #archiveteam [02:48] *** zenguy_pc has quit IRC (Read error: Connection reset by peer) [03:03] *** xk_id has quit IRC (Remote host closed the connection) [03:04] *** zenguy_pc has joined #archiveteam [03:08] *** xk_id has joined #archiveteam [03:19] *** xk_id has quit IRC (Ping timeout: 615 seconds) [03:38] *** zenguy_pc has quit IRC (Read error: Connection reset by peer) [03:55] *** zenguy_pc has joined #archiveteam [03:56] *** zenguy_pc has quit IRC (Read error: Connection reset by peer) [03:56] *** xk_id has joined #archiveteam [04:13] *** zenguy_pc has joined #archiveteam [04:13] *** wyatt8740 has quit IRC (Remote host closed the connection) [04:15] *** wyatt8740 has joined #archiveteam [04:20] *** aaaaaaaaa has quit IRC (Leaving) [04:22] *** SiBurning is now known as stevieo [04:51] *** stevieo has quit IRC () [04:53] *** JesseW has joined #archiveteam [04:53] *** xk_id has quit IRC (Remote host closed the connection) [04:55] *** JesseW1 has joined #archiveteam [04:59] *** JesseW has quit IRC (Read error: Operation timed out) [05:02] *** zenguy_pc has quit IRC (Read error: Connection reset by peer) [05:11] *** JesseW1 has quit IRC (Read error: Operation timed out) [05:19] *** zenguy_pc has joined #archiveteam [05:22] *** JesseW has joined #archiveteam [05:31] *** xk_id has joined #archiveteam [06:18] *** JesseW has quit IRC (Read error: Operation timed out) [06:20] *** pokeball9 has quit IRC (Quit: Connection closed for inactivity) [06:33] *** bzc6p_ is now known as bzc6p [06:33] *** Dark_Star has quit IRC (Ping timeout: 606 seconds) [07:55] *** atomotic has joined #archiveteam [08:03] *** primus104 has joined #archiveteam [08:19] *** WinterFox has joined #archiveteam [08:33] *** Ungstein1 has joined #archiveteam [08:35] *** Ungstein has quit IRC (Ping timeout: 252 seconds) [08:53] *** primus104 has quit IRC (Leaving.) [09:18] *** pokeball9 has joined #archiveteam [09:21] *** zenguy_pc has quit IRC (Read error: Connection reset by peer) [09:29] *** swebb has quit IRC (Read error: Operation timed out) [09:29] *** atlogbot has quit IRC (Read error: Operation timed out) [09:30] *** dserodio has quit IRC (Read error: Operation timed out) [09:30] *** Laverne has quit IRC (Read error: Operation timed out) [09:31] *** dcmorton has quit IRC (Ping timeout: 369 seconds) [09:35] *** Ymgve__ has joined #archiveteam [09:35] *** Ymgve__ has quit IRC () [09:36] *** BlueMaxim has quit IRC (Read error: Operation timed out) [09:37] *** zenguy_pc has joined #archiveteam [09:37] *** Ymgve__ has joined #archiveteam [09:38] *** zenguy_pc has quit IRC (Read error: Connection reset by peer) [09:38] *** BlueMaxim has joined #archiveteam [09:40] *** BlueMaxim has quit IRC (Read error: Connection reset by peer) [09:41] *** Ymgve has quit IRC (Ping timeout: 506 seconds) [09:43] *** dserodio has joined #archiveteam [09:43] *** ohhdemgir has quit IRC (Read error: Operation timed out) [09:43] *** slyphic has quit IRC (Read error: Operation timed out) [09:43] *** ohhdemgir has joined #archiveteam [09:44] *** atlogbot has joined #archiveteam [09:44] *** Laverne has joined #archiveteam [09:45] *** dcmorton has joined #archiveteam [09:45] *** swebb has joined #archiveteam [09:48] *** slyphic has joined #archiveteam [09:55] *** zenguy_pc has joined #archiveteam [10:06] *** signius has quit IRC (Ping timeout: 310 seconds) [10:18] *** signius has joined #archiveteam [10:24] *** xk_id has quit IRC (Remote host closed the connection) [10:33] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) [10:33] *** Ungstein has joined #archiveteam [10:34] *** Ungstein1 has quit IRC (Ping timeout: 252 seconds) [10:44] *** nertzy has quit IRC (Ping timeout: 252 seconds) [11:09] *** primus104 has joined #archiveteam [11:26] *** atomotic has joined #archiveteam [11:40] *** pokeball9 has quit IRC (Quit: Connection closed for inactivity) [11:42] *** bzc6p_ has joined #archiveteam [11:46] *** pokeball9 has joined #archiveteam [11:48] *** bzc6p has quit IRC (Read error: Operation timed out) [11:57] *** W1nterFox has joined #archiveteam [12:02] *** WinterFox has quit IRC (Read error: Operation timed out) [12:19] *** W1nterFox has quit IRC (Remote host closed the connection) [12:20] *** jspiros has quit IRC (Ping timeout: 186 seconds) [12:20] *** jspiros has joined #archiveteam [12:26] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) [12:34] *** vitzli has joined #archiveteam [12:40] *** hamham has joined #archiveteam [12:40] *** hamham has quit IRC (Client Quit) [12:45] acchan: yw [13:19] *** scyther has joined #archiveteam [13:20] *** jtvjan has joined #archiveteam [13:22] Is there an archive for swfupload? [13:22] Link? [13:22] http://www.swfupload.com/files/41122Baraska.swf [13:24] Not that I can find [13:25] Ok, that's sad [13:25] I'm new here so maybe HCross or MrRadar could help better [13:25] Oh, it's not that bad [13:26] Thanks for trying! [13:26] There are some other swf archives [13:26] I actually only first came here in September for the blip.tv grab [13:26] How would I find them? [13:27] http://swfchan.net is a major one [13:28] I might want to throw that into the archivebot sometime today/tomorrow [13:29] Isn't that a giant site? [13:29] Idk,I would have to go though it [13:30] See about how big,shouldn't be more than 100k [13:31] Ok [13:33] I just asked it for a random guy on reddit who needed that file because it has voiceover of his friend that has passed away [13:33] Rip [13:34] thats archiving for you, shit be sad. seen it before and probably see it again [13:34] I'm not sure if that was supposed to be funny or serious [13:34] +will somewhere [13:34] that's serious. [13:34] I meant to say that to pokeball [13:37] Midasses adress is dat.serveert.me, that means That serves me in Dutch [13:38] *rejoin* [13:39] #archiveteam-bs please [13:39] *** primus104 has quit IRC (Leaving.) [13:39] *** jtvjan_ has joined #archiveteam [13:40] Back [13:40] jtvjan_: that's something for #archiveteam-bs ;) [13:40] Sorry [13:41] See you there [13:41] Ok, got a little script working to generate items [13:41] items for yuku [13:41] I need help getting a list of all forums hosted by yuku [13:42] *** terburg has joined #archiveteam [13:42] *** jtvjan has quit IRC (Ping timeout: 240 seconds) [13:42] *** jtvjan_ is now known as jtvjan [13:44] yuku forums will be split into 10 thread packs [13:47] *** jtvjan has left [13:51] *** philpem has joined #archiveteam [13:53] *** Start has quit IRC (Quit: Disconnected.) [14:02] arkiver: shout, what you need? [14:06] *** VADemon has joined #archiveteam [14:07] arkiver: http://www.yuku.com/popular [14:07] arkiver: this is a good start - votes are reset every day, it seems, so you'd want to scrape it every day [14:07] to get all forums with at least 1 vote [14:07] arkiver: ah, they have a sitemap: http://www.yuku.com/sitemap.xml [14:07] excellent [14:33] locations only from the sitemap if anyone wants it: http://paste.nerds.io/raw/qotusufefe [14:42] *** jspiros has quit IRC (Ping timeout: 186 seconds) [14:52] *** Start has joined #archiveteam [15:03] *** Froggypwn has quit IRC (Ping timeout: 268 seconds) [15:10] *** primus104 has joined #archiveteam [15:31] *** Dark_Star has joined #archiveteam [15:44] *** afics has quit IRC (Read error: Operation timed out) [15:56] *** primus104 has quit IRC (Leaving.) [16:02] *** Start has quit IRC (Quit: Disconnected.) [16:05] *** JesseW has joined #archiveteam [16:18] *** JesseW has quit IRC (Leaving.) [16:18] *** Start has joined #archiveteam [16:23] 3626 subdomains according to achip's file [16:43] *** primus104 has joined #archiveteam [16:44] *** nertzy has joined #archiveteam [17:01] *** scyther has quit IRC (Read error: Connection reset by peer) [17:06] *** terburg has quit IRC (Quit: terburg) [17:22] *** insane_al has joined #archiveteam [17:38] *** Start has quit IRC (Quit: Disconnected.) [17:41] *** vitzli has quit IRC (Quit: Leaving) [17:49] *** bithippo has joined #archiveteam [17:53] When uploading objects to the IA, is there a best place to specify the source URL if the upload is through the manual uploader? [17:53] I've been using the key "url" in the additional metadata section near the bottom [17:54] Not sure if that's the best way to be doing it though [17:54] (if I'm in the wrong place to ask about this, sorry!) [17:54] bithippo, #internetarchive [17:54] :hat tip: [17:54] *** bithippo has left [17:59] *** logan2 has joined #archiveteam [18:01] *** logan has quit IRC (Ping timeout: 258 seconds) [18:08] *** insane_al has quit IRC (Read error: Operation timed out) [18:11] *** primus104 has quit IRC (Leaving.) [18:30] *** aaaaaaaaa has joined #archiveteam [18:32] *** Atom-- has joined #archiveteam [18:40] *** Atom__ has quit IRC (Ping timeout: 506 seconds) [18:54] *** SimpBrain has joined #archiveteam [19:10] *** primus104 has joined #archiveteam [19:25] *** matthusby has quit IRC (Ping timeout: 252 seconds) [19:35] *** WinterFox has joined #archiveteam [19:45] *** Yiffiel_d has quit IRC (Read error: Connection reset by peer) [19:48] *** Start has joined #archiveteam [19:48] *** dan- has quit IRC (Ping timeout: 252 seconds) [19:49] *** Lord_Nigh has quit IRC (Ping timeout: 252 seconds) [19:54] *** Lord_Nigh has joined #archiveteam [19:55] joepie91: achip: thanks! [19:56] *** dan- has joined #archiveteam [20:01] Running the script now [20:02] Ignoring anything matching \...f$ currently, those will probably be done using archivebot [20:18] *** primus has joined #archiveteam [20:18] I guess you have already seen this: http://techcrunch.com/2015/10/23/youtube-red-creators/ [20:19] ESPN pulling its videos off of youtube [20:19] "Today, the majority of ESPN’s video content has been pulled off of YouTube in the US, as the sports network currently can’t participate in the YouTube Red service due to rights issues surrounding its content." [20:20] I guess it might be still available in Europe? I don't follow ESPN so I'm not sure how to check if it's still available here [20:23] primus: try https://www.youtube.com/channel/UC9upN-_pZg9_4kbo34T2c-A [20:24] On CenturyLink in the US that channel shows "no content" [20:24] ouch ... this channel has no content :-( [20:24] MrRadar, same here in the UK on Virgin Media [20:25] i'm in Slovenia ... ISP is Telekom Slovenia [20:25] Hold on, I'll ask if anyone knows another [20:26] https://www.youtube.com/channel/UCVSSpcmZD2PwPBqb8yKQKBA [20:26] Same, "no content" [20:26] https://www.youtube.com/channel/UCxFt75OIIvoN4AaL7lJxtTg [20:26] https://www.youtube.com/channel/UC401Z_s3vldTH1FNyjx4GGQ [20:26] https://www.youtube.com/user/ESPN/channels [20:26] The one ending in xtTg has content for me [20:26] X Games works [20:26] Germany, no content. I dont think youtube can HIDE videos per region [20:26] https://www.youtube.com/channel/UCIIKPy27YWW5yhc0qvr4KnA [20:27] Gameday has set videos to private [20:27] The one ending in 4GGQ has videos but they're all hidden [20:27] *** BlueMaxim has joined #archiveteam [20:27] OVH and Virgin media work on the xgames ones [20:27] https://www.youtube.com/channel/UCQZi2YXSxc6BSK4mZHql8ag [20:27] that is all anyone here knows [20:27] 4KnA has no content [20:27] l8ag ditto [20:28] yep, none has any content except GameDay, which has private videos [20:28] Should we throw X-Games at ArchiveBot just in case? [20:29] This looks like official channel: https://www.youtube.com/user/ESPN/featured [20:30] That looks like an old channel. The newest video is from 3 years ago [20:30] Good catch, i didn't look at that at all. [20:33] *** godane has left [20:33] *** godane has joined #archiveteam [20:34] according to a coworker, he watched stuff on the ESPN channel yesterday [20:35] Well, the article was posted an hour ago, so i think that's recent news [20:35] 11 of ESPN’s 13 channels are impacted by this issue, while only X-Games and Nacion ESPN are still live. [20:35] that's from article [20:35] right, I was responding to the "oldest is 3 years ago" [20:36] sorry [20:36] *** Start has quit IRC (Quit: Disconnected.) [20:36] its fine, probably their lawyers saying "the stuff from 3 years ago is still good, put it back up" [20:37] Taking a copy of https://www.youtube.com/playlist?list=PLGSIEmIEDaU8wW9hvNqT5EtTZbkQ6G9eB as I type, then will upload [20:37] It's so weird that they can distribute the content as long as it's only ad-supported but can't if it's paid from subscribers (and not to their particular channel but rather to YouTube in general). Sigh... [20:38] it should be a big youtube-dl job [20:39] Just out of curiosity, do you download highest resolution only or you download all available files of video? [20:40] I do just the highest quality. Getting all of the different qualities would take like 30x the bandwidth/disk space [20:40] How do you tell youtube-dl where to spit the files? [20:41] You need to include the output path in the output file name template: [20:42] https://github.com/rg3/youtube-dl/blob/master/README.md#filesystem-options [20:42] Or just cd to a directory and run youtube-dl within it [20:42] so something like youtube-dl.exe -o "E:\YouTube DL\Best Of XGames\%(title)s.%(ext)s" 'https://www.youtube.com/playlist?list=PLGSIEmIEDaU8wW9hvNqT5EtTZbkQ6G9eB' [20:42] HCross, youtube-dl strips all non-ASCII chars by default [20:43] yeah, its all coming in nicely now [20:45] HCross: the AT Wiki has more options you might want to specify: http://archiveteam.org/index.php?title=Youtube#Recommended_way_to_archive_Youtube_videos [20:46] That saves more stuff like subtitles, annotations, and the metadata (description, etc.) [20:47] youtube-dl.exe: error: using output template conflicts with using title, video I [20:47] D or auto number [20:48] I guess remove the --title parameter [20:48] Since you are using an explicit template [20:59] 493 files on the way to the IA [21:21] *** bzc6p__ has joined #archiveteam [21:29] *** bzc6p_ has quit IRC (Read error: Operation timed out) [21:47] *** bzc6p_ has joined #archiveteam [21:52] *** xk_id has joined #archiveteam [21:53] *** bzc6p__ has quit IRC (Read error: Operation timed out) [22:03] *** goekesmi has quit IRC (Remote host closed the connection) [22:05] *** goekesmi has joined #archiveteam [22:17] *** Start has joined #archiveteam [22:33] *** bzc6p__ has joined #archiveteam [22:38] *** signius has quit IRC (Remote host closed the connection) [22:38] *** bzc6p_ has quit IRC (Read error: Operation timed out) [22:48] *** Ymgve__ is now known as Ymgve [22:59] *** bzc6p_ has joined #archiveteam [23:02] *** Start has quit IRC (Read error: Connection reset by peer) [23:06] *** Start has joined #archiveteam [23:06] *** bzc6p__ has quit IRC (Read error: Operation timed out) [23:59] *** Start_ has joined #archiveteam [23:59] *** Start has quit IRC (Read error: Connection reset by peer)