[00:27] *** matthusb_ has joined #archiveteam
[00:28] *** matthusby has quit IRC (Read error: Operation timed out)
[00:31] *** matthusb_ has quit IRC (Read error: Operation timed out)
[00:59] *** matthusby has joined #archiveteam
[01:01] *** Soni has joined #archiveteam
[01:01] hi
[01:01] can we archive youtube annotations?
[01:01] it seems like they'll be deleted at some point soon (2019)
[01:01] the videos will remain ofc but the annotations will be gone
[01:01] we should archive them if we can
[01:04] *** matthusby has quit IRC (Remote host closed the connection)
[01:05] *** matthusby has joined #archiveteam
[01:06] *** Sk1d has quit IRC (Read error: Operation timed out)
[01:08] *** matthusby has quit IRC (Read error: Operation timed out)
[01:10] *** Sk1d has joined #archiveteam
[01:20] *** headacheb has joined #archiveteam
[01:22] *** hdch has quit IRC (Ping timeout: 265 seconds)
[01:24] *** Sk1d has quit IRC (Read error: Operation timed out)
[01:24] *** headacheb has quit IRC (Client Quit)
[01:27] *** Sk1d has joined #archiveteam
[01:33] *** matthusby has joined #archiveteam
[01:33] *** matthusby has quit IRC (Remote host closed the connection)
[01:39] *** bithippo has joined #archiveteam
[01:41] Sorry to bother, is there a channel devoted to grabbing Youtube annotations before they're deleted?
[01:41] *** Sk1d has quit IRC (Read error: Operation timed out)
[01:44] *** Sk1d has joined #archiveteam
[02:16] *** Burninate has joined #archiveteam
[02:17] *** bithippo has quit IRC (Read error: Connection reset by peer)
[02:17] *** pizzaiolo has quit IRC (Quit: pizzaiolo)
[02:17] Wondering if there's any plan in play for https://www.reddit.com/r/DataHoarder/comments/a0sus8/youtube_will_delete_existing_video_annotations_on/
[02:18] https://www.archiveteam.org/index.php?title=YouTube makes the job of archiving the _video_ out to be... intimidating... but the annotations are apparently a simple XML
[02:25] I don't really know how you'd effectively spider something that size though.
[02:26] I guess I'm not the first
[02:29] The annotations function in the videos I watch is something like 60% no-value-added, 15% quite helpful supplemental videos, 15% special effects / closed captioning, and 5% "Sorry ignore this part of the video I got it completely wrong and I found out a month later, but didn't want to re-upload"
[02:29] *** matthusby has joined #archiveteam
[02:29] or thereabouts
[02:30] Those last 5%, it may make the entire rest of the video superfluous or change its meaning
[02:31] Since on YT, you can't re-upload an edited video to the same URL / page.
[02:31] But you can re-annotate
[02:32] *** matthusby has quit IRC (Remote host closed the connection)
[02:32] *** matthusby has joined #archiveteam
[02:34] *** godane has quit IRC (Remote host closed the connection)
[02:35] *** godane has joined #archiveteam
[02:36] *** matthusby has quit IRC (Remote host closed the connection)
[02:37] *** matthusby has joined #archiveteam
[02:38] *** Sk1d has quit IRC (Read error: Operation timed out)
[02:43] *** Sk1d has joined #archiveteam
[02:57] *** Sk1d has quit IRC (Read error: Operation timed out)
[03:00] *** Sk1d has joined #archiveteam
[03:06] Burninate: i'm working on it independently
[03:06] we'll see how it goes
[03:33] *** adinbied has quit IRC (Ping timeout: 265 seconds)
[03:52] *** hdch has joined #archiveteam
[03:52] *** hook54321 has quit IRC (Quit: Connection closed for inactivity)
[04:13] *** qw3rty117 has joined #archiveteam
[04:15] *** alex___ has quit IRC (Ping timeout: 633 seconds)
[04:16] *** Despatche has joined #archiveteam
[04:18] *** qw3rty116 has quit IRC (Read error: Operation timed out)
[04:23] *** odemgi_ has joined #archiveteam
[04:25] *** odemgi has quit IRC (Read error: Operation timed out)
[04:25] *** Despatche has quit IRC (Read error: Connection reset by peer)
[04:26] *** odemg has quit IRC (Ping timeout: 265 seconds)
[04:26] *** Despatche has joined #archiveteam
[04:29] *** hook54321 has joined #archiveteam
[04:30] *** Despatche has quit IRC (Client Quit)
[04:31] *** matthusb_ has joined #archiveteam
[04:33] *** matthusby has quit IRC (Read error: Operation timed out)
[04:38] *** odemg has joined #archiveteam
[04:49] *** Martle has quit IRC (Quit: Leaving)
[04:53] *** Mateon1 has quit IRC (Ping timeout: 268 seconds)
[04:53] *** Mateon1 has joined #archiveteam
[04:56] *** nertzy has quit IRC (Quit: This computer has gone to sleep)
[05:03] *** DanielA has quit IRC (Ping timeout: 268 seconds)
[05:06] *** BlueMax has quit IRC (Quit: Leaving)
[05:08] *** BlueMax has joined #archiveteam
[05:14] *** Despatche has joined #archiveteam
[05:14] *** tjg1_ has joined #archiveteam
[05:15] *** tjg1 has quit IRC (west.us.hub irc.Prison.NET)
[05:15] *** wacky has quit IRC (west.us.hub irc.Prison.NET)
[05:15] *** moufu has quit IRC (west.us.hub irc.Prison.NET)
[05:15] *** dtm has quit IRC (west.us.hub irc.Prison.NET)
[05:15] *** achip has quit IRC (west.us.hub irc.Prison.NET)
[05:19] *** moufu_ has joined #archiveteam
[05:19] *** Sk1d has quit IRC (Read error: Operation timed out)
[05:20] *** wacky_ has joined #archiveteam
[05:22] *** Sk1d has joined #archiveteam
[05:39] *** DanielA has joined #archiveteam
[05:42] *** Pixi` has joined #archiveteam
[05:45] *** Pixi has quit IRC (Read error: Operation timed out)
[06:04] *** Pixi has joined #archiveteam
[06:08] *** Pixi` has quit IRC (Read error: Operation timed out)
[06:09] *** DanielA has quit IRC (Ping timeout: 268 seconds)
[06:13] *** dtm has joined #archiveteam
[06:17] *** achip has joined #archiveteam
[07:02] *** alex___ has joined #archiveteam
[07:11] *** DanielA has joined #archiveteam
[07:15] *** godane has quit IRC (Read error: Operation timed out)
[07:17] *** patrickod has joined #archiveteam
[07:17] *** wacky_ has quit IRC (ircd.choopa.net irc.mzima.net)
[07:17] *** tjg1_ has quit IRC (ircd.choopa.net irc.mzima.net)
[07:17] *** Dimtree has quit IRC (ircd.choopa.net irc.mzima.net)
[07:17] *** wp494 has quit IRC (ircd.choopa.net irc.mzima.net)
[07:17] *** Jogie has quit IRC (ircd.choopa.net irc.mzima.net)
[07:17] *** Xena has quit IRC (ircd.choopa.net irc.mzima.net)
[07:17] *** kisspunch has quit IRC (ircd.choopa.net irc.mzima.net)
[07:17] *** saper has quit IRC (ircd.choopa.net irc.mzima.net)
[07:17] *** znak_ has quit IRC (ircd.choopa.net irc.mzima.net)
[07:17] *** mistym- has quit IRC (ircd.choopa.net irc.mzima.net)
[07:17] *** patricko- has quit IRC (ircd.choopa.net irc.mzima.net)
[07:17] *** Zebranky_ has quit IRC (ircd.choopa.net irc.mzima.net)
[07:17] *** cf has quit IRC (ircd.choopa.net irc.mzima.net)
[07:17] *** Valentine has quit IRC (ircd.choopa.net irc.mzima.net)
[07:17] *** svchfoo3 has quit IRC (ircd.choopa.net irc.mzima.net)
[07:19] *** Zebranky has joined #archiveteam
[07:20] *** _Dimtree has joined #archiveteam
[07:23] *** DanielA has quit IRC (Ping timeout: 268 seconds)
[07:23] *** Xe has joined #archiveteam
[07:26] *** svchfoo3 has joined #archiveteam
[07:26] *** wacky_ has joined #archiveteam
[07:26] *** tjg1_ has joined #archiveteam
[07:26] *** wp494 has joined #archiveteam
[07:26] *** cf has joined #archiveteam
[07:26] *** Jogie has joined #archiveteam
[07:26] *** kisspunch has joined #archiveteam
[07:26] *** znak_ has joined #archiveteam
[07:26] *** mistym- has joined #archiveteam
[07:26] *** Valentine has joined #archiveteam
[07:26] *** irc.mzima.net sets mode: +o mistym-
[07:26] *** swebb sets mode: +o mistym-
[07:26] *** svchfoo1 sets mode: +o svchfoo3
[07:32] *** Sk1d has quit IRC (Read error: Operation timed out)
[07:32] *** _Dimtree is now known as Dimtree
[07:34] *** Sk1d has joined #archiveteam
[07:41] *** saper has joined #archiveteam
[07:46] *** hdch has quit IRC (Ping timeout: 265 seconds)
[07:52] *** Despatche has quit IRC (Quit: Error: Connection reset by peer)
[08:12] *** DanielA has joined #archiveteam
[08:13] *** hdch has joined #archiveteam
[08:18] *** pikhq has quit IRC (Ping timeout: 260 seconds)
[08:19] *** yuitimoth has joined #archiveteam
[08:20] *** DanielA has quit IRC (Quit: 76.8.60.160)
[08:27] *** yuitimoth has quit IRC (Ping timeout: 252 seconds)
[08:38] *** yuitimoth has joined #archiveteam
[09:13] *** svchfoo3 has quit IRC (Read error: Operation timed out)
[09:14] *** Sk1d has quit IRC (Read error: Operation timed out)
[09:15] *** svchfoo3 has joined #archiveteam
[09:15] *** svchfoo1 sets mode: +o svchfoo3
[09:18] *** Sk1d has joined #archiveteam
[09:42] *** Mikal_i2p has quit IRC (Read error: Operation timed out)
[09:48] *** Mikal_i2p has joined #archiveteam
[09:51] *** pikhq has joined #archiveteam
[10:27] *** BlueMax has quit IRC (Read error: Connection reset by peer)
[10:49] *** hdch has quit IRC (Quit: oops)
[11:04] *** brayden has quit IRC (Ping timeout: 260 seconds)
[11:19] *** brayden has joined #archiveteam
[11:19] *** swebb sets mode: +o brayden
[11:42] *** jtvjan has joined #archiveteam
[11:53] Burninate: you can get the annotations as an XML file by going to https://www.youtube.com/annotations_invideo?features=1&legacy=1&video_id=dQw4w9WgXcQ. I don't think there's a way to see if a video has annotations just from search results (at least not via the official API), so you'd have to check each video individually for annotations.
[13:21] *** vicarage has joined #archiveteam
[13:25] I've been running warrior for a couple of weeks, but the only active project seems to be URLteam 2, every other project I try always says there is nothing to be done. Is anything else happening that's more challenging than a few urls a minute?
[13:27] *** Sk1d has quit IRC (Read error: Operation timed out)
[13:29] *** Sk1d has joined #archiveteam
[13:37] vicarage: Not at the moment, no.
[13:41] If there is, will it become the preferred project, as the url one will take a decade if my maths is right
[13:43] *** Sk1d has quit IRC (Read error: Operation timed out)
[13:43] *** matthusb_ has quit IRC (Remote host closed the connection)
[13:46] *** Sk1d has joined #archiveteam
[14:01] vicarage: Yeah, when there's another project ongoing, we usually switch the default project (i.e. "ArchiveTeam's Choice") to that.
[14:02] Cheers, I'll leave it on default and not worry about it.
[14:03] *** matthusby has joined #archiveteam
[14:27] *** vicarage has quit IRC (Quit: Page closed)
[16:00] *** PurpleSym has quit IRC (Quit: *)
[16:00] *** PurpleSym has joined #archiveteam
[16:01] *** svchfoo1 sets mode: +o PurpleSym
[16:02] *** matthusb_ has joined #archiveteam
[16:02] *** matthusby has quit IRC (Read error: Connection reset by peer)
[16:06] *** Sk1d has quit IRC (Read error: Operation timed out)
[16:10] *** Sk1d has joined #archiveteam
[16:22] *** hook54321 has quit IRC (Quit: Connection closed for inactivity)
[16:32] *** jtvjan has quit IRC (ZNC - http://znc.in)
[16:33] *** jtvjan has joined #archiveteam
[17:04] *** icedice has joined #archiveteam
[17:08] Can someone who does Twitter grab Take a look at Harry Leslie Smith (@Harryslaststand): https://twitter.com/Harryslaststand please
[17:25] HCross: Scraping now.
[17:30] Thank you
[18:08] *** Martle has joined #archiveteam
[18:09] *** Valentine has quit IRC (Read error: Operation timed out)
[18:19] *** Sk1d has quit IRC (Read error: Operation timed out)
[18:20] *** Valentine has joined #archiveteam
[18:22] *** Sk1d has joined #archiveteam
[18:34] *** Sk1d has quit IRC (Read error: Operation timed out)
[18:39] *** Sk1d has joined #archiveteam
[19:16] ivan, true, fuckers are deleting annotations on January 15th too!!
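Editor's note on the annotation URL quoted at [11:53]: a minimal sketch of the per-video check that message describes, assuming the endpoint keeps the query parameters shown in the log. The function names are hypothetical, and the `annotation` element name is an assumption about the XML layout, which the log does not spell out. The sample XML is invented for illustration, and no network request is made.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Endpoint and query parameters copied from the URL quoted in the log.
BASE = "https://www.youtube.com/annotations_invideo"


def annotations_url(video_id: str) -> str:
    """Build the annotations URL for a single video ID."""
    query = urlencode({"features": 1, "legacy": 1, "video_id": video_id})
    return f"{BASE}?{query}"


def count_annotations(xml_text: str, tag: str = "annotation") -> int:
    """Count elements named `tag` in an annotations XML document.

    ASSUMPTION: each annotation is an <annotation> element; the log only
    says the data is "a simple XML", so adjust `tag` to the real layout.
    """
    root = ET.fromstring(xml_text)
    return sum(1 for _ in root.iter(tag))


if __name__ == "__main__":
    print(annotations_url("dQw4w9WgXcQ"))
    # Hypothetical sample document, not real YouTube output.
    sample = "<document><annotations><annotation id='a1'/></annotations></document>"
    print(count_annotations(sample))
```

Since the log says search results do not reveal whether a video has annotations, a crawler would have to build one such URL per video ID and treat a count of zero as "nothing to save".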
[19:24] *** Despatche has joined #archiveteam
[19:30] *** BlueMax has joined #archiveteam
[19:32] *** VerifiedJ has joined #archiveteam
[19:34] *** fireglow has joined #archiveteam
[19:34] Hello
[19:35] Can somebody tell me what the warcprox --rethink-services-url should look like?
[19:42] ah, seems like it's something like rethinkdb://localhost/brozzler/services
[19:44] I'm trying to get brozzler to run. This is all new and confusing. Is there software you recommend for self-hosted archiving of websites?
[20:01] *** godane has joined #archiveteam
[20:06] *** jonasbits has quit IRC (Remote host closed the connection)
[20:06] *** jonasbits has joined #archiveteam
[20:07] *** nertzy has joined #archiveteam
[20:11] *** Martle has quit IRC (Read error: Connection reset by peer)
[20:11] *** Martle has joined #archiveteam
[20:37] *** nertzy has quit IRC (Quit: This computer has gone to sleep)
[20:42] *** hdch has joined #archiveteam
[20:57] *** Mikal_i2p has quit IRC (Read error: Operation timed out)
[20:58] *** Mikal_i2p has joined #archiveteam
[21:19] *** Mikal_i2p has quit IRC (Ping timeout: 260 seconds)
[21:41] *** BlueMax has quit IRC (Read error: Connection reset by peer)
[21:44] *** Petri152 has quit IRC (Read error: Operation timed out)
[21:45] *** wp494 has quit IRC (Ping timeout: 506 seconds)
[21:45] *** Stilett0 has joined #archiveteam
[21:46] *** Stiletto has quit IRC (Ping timeout: 268 seconds)
[21:46] *** wp494 has joined #archiveteam
[21:47] *** Despatche has quit IRC (Read error: Operation timed out)
[22:00] *** Petri152 has joined #archiveteam
[22:00] *** Mikal_i2p has joined #archiveteam
[22:07] *** hdch has quit IRC (Remote host closed the connection)
[22:13] *** hdch has joined #archiveteam
[22:31] *** Mikal_i2p has quit IRC (Read error: Operation timed out)
[22:42] *** victorbje has joined #archiveteam
[23:27] *** BlueMax has joined #archiveteam
[23:36] *** Stiletto has joined #archiveteam
[23:38] *** Stilett0 has quit IRC (Read error: Operation timed out)
[23:39] *** Pixi has quit IRC (Quit: Pixi)
[23:40] *** Pixi has joined #archiveteam
[23:57] *** VerifiedJ has quit IRC (Quit: Leaving)
[23:59] *** ivan has quit IRC (Read error: Operation timed out)
[23:59] *** balrog has quit IRC (Read error: Operation timed out)
[23:59] *** ivan has joined #archiveteam
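Editor's note on the services URL guessed at [19:42]: a rough sketch that takes `rethinkdb://localhost/brozzler/services` apart with the standard library. Reading the two path segments as a database name and a table name is my assumption; the log gives only the URL's shape. The function name is hypothetical, and nothing here calls warcprox or brozzler.

```python
from urllib.parse import urlsplit


def split_services_url(url: str) -> dict:
    """Split a rethinkdb:// URL into host, database, and table parts.

    ASSUMPTION: the path is "/<database>/<table>", inferred from the
    example in the log rather than from any documentation.
    """
    parts = urlsplit(url)
    if parts.scheme != "rethinkdb":
        raise ValueError(f"expected a rethinkdb:// URL, got {url!r}")
    segments = [s for s in parts.path.split("/") if s]
    if len(segments) != 2:
        raise ValueError(f"expected two path segments, got {parts.path!r}")
    return {"host": parts.netloc, "db": segments[0], "table": segments[1]}


if __name__ == "__main__":
    print(split_services_url("rethinkdb://localhost/brozzler/services"))
```

A check like this only confirms that a value has the same shape as the example from the log; whether warcprox accepts it still has to be verified against the tool itself.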