[00:17] STILL uploading from the FTP ingestion point [00:45] *** BlueMaxim has joined #archiveteam-bs [00:54] *** Coderjoe has quit IRC (Read error: Connection reset by peer) [00:57] *** DoomTay has joined #archiveteam-bs [01:10] *** username1 has joined #archiveteam-bs [01:13] *** schbirid2 has quit IRC (Read error: Operation timed out) [01:16] *** Coderjoe has joined #archiveteam-bs [01:59] *** kristian_ has joined #archiveteam-bs [02:27] STILL [02:27] STILL [02:27] But way better than it used to be [02:27] good [02:28] It's still going, but I've jammed my queue solid. [02:40] A new feature in the ia uploader python is that with a setting, if it tries to upload and it's there, it just skips it. [02:40] And deletes the double. [02:40] That's so quick. [02:40] Going to use the filename VA-Greg Street Presents - The Vault-2015... [02:40] Title: VA-Greg Street Presents - The Vault-2015 [02:40] Item Name: VA-Greg_Street_Presents_-_The_Vault-2015 [02:40] VA-Greg_Street_Presents_-_The_Vault-2015: [02:40] 01 Greg Street Feat. Young Jeezy - Run The Check Up.mp3 already exists, skipping. [02:40] 02 Scotty Atl - Str8 Drop.mp3 already exists, skipping. [02:40] 03 Rich Homie Quan Feat. Problem - No Way.mp3 already exists, skipping. [02:40] 04 Rico Love - Skit.mp3 already exists, skipping. [02:40] 05 Rico Love Feat. Rocko - On Ten.mp3 already exists, skipping. [02:40] 06 Yo Gotti - War Ready.mp3 already exists, skipping. [02:40] 07 Greg Street Feat. London Wilson - Wonderful Place-The Fun [02:40] etc [02:40] That sounds handy [02:57] various podcasters of the world, why don't you all have a complete rss feed?! the last 10 or 20 is not good enough! i've seen a full rss feed, it's super easy to grab with basically any pod-catcher.having to use jdownloader to grab your back archives sucks!!! [02:58] also extratorent is dyning?! abnother one?! [02:59] I think part of the reason is just how BIG it would be [02:59] I mean, imagine what would happen if you subscribe to a feed with 100+ items for the first time [03:52] *** RichardG has quit IRC (Quit: Keyboard not found, press F1 to continue) [04:07] *** i0npulse has quit IRC (Quit: leaving) [04:20] *** DoomTay has quit IRC (Quit: Page closed) [04:27] *** Sk1d has quit IRC (Ping timeout: 250 seconds) [04:32] *** VADemon has quit IRC (Quit: left4dead) [04:34] *** Sk1d has joined #archiveteam-bs [04:52] *** i0npulse has joined #archiveteam-bs [05:13] *** tomwsmf has quit IRC (Read error: Operation timed out) [05:13] *** kristian_ has quit IRC (Leaving) [05:54] *** Start has quit IRC (Quit: Disconnected.) [05:55] *** Start has joined #archiveteam-bs [06:22] *** RichardG has joined #archiveteam-bs [06:43] *** RichardG has quit IRC (Ping timeout: 501 seconds) [06:47] *** RichardG has joined #archiveteam-bs [06:50] *** dashcloud has quit IRC (Ping timeout: 260 seconds) [06:52] *** dashcloud has joined #archiveteam-bs [07:27] *** Honno has joined #archiveteam-bs [07:48] *** Dyskette has quit IRC (Ping timeout: 260 seconds) [08:13] *** dashcloud has quit IRC (Read error: Operation timed out) [08:16] *** dashcloud has joined #archiveteam-bs [08:18] *** alfie has quit IRC (Ping timeout: 260 seconds) [08:37] *** alfie has joined #archiveteam-bs [08:50] *** dan- has quit IRC (Ping timeout: 260 seconds) [08:53] *** dashcloud has quit IRC (Ping timeout: 260 seconds) [08:59] *** dashcloud has joined #archiveteam-bs [09:07] *** dan- has joined #archiveteam-bs [09:46] looks like cinemageddon is down [11:45] *** Medowar has quit IRC (Read error: Connection reset by peer) [11:54] *** brayden__ has quit IRC (Read error: Connection reset by peer) [11:55] *** brayden has joined #archiveteam-bs [11:55] *** swebb sets mode: +o brayden [12:15] *** Marcelo has joined #archiveteam-bs [12:38] *** logchfoo3 starts logging #archiveteam-bs at Sat Aug 13 12:38:27 2016 [12:38] *** logchfoo3 has joined #archiveteam-bs [12:39] *** closure has quit IRC (Read error: Operation timed out) [12:39] *** dashcloud has quit IRC (Read error: Operation timed out) [12:40] *** dashcloud has joined #archiveteam-bs [12:59] *** Marcelo has quit IRC (Quit: http://chat.efnet.org ) [13:07] *** beardicus has joined #archiveteam-bs [13:07] *** beardicus has quit IRC (Remote host closed the connection) [13:20] *** jspiros has quit IRC (Read error: Operation timed out) [13:21] *** jspiros has joined #archiveteam-bs [13:39] *** dashcloud has quit IRC (Read error: Operation timed out) [13:43] *** dashcloud has joined #archiveteam-bs [13:46] *** mutoso has quit IRC (Quit: Lost terminal) [14:04] *** beardicus has joined #archiveteam-bs [14:04] *** beardicus has quit IRC (Remote host closed the connection) [14:17] *** dashcloud has quit IRC (Ping timeout: 260 seconds) [14:17] *** dashcloud has joined #archiveteam-bs [14:23] *** beardicus has joined #archiveteam-bs [14:31] *** RichardG has quit IRC (Read error: Operation timed out) [14:40] *** RichardG has joined #archiveteam-bs [15:06] *** DoomTay has joined #archiveteam-bs [15:22] *** BlueMaxim has quit IRC (Quit: Leaving) [15:35] *** fusl has quit IRC (Read error: Connection reset by peer) [15:39] *** Baljem has quit IRC (Ping timeout: 250 seconds) [15:40] *** dashcloud has quit IRC (Read error: Operation timed out) [15:46] *** Baljem has joined #archiveteam-bs [15:46] *** dashcloud has joined #archiveteam-bs [15:50] *** Swizzle has joined #archiveteam-bs [16:01] *** dashcloud has quit IRC (Read error: Connection reset by peer) [16:03] *** dashcloud has joined #archiveteam-bs [16:12] *** JesseW has joined #archiveteam-bs [16:15] *** DopefishJ has joined #archiveteam-bs [16:15] *** swebb sets mode: +o DopefishJ [16:17] *** DFJustin has quit IRC (Ping timeout: 260 seconds) [16:20] *** Whopper_ has quit IRC (Ping timeout: 260 seconds) [16:25] *** Whopper has joined #archiveteam-bs [16:26] *** JesseW has quit IRC (Ping timeout: 370 seconds) [16:41] *** fusl has joined #archiveteam-bs [16:44] *** DFJustin has joined #archiveteam-bs [16:44] *** swebb sets mode: +o DFJustin [16:46] *** DopefishJ has quit IRC (Ping timeout: 268 seconds) [16:47] *** DoomTay has quit IRC (Quit: Page closed) [16:49] *** Kaz has quit IRC (Ping timeout: 285 seconds) [16:52] *** dan- has quit IRC (Ping timeout: 255 seconds) [16:52] *** alfie has quit IRC (Ping timeout: 255 seconds) [16:52] *** winr4r has quit IRC (Ping timeout: 245 seconds) [16:52] *** alfie has joined #archiveteam-bs [16:54] *** winr4r has joined #archiveteam-bs [16:54] *** Famicoman has quit IRC (Ping timeout: 240 seconds) [16:59] *** _desu___ has quit IRC (Read error: Connection reset by peer) [17:00] *** Kazzy has joined #archiveteam-bs [17:02] *** Rye has quit IRC (Quit: ZNC - http://znc.in) [17:03] *** twrist has joined #archiveteam-bs [17:04] *** dan- has joined #archiveteam-bs [17:06] *** Igloo_ has joined #archiveteam-bs [17:07] *** coretx_ has joined #archiveteam-bs [17:08] *** Rye has joined #archiveteam-bs [17:09] *** sigkell has quit IRC (Ping timeout: 350 seconds) [17:09] *** Famicoma1 has quit IRC (Remote host closed the connection) [17:09] *** ItsYoda has quit IRC (Write error: Connection reset by peer) [17:09] *** GLaDOS has quit IRC (Write error: Connection reset by peer) [17:09] *** twrist is now known as GLaDOS [17:09] *** winr4r has quit IRC (hub.se efnet.port80.se) [17:09] *** DFJustin has quit IRC (hub.se efnet.port80.se) [17:09] *** closure_ has quit IRC (hub.se efnet.port80.se) [17:09] *** davidar has quit IRC (hub.se efnet.port80.se) [17:09] *** BartoCH has quit IRC (hub.se efnet.port80.se) [17:09] *** godane has quit IRC (hub.se efnet.port80.se) [17:09] *** Meroje has quit IRC (hub.se efnet.port80.se) [17:09] *** Atluxity has quit IRC (hub.se efnet.port80.se) [17:09] *** Rickster has quit IRC (hub.se efnet.port80.se) [17:09] *** Jeroen52 has quit IRC (hub.se efnet.port80.se) [17:09] *** FalconK has quit IRC (hub.se efnet.port80.se) [17:09] *** Ctrl-S___ has quit IRC (hub.se efnet.port80.se) [17:09] *** Boltsie has quit IRC (hub.se efnet.port80.se) [17:09] *** deathy has quit IRC (hub.se efnet.port80.se) [17:09] *** zhongfu has quit IRC (hub.se efnet.port80.se) [17:09] *** Sanqui has quit IRC (hub.se efnet.port80.se) [17:09] *** johtso has quit IRC (hub.se efnet.port80.se) [17:09] *** HCross2 has quit IRC (hub.se efnet.port80.se) [17:09] *** JSharp___ has quit IRC (hub.se efnet.port80.se) [17:09] *** r3c0d3x has quit IRC (hub.se efnet.port80.se) [17:09] *** yipdw has quit IRC (hub.se efnet.port80.se) [17:09] *** Igloo has quit IRC (hub.se efnet.port80.se) [17:09] *** coretx has quit IRC (hub.se efnet.port80.se) [17:09] *** SN4T14 has quit IRC (hub.se efnet.port80.se) [17:09] *** Muad-Dib has quit IRC (hub.se efnet.port80.se) [17:09] *** lesderid has quit IRC (hub.se efnet.port80.se) [17:10] *** dashcloud has quit IRC (Read error: Operation timed out) [17:11] *** winr5r has joined #archiveteam-bs [17:12] *** yipdw_ has joined #archiveteam-bs [17:14] *** dashcloud has joined #archiveteam-bs [17:26] *** closure has joined #archiveteam-bs [17:29] cinemageddon... https://www.reddit.com/r/trackers/comments/49d9iy/is_your_tracker_down_ask_here_instead_of_making_a/d6d8ghw [17:31] *** dashcloud has quit IRC (Read error: Operation timed out) [17:33] *** godane has joined #archiveteam-bs [17:45] Awww [17:45] The best part is how it's the worst of all possible worlds. [17:46] *** dashcloud has joined #archiveteam-bs [17:52] *** dashcloud has quit IRC (Read error: Operation timed out) [17:56] *** dashcloud has joined #archiveteam-bs [18:08] *** dashcloud has quit IRC (Read error: Operation timed out) [18:12] *** Famicoman has joined #archiveteam-bs [18:18] *** dashcloud has joined #archiveteam-bs [18:19] does "wget --retry-connrefused" retry on 502 errors? [18:28] *** JesseW has joined #archiveteam-bs [18:37] *** zenguy has quit IRC (Read error: Operation timed out) [18:40] *** zenguy has joined #archiveteam-bs [18:44] *** Medowar has joined #archiveteam-bs [18:45] *** dashcloud has quit IRC (Read error: Operation timed out) [18:47] *** dashcloud has joined #archiveteam-bs [18:50] *** bsmith093 has quit IRC (Ping timeout: 244 seconds) [19:08] *** DoomTay has joined #archiveteam-bs [19:20] Down to the last 231gb in the FTP hopper. [19:20] I ended up deleting a few "what the fuck man" [19:20] Like... an entire download of Invader Zim [19:21] Are we seriously worried about Invader Zim? [19:21] Compare to a rare cut of Dog Day Afternoon from an internal tape. [19:21] *** dashcloud has quit IRC (Read error: Operation timed out) [19:24] *** dashcloud has joined #archiveteam-bs [19:52] *** DoomTay has quit IRC (Quit: Page closed) [20:06] *** DoomTay has joined #archiveteam-bs [20:06] *** DoomTay has quit IRC (Client Quit) [20:07] legacy.com 's slogan is ironic [20:07] "Where Life Stories Live On" [20:07] *** DoomTay has joined #archiveteam-bs [20:08] That's odd. When U try and join multiple channels at the same time, I get the message "# Cannot join channel (+b)", but when I join channels individually, there's no issue [20:09] what channels? [20:09] #archiveteam, #archiveteam-bs, #archovebot [20:10] Sorry, #archiveteam, #archiveteam-bs, #archivebot [20:16] anyway, guest books and obituaries on legacy.com expire about a month after they are put up, and then you have to pay to view either of them and allow people to post in the guestbook for longer. It's a monopoly. [20:21] hook54321: that sounds like something caling out for a script similar to youtube-dl... [20:21] hook54321: Why is it a monopoly, though? [20:22] I mean, they aren't writing any of the obituaries, they just partner with newspapers, who I assume get money from the partnerships. [20:22] *** Coderjoe has quit IRC (Read error: Operation timed out) [20:27] JesseW: what do you mean similar to youtube-dl? [20:31] a custom scraper, with an easy way to update it when the site changes to make it harder [20:32] hook54321: not writing the obits doesn't make them a monopoly or not... [20:32] eh [20:33] couldn't we just make something that uses the sitemap and only archives certain URLs based on the date? [20:33] *date last updated [20:34] *** Coderjoe has joined #archiveteam-bs [20:35] until they remove the sitemap [20:35] and/or block the IPs [20:35] And sometimes the sitemap may sit untouched for years [20:35] I'm looking at one such case right now [20:35] the benefits of a youtube-dl -like solution is that individuals can get copies of the guestbooks they care about [20:37] *** RichardG has quit IRC (Ping timeout: 370 seconds) [20:42] Lots of the people doing genealogy stuff are elderly though. [20:43] creating a wiki account [20:43] WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD [20:44] actually, wait, don't answer that. [20:44] Part of me wants to give a joke answer [20:45] yahoosucks [20:46] http://archive.fart.website/bin/irclogger_log_search/archiveteam?search=what+forsooth&action=search [20:47] hook54321: yes, that's the easier way [20:47] for those who know the logs exist [20:48] they could also google it [20:48] SketchCow: i mostly grab rare stuff :-D [20:48] at least for the FOS [20:49] now all original broadcast of Invader Zim WOC maybe something [20:49] I think he said earlier that that was kinda pointless [20:50] cause it would have the original ads in it [20:50] i know [20:50] Hmm...I wonder if that was conveyed in metadata or something? [20:52] but yeah, I'm pretty sure he axed it [20:53] *** RichardG has joined #archiveteam-bs [21:02] *** JesseW1 has joined #archiveteam-bs [21:05] *** JesseW has quit IRC (Ping timeout: 370 seconds) [21:10] I really like how beyond some point (AVX, maybe) plugging in x86 instruction mnenomics into Google yields no introductory-level pages [21:10] e.g. https://www.google.com/search?q=vcvtsi2ssq [21:11] i know cvtsi2ss is "convert doubleword integer to single-precision floating point" and I guess the "q" is "quad" for something but I have no idea what the v is [21:14] oh, I guess the v is the vector extensions prefix [21:19] *** JesseW1 has quit IRC (Ping timeout: 370 seconds) [21:32] *** dashcloud has quit IRC (Read error: Operation timed out) [21:35] *** dashcloud has joined #archiveteam-bs [21:38] how feasible would it be to automatically archive guestbooks a little bit before they expire? [21:40] creepy [21:42] creepy? it's information written mostly about people who have passed away, written by people who are still alive... [21:43] i thought you meant random guestbooks, sorry [21:43] doing that regularly is a sure way to get blocked i guess [21:43] would not be great to have IA robots.txted [21:43] We could change the useragent, would prevent them from figuring it out, probably for awhile. [21:44] I kinda did mean random, although it isn't technically random if we are basing it off of the time it was posted. [21:47] *** DoomTay has quit IRC (Quit: Page closed) [22:23] *** DoomTay has joined #archiveteam-bs [22:23] *** DoomTay has left [22:23] *** DoomTay has joined #archiveteam-bs [22:23] *** DoomTay has left [22:23] *** DoomTay has joined #archiveteam-bs [22:23] *** Kazzy is now known as Kaz [22:32] *** dashcloud has quit IRC (Read error: Operation timed out) [22:33] *** dashcloud has joined #archiveteam-bs [22:38] *** Stiletto has quit IRC (Read error: Connection reset by peer) [22:53] *** tomwsmf has joined #archiveteam-bs [23:01] *** JesseW has joined #archiveteam-bs [23:37] *** Swizzle has quit IRC (Quit: Leaving) [23:54] * JesseW is reading through the logs [23:54] DoomTay: back on the 9th, you said; "I kinda feel like writing articles on "too late" site death situations even though that would only be useful to time travellers" [23:55] Ya [23:55] I'm not sure what you meant, but adding already dead sites to the Deathwatch page seems quite useful to me [23:56] Yeah "already dead" was exactly what I was going for [23:56] and if you find more info about them than fits in a short entry, making a page also seems fine [23:56] \already dead and too late to crawl [23:56] it's useful to historians, too [23:56] I added cia.vc recently