[00:00] #archiveteam Archive Team: We're not archive.org [00:02] Well, looks like I knocked DiscussionApps offline. Oops. [00:07] yes i know, I just hope somebody in here (presumably interested in archiving) know something. [00:10] *** omglolbah has quit IRC (Ping timeout: 745 seconds) [00:11] *** chirlu` has quit IRC (Read error: Operation timed out) [00:12] *** chirlu` has joined #archiveteam-bs [00:12] *** tuluu has quit IRC (Ping timeout: 252 seconds) [00:13] *** tuluu has joined #archiveteam-bs [00:14] How soon before we become archive.org :) [00:17] *** omglolbah has joined #archiveteam-bs [01:03] *** systwi has joined #archiveteam-bs [01:07] It's been an hour since I stopped the DiscussionApps stuff, and it's still down. :-/ [01:08] there's a fine line between DDoS and DPoS [01:08] congrats for finding it [01:09] Heh [01:09] And it wasn't even distributed, just 25 connections from one IP. [01:43] die you qwarc it? [02:08] *** manjaro-u has quit IRC (Read error: Operation timed out) [03:42] *** DogsRNice has quit IRC (Read error: Connection reset by peer) [04:11] JAA was that liability link supposed to be 404? (As a joke, I assume) [04:12] *** kiska18 has quit IRC (Remote host closed the connection) [04:12] *** Ryz has quit IRC (Remote host closed the connection) [04:13] *** kiska18 has joined #archiveteam-bs [04:13] *** Fusl____ sets mode: +o kiska18 [04:13] *** Fusl sets mode: +o kiska18 [04:13] *** Fusl_ sets mode: +o kiska18 [04:13] *** Ryz has joined #archiveteam-bs [04:24] *** Fusl____ is now known as Fusl__ [04:25] *** ShellyRol has quit IRC (Read error: Connection reset by peer) [04:26] *** ShellyRol has joined #archiveteam-bs [04:43] *** odemgi_ has joined #archiveteam-bs [04:45] *** scorche has quit IRC (Read error: Operation timed out) [04:46] *** scorche has joined #archiveteam-bs [04:48] *** odemgi has quit IRC (Read error: Operation timed out) [04:51] *** qw3rty has joined #archiveteam-bs [04:54] *** odemg has quit IRC (Ping timeout: 745 seconds) [04:57] *** qw3rty2 has quit IRC (Ping timeout: 745 seconds) [04:58] *** odemg has joined #archiveteam-bs [05:07] *** Fusl has quit IRC (Excess Flood) [05:08] *** Fusl has joined #archiveteam-bs [05:08] *** Fusl__ sets mode: +o Fusl [05:08] *** Fusl_ sets mode: +o Fusl [05:08] *** Stiletto has quit IRC (Read error: Operation timed out) [05:16] *** Stiletto has joined #archiveteam-bs [05:38] lmao well [05:38] *** af10b3e5e has quit IRC (https://i.imgur.com/xacQ09F.mp4) [05:38] *** d5f4a3622 has joined #archiveteam-bs [05:38] so the cannes lions international festival of creativity has stuff preserved going back to only 2001 [05:39] except [05:39] How much is it to subscribe for one person? [05:39] Unfortunately we don’t offer single user subscriptions; the minimum subscription is for 5 users and costs €5,895 for 12 months. [05:49] *** cppchrisc has quit IRC (Ping timeout: 496 seconds) [07:07] *** asdf0101 has quit IRC (The Lounge - https://thelounge.chat) [07:07] *** markedL has quit IRC (Quit: The Lounge - https://thelounge.chat) [07:08] *** markedL has joined #archiveteam-bs [07:08] *** asdf0101 has joined #archiveteam-bs [07:23] *** Stiletto has quit IRC (Read error: Operation timed out) [07:26] *** Stiletto has joined #archiveteam-bs [07:54] *** Datechnom has quit IRC (Read error: Connection reset by peer) [07:59] *** Panda has joined #archiveteam-bs [08:42] JAA: Thanks. what are the stats for your weatherzone grab? [08:46] I also don't know what is going on with that AB job at almost 14m responses - still going through individual threads. [08:47] The /topic/pagenumber links don't seem redundant to me though. [09:55] *** BlueMax has quit IRC (Read error: Connection reset by peer) [10:03] *** kyledrake has quit IRC (Read error: Connection reset by peer) [10:03] *** kyledrake has joined #archiveteam-bs [10:27] *** Panda is now known as Datechnom [11:31] JAA I meant to do this ages ago it may be out of date now https://github.com/JustAnotherArchivist/snscrape/pull/53 [11:41] *** VerifiedJ has joined #archiveteam-bs [11:59] Fusl: Yup, that was qwarc. Everything was fine while I was scanning the "app IDs", but as soon as I started retrieving the actual contents, it collapsed. [11:59] phillipsj: That's the joke. [12:01] dxrt: 22400 threads or so. What keeps the AB job going is that post IDs are used to identify threads, and you can access a thread by the ID of any post inside it. The AB job is retrieving each thread as many times as there are posts in it, and there's no way to really stop that. [12:02] The WARC is 1.58 GiB, but note that I did not retrieve any images etc. [12:03] Dallas: Sweet, thanks, I'll test it later. [12:16] JAA: I need some labels to stick on all my harddrives that read: "Upon my death, please mail these harddrives to the care of Archive Team, PO BOX 22222, The Internet USA". Can you send me a PDF [12:25] is there anything on them you don't want others to find or saved forever? [12:27] more websites are switching to svg for company logos, can we allow .svg on the wiki ? [12:27] I'm not convinced that IA is interested in your midget porn collection, Raccoon. [12:34] :'( even on beta? [12:35] markedL: the wikipedia trend, i found, is to intentionally wonkify .svg protected marks, so they're not very crisp in print. [12:36] furry seems acceptable however [12:37] (so i steal them from company PDF reports instead) [12:37] you could make your own [12:41] one of my favorite marks is the MR OUCH signage warning logo. you can see an example of wikipedia's wonkification in their svg. [12:41] So DiscussionApps has recovered at some point in the last 12 hours. I didn't get banned either. On another note, apparently there are only about 540 existing forums on it. So most of what has been posted to that platform is already lost due to their deletion of inactive boards. :-/ [12:42] JAA: ever have luck contacting (any) site owners for permission to access deleted content? [12:46] Raccoon: Well, if it's actually deleted, not just blocked, good luck with that... [12:46] markedL: See the licensing section, and the edit history. they intentionally round all sharp corners so it only renders nicely at low resolution. https://en.wikipedia.org/wiki/File:Mr_Ouch.svg [12:47] And I have a feeling that they actually delete stuff at that platform. [12:47] Anyway, archival to continue later. [12:47] JAA: well, yeah. but i've learned that it's actually rather difficult to delete stuff if you own the hardware and aren't just renting it. then it's somebody else's job. [12:48] Raccoon: Depends on your definition of deletion, but 'rm' is pretty effective normally if there are disk writes happening. So not really difficult. [12:49] big companies will throw away encryption keys as deletion [12:49] Or DELETE FROM a database followed by a DB compaction. [12:49] JAA: then there's the CD, DVD, thumb drive, dropbox, aws backups [12:50] ¯\_(ツ)_/¯ Feel free to try. [12:55] JAA: is there an old time radio channel on efnet [13:12] *** Smiley has joined #archiveteam-bs [13:14] *** SmileyG has quit IRC (Read error: Operation timed out) [13:26] *** deevious has quit IRC (Quit: deevious) [13:31] *** sHATNER_ is now known as sHATNER [14:11] *** klg_ has quit IRC (Ping timeout: 252 seconds) [14:12] *** omglolbah has quit IRC (Read error: No route to host) [14:14] *** omglolbah has joined #archiveteam-bs [14:21] *** deevious has joined #archiveteam-bs [15:12] need a tracker pause in #gotshot / yourshot [15:25] *** betamax_ has joined #archiveteam-bs [15:28] *** tapedrive has left [15:28] *** betamax has quit IRC (Quit: leaving) [15:29] *** betamax_ is now known as betamax [15:31] *** tapedrive has joined #archiveteam-bs [16:16] *** omglolba- has joined #archiveteam-bs [16:16] *** omglolbah has quit IRC (Read error: Connection reset by peer) [17:45] Where are there so many snapshots here https://web.archive.org/web/*/oreilly.com ? [17:46] A lot of them are from archiveteam / archivebot, JAA? [17:46] *** second is now known as sec^nd [18:02] sec^nd: A lot? Have you seen how many captures of the Facebook homepage we did with AB? [18:02] https://web.archive.org/web/collections/*/facebook.com doesn't even load because there are so many. lol [18:02] But no idea why oreilly.com is grabbed often. [18:09] *** zhongfu_ has joined #archiveteam-bs [18:17] *** zhongfu has quit IRC (Ping timeout: 745 seconds) [18:55] does AT or IA receive a lot of death threats? [19:07] Fusl, can this run on mips at 4xURI's/second https://transfer.notkiska.pw/mzi48/avatars.url [19:59] *** killsushi has joined #archiveteam-bs [20:25] Google's gonna stop indexing Adobe Flash related content: https://webmasters.googleblog.com/2019/10/goodbye-flash.html [20:25] ...Which means I'll be retrieving back to Flashpoint to mine out tons and tons of Adobe Flash websites for 'em big time [20:48] My DiscussionApps archival is running quite smoothly now at a reduced concurrency and about 5k requests per minute. [21:06] so i found this today: https://www.facebook.com/photo.php?fbid=10211828680272463&set=a.4145890425591&type=3&theater [21:06] thats a rare picture of me [21:20] *** benjins has joined #archiveteam-bs [21:23] *** benjinsmi has quit IRC (Read error: Operation timed out) [21:26] good news about deadspin : https://archive.org/search.php?query=subject%3A%22deadspin.com%22&sort=-publicdate [21:26] i have everything from before 2016-11-30 [21:27] *** ShellyRol has quit IRC (Ping timeout: 252 seconds) [21:32] *** schbirid has quit IRC (Quit: Leaving) [21:32] *** ShellyRol has joined #archiveteam-bs [21:50] yourshot tracker can started at 5/min, or better would be someone lend me a target with 1.3TB free [22:13] *** BlueMax has joined #archiveteam-bs [23:08] *** wyatt8740 has joined #archiveteam-bs