[00:08] *** Deewiant has quit IRC (Ping timeout: 268 seconds)
[00:09] *** Deewiant has joined #archiveteam-bs
[01:14] *** ta9le has quit IRC (Quit: Connection closed for inactivity)
[03:25] *** betamax has joined #archiveteam-bs
[03:53] *** Sk1d has quit IRC (Read error: Operation timed out)
[03:57] *** Sk1d has joined #archiveteam-bs
[05:20] that's why i hate how google, yahoo and the like FORCE new accounts made with vpns/tor to add a phone number. i never "rawdog" the internet. if i want to make a google account without using my real ip i should be f-ing allowed to!
[05:21] i mean i get it, they are trying to weed out spammers, but come on! isn't recaptcha enough?
[05:21] and don't get me started about how much i hate recaptcha...
[06:08] *** Mateon1 has quit IRC (Read error: Operation timed out)
[06:08] *** Mateon1 has joined #archiveteam-bs
[06:40] *** Sk2d has joined #archiveteam-bs
[06:40] *** Sk1d has quit IRC (Read error: Operation timed out)
[06:40] *** Sk2d is now known as Sk1d
[07:23] *** midas1 has quit IRC (Read error: Operation timed out)
[07:23] *** svchfoo3 has quit IRC (Read error: Operation timed out)
[07:24] *** midas1 has joined #archiveteam-bs
[07:25] *** svchfoo3 has joined #archiveteam-bs
[07:25] *** svchfoo1 sets mode: +o svchfoo3
[09:34] *** SilSte has joined #archiveteam-bs
[09:38] *** Silvan has quit IRC (Ping timeout: 480 seconds)
[09:39] w0rmhole: I think the problem is it's still super easy to create tons of fake accounts that way, so abuse is not really minimized. I completely agree, but I also understand their desire.
[09:40] *** jut has joined #archiveteam-bs
[09:41] *** jut_ has joined #archiveteam-bs
[09:41] *** jut has quit IRC (Client Quit)
[09:43] *** jut_ is now known as jut
[09:47] *** wp494 has quit IRC (Ping timeout: 255 seconds)
[09:47] *** wp494 has joined #archiveteam-bs
[09:51] *** jut has quit IRC (Quit: WeeChat 1.4)
[09:52] *** jut has joined #archiveteam-bs
[10:11] *** Silvan has joined #archiveteam-bs
[10:16] *** SilSte has quit IRC (Ping timeout: 480 seconds)
[11:04] *** HCross has quit IRC (Remote host closed the connection)
[11:04] *** deathy has quit IRC (Remote host closed the connection)
[11:04] *** voltagex has quit IRC (Remote host closed the connection)
[11:04] *** davidar has quit IRC (Remote host closed the connection)
[11:05] *** davidar has joined #archiveteam-bs
[11:05] *** deathy has joined #archiveteam-bs
[11:07] *** voltagex has joined #archiveteam-bs
[11:07] *** HCross has joined #archiveteam-bs
[11:24] *** svchfoo3 has quit IRC (Read error: Operation timed out)
[11:25] *** ta9le has joined #archiveteam-bs
[11:27] *** svchfoo3 has joined #archiveteam-bs
[11:28] *** svchfoo1 sets mode: +o svchfoo3
[12:26] *** superkuh has quit IRC (Read error: Operation timed out)
[12:53] w0rmhole: this is why you make sdf.org email accounts :)
[13:08] *** superkuh has joined #archiveteam-bs
[13:14] *** superkuh has quit IRC (Read error: Connection reset by peer)
[13:21] *** superkuh has joined #archiveteam-bs
[14:13] *** superkuh has quit IRC (Quit: the neuronal action potential is an electrical manipulation of reversible abrupt phase changes in the lipid bilaye)
[14:29] *** superkuh has joined #archiveteam-bs
[14:48] *** BlueMax has quit IRC (Leaving)
[15:36] I'm about to delete /u/thejuze and /u/wubthecaptain on reddit. Can anyone throw those on ArchiveBot, please?
[16:09] JAA: Ping, sorry. Please?
[16:12] lindalap: Ok. I'll use old.reddit.com for this since the redesigned website requires further clicks to see all comments etc.
[16:13] Fine by me.
[16:13] I still browse on old.reddit.com myself, personally.
[16:13] Same. The new site is awful.
[16:20] lindalap: For the record, those jobs will grab your profile pages but not the threads.
[16:20] It's fine. Thank you.
[17:56] *** sep332 has quit IRC (Read error: Operation timed out)
[18:25] *** SmileyG has joined #archiveteam-bs
[18:27] *** Smiley has quit IRC (Ping timeout: 252 seconds)
[19:06] *** schbirid has joined #archiveteam-bs
[19:52] I came across NEPIS/NSCEP today. Not sure what NEPIS stands for, but NSCEP is the "National Service Center for Environmental Publications"; it seems that they changed the name sometime. It lives at https://nepis.epa.gov/ and https://www.epa.gov/nscep and is a collection of US Environmental Protection Agency documents (scientific/technical reports, guidelines, pamphlets, etc.). Based on a quick look,
[19:52] there are around 82k documents on there, going back decades. Although new documents are still being uploaded, the entire thing looks quite antiquated (especially the document display and PDF download pages). I checked IA and while there are a few items from perma.cc and a lot of URLs in the Wayback Machine, it doesn't look like it has been archived systematically. Would this be worth pursuing? If so,
[19:52] Wayback Machine-compatible or items for each document (or both)?
[19:53] *** m007a83 has quit IRC (Remote host closed the connection)
[19:53] JAA: both
[19:53] *** m007a83 has joined #archiveteam-bs
[19:56] *** beardicus has quit IRC (Read error: Operation timed out)
[19:57] *** vectr0n_ has joined #archiveteam-bs
[19:58] Ok, sounds like a "fun" project. I'll look into it.
[19:58] Yes
[19:58] Might be too small for a warrior project though
[19:59] *** nightpool has quit IRC (Read error: Operation timed out)
[19:59] *** nightpool has joined #archiveteam-bs
[19:59] Yeah, actually grabbing it should be fast. Figuring out how to grab it will likely take much longer.
[20:01] They allow you to download a pdf
[20:01] https://nepis.epa.gov/ allows *
[20:02] *** Odd0002_ has joined #archiveteam-bs
[20:02] Yeah, but when I tried one earlier, it showed some weird "generating your PDF"-type thing before it actually served the PDF. So that'll be one thing to figure out.
[20:03] *** balrog_ has joined #archiveteam-bs
[20:03] *** swebb sets mode: +o balrog_
[20:05] Did this kind of stuff before with some project that required waiting time
[20:05] Load https://nepis.epa.gov/EPA/html/DLwait.htm?url=/Exe/ZyPDF.cgi/200093EE.PDF?Dockey=200093EE.PDF
[20:05] wait some time
[20:05] Load https://nepis.epa.gov/Exe/ZyPDF.cgi/200093EE.PDF?Dockey=200093EE.PDF
[20:05] That´d how it was fixed back then
[20:05] That´s*
[20:05] Yeah
[20:05] *** vectr0n has quit IRC (Ping timeout: 600 seconds)
[20:05] *** vectr0n_ is now known as vectr0n
[20:08] arkiver: On a related note, what's the status on the guidelines.gov/National Guideline Clearinghouse documents? I saw https://archive.org/details/guidelinesgov, but that only has 384 items in it.
[20:08] *** Odd0002 has quit IRC (Ping timeout: 600 seconds)
[20:08] *** Odd0002_ is now known as Odd0002
[20:09] *** balrog has quit IRC (Ping timeout: 960 seconds)
[20:09] *** balrog_ is now known as balrog
[20:09] *** beardicus has joined #archiveteam-bs
[20:14] JAA: yeah, coming yp
[20:14] up*
[20:34] :-)
[21:11] *** schbirid has quit IRC (Quit: Leaving)
[21:26] *** BlueMax has joined #archiveteam-bs
[21:44] *** Stilett0 has quit IRC (Read error: Operation timed out)
[22:02] who has access to archiveteam@archiveteam.org ?
[22:02] SketchCow ^
[22:27] *** BlueMax has quit IRC (Leaving)
[22:52] What's going on with the Yuku project?
[23:00] Google Groups: "Gone within a year" (SketchCow, 2016-06-07). is this worth keeping on the wiki or is this an in memory thing?
[23:12] *** sep332 has joined #archiveteam-bs
[23:23] *** ta9le has quit IRC (Quit: Connection closed for inactivity)
[23:58] i was slow this month with uploading DTIC archive files
[23:58] i only did 24k items this past month: https://archive.org/details/@chris85?and[]=addeddate:2018-07
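
Editor's sketch of the two-step NEPIS download described in the 20:05 messages: load the DLwait page, wait, then load the ZyPDF URL. The URL shapes are copied from the two URLs quoted in the log. The function names, the idea that a document key such as 200093EE can be substituted into both URLs, and the 10-second default delay are assumptions for illustration only; the log says just "wait some time", and nothing here has been checked against the live site.

```python
import time
import urllib.request

# Base host taken from the URLs quoted in the log.
NEPIS_BASE = "https://nepis.epa.gov"


def pdf_path(dockey: str) -> str:
    """Path of the generated PDF, following the pattern quoted in the log."""
    return f"/Exe/ZyPDF.cgi/{dockey}.PDF?Dockey={dockey}.PDF"


def wait_page_url(dockey: str) -> str:
    """URL of the 'generating your PDF' wait page (first step in the log)."""
    return f"{NEPIS_BASE}/EPA/html/DLwait.htm?url={pdf_path(dockey)}"


def pdf_url(dockey: str) -> str:
    """URL of the PDF itself (second step in the log)."""
    return f"{NEPIS_BASE}{pdf_path(dockey)}"


def fetch_pdf(dockey: str, delay_seconds: float = 10.0) -> bytes:
    """Hypothetical helper: request the wait page, pause, then fetch the PDF.

    The delay is a guess; the log does not say how long is needed.
    """
    with urllib.request.urlopen(wait_page_url(dockey)) as response:
        response.read()  # body is discarded; only the request matters here
    time.sleep(delay_seconds)
    with urllib.request.urlopen(pdf_url(dockey)) as response:
        return response.read()
```

For the document key from the log, wait_page_url("200093EE") and pdf_url("200093EE") reproduce the two quoted URLs exactly. A real grab would more likely go through WARC-writing tooling than plain urllib, so this only illustrates the request order.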