[00:00] *** apache2 has quit IRC (Read error: Connection reset by peer) [00:00] *** DopefishJ has joined #archiveteam-bs [00:00] *** swebb sets mode: +o DopefishJ [00:00] *** apache2 has joined #archiveteam-bs [00:02] *** DFJustin has quit IRC (Ping timeout: 257 seconds) [00:04] *** t2t2 has quit IRC (Ping timeout: 259 seconds) [00:04] *** t2t2 has joined #archiveteam-bs [00:05] *** thejsa has quit IRC (Ping timeout: 260 seconds) [00:05] *** arkiver has quit IRC (Ping timeout: 260 seconds) [00:05] *** thejsa has joined #archiveteam-bs [00:07] *** arkiver has joined #archiveteam-bs [00:41] *** flashfire has quit IRC (Quit: http://www.mibbit.com ajax IRC Client) [01:02] *** flashfire has joined #archiveteam-bs [01:19] *** flashfure has joined #archiveteam-bs [01:19] *** flashfire has quit IRC (Quit: http://www.mibbit.com ajax IRC Client) [01:19] *** ta9le has quit IRC (Quit: Connection closed for inactivity) [01:28] *** BlueMax has quit IRC (Leaving) [01:48] *** arctic has joined #archiveteam-bs [01:51] *** BlueMax has joined #archiveteam-bs [01:55] *** chazchaz has quit IRC (Ping timeout: 360 seconds) [02:00] *** chazchaz has joined #archiveteam-bs [02:26] Who is project leader for #effteepee [03:03] SketchCow: i'm uploading a big 20gb+ file to FOS [03:03] so don't touch the captures for awhile [03:30] *** arctic has quit IRC (Read error: Connection reset by peer) [03:31] *** arctic has joined #archiveteam-bs [03:36] *** arctic has quit IRC (Ping timeout: 255 seconds) [03:36] *** arctic has joined #archiveteam-bs [03:47] *** odemg has quit IRC (Ping timeout: 268 seconds) [03:50] *** archodg has quit IRC (Read error: Operation timed out) [03:57] *** archodg has joined #archiveteam-bs [04:00] *** odemg has joined #archiveteam-bs [04:55] *** Lord_Nigh has quit IRC (Read error: Operation timed out) [04:59] *** Lord_Nigh has joined #archiveteam-bs [05:26] *** arctic has quit IRC (Remote host closed the connection) [05:28] *** Flashfire has joined #archiveteam-bs [05:32] *** achip has quit IRC (west.us.hub irc.Prison.NET) [05:52] godane: https://www.ebay.com/itm/362385907999 2009 VHS as part of a press kit for House of the Dead: Overkill [06:04] *** achip has joined #archiveteam-bs [06:05] *** Pixi` has quit IRC (Read error: Connection reset by peer) [06:05] *** m007a83 has quit IRC (Read error: Connection reset by peer) [06:06] *** balrog has quit IRC (Read error: Operation timed out) [06:06] *** Atom__ has joined #archiveteam-bs [06:06] *** balrog has joined #archiveteam-bs [06:06] *** swebb sets mode: +o balrog [06:06] *** Pixi has joined #archiveteam-bs [06:08] *** m007a83 has joined #archiveteam-bs [06:09] *** superkuh has quit IRC (Ping timeout: 268 seconds) [06:09] *** superkuh has joined #archiveteam-bs [06:09] *** Sue has quit IRC (Remote host closed the connection) [06:10] *** Sue has joined #archiveteam-bs [06:10] *** Atom-- has quit IRC (Read error: Operation timed out) [06:21] *** wp494 has quit IRC (Ping timeout: 492 seconds) [06:21] *** wp494 has joined #archiveteam-bs [07:25] *** Stilett0 has quit IRC (Read error: Operation timed out) [07:57] *** flashfloo has quit IRC (Quit: Connection closed for inactivity) [08:46] *** fie has joined #archiveteam-bs [09:33] *** ta9le has joined #archiveteam-bs [10:04] *** Flashfire has quit IRC (Quit: Bye) [10:11] *** Darkstar has quit IRC (Ping timeout: 260 seconds) [10:11] *** kiska3 has joined #archiveteam-bs [10:12] *** kiska3 has quit IRC (Client Quit) [10:14] *** Darkstar has joined #archiveteam-bs [11:03] *** Stilett0 has joined #archiveteam-bs [11:52] *** fie has quit IRC (Quit: Leaving) [13:29] so i think i'm getting repeat airings of MST3K [13:29] one was the phantom creeps [13:30] i already uploaded the 1994-07 version on comedy central here: [13:30] https://archive.org/details/MST3K_The_Phantom_Creeps_Comedy_Central_WOC_1994-07 [13:30] we are now get a 1995-08 or 1995-09 airing of it [13:43] so i also have 1995-09 airing of this: https://archive.org/details/MST3K_Wild_Rebels_Comedy_Central_WOC_1994-07 [13:43] now with bad tracking [13:48] 'Special bonus bad tracking!' [13:50] i'm just going to put the rest of the tape as is in one file [13:50] will have Wild Rebels with City Limits [13:51] wild rebels part of the tape has very bad tracking where city limits is very good tracking comparing to wild rebels [13:51] also city limits is in complete [14:21] It's in IA complete or it's incomplete? [14:39] its going be incomplete [14:39] city limits i mean [14:40] also know this tape was a bit fuzzy [14:40] nevermind that was the other tape i was editing [15:14] I'm travelling and at an Apple II conference all week, as well as at HOPE (Hackers on Planet Earth) this weekend. [15:16] as opposed to HOPM (Hackers on Planet Mars)? [15:16] and I'm about to try uploading a massive file to archive.org [15:26] I wish I could go to HOPE as well, have fun SketchCow [15:26] So I was going to look whether we missed anything on guideline.gov, but their servers are so slow and produce so many error pages right now that I can't get much done anyway. [15:26] (Search works fine, but accessing the actual content is slow/broken.) [15:54] *** REiN^ has quit IRC (Read error: Operation timed out) [16:14] *** BlueMax has quit IRC (Leaving) [16:41] *** REiN^ has joined #archiveteam-bs [16:43] *** schbirid has joined #archiveteam-bs [16:45] *** eientei95 has quit IRC (Quit: ZNC 1.6.5 - http://znc.in) [16:51] *** eientei95 has joined #archiveteam-bs [16:55] *** REiN^ has quit IRC (Read error: Operation timed out) [16:55] *** REiN^ has joined #archiveteam-bs [17:12] I noticed that guideline.gov offers PDF, XML, and DOC downloads of the guideline summaries, and these downloads work fine despite the issues I mentioned before. I'm grabbing these now. (The DOC files are actually broken, but I'll keep them anyway.) [17:28] *** ta9le has quit IRC (Quit: Connection closed for inactivity) [18:55] *** wp494 has quit IRC (hub.efnet.us irc.Prison.NET) [18:55] *** achip has quit IRC (hub.efnet.us irc.Prison.NET) [18:57] *** Mateon1 has quit IRC (Ping timeout: 260 seconds) [18:57] *** Mateon1 has joined #archiveteam-bs [19:12] *** wp494 has joined #archiveteam-bs [19:26] *** achip has joined #archiveteam-bs [20:41] *** jschwart has joined #archiveteam-bs [21:33] *** schbirid has quit IRC (Quit: Leaving) [21:43] So apparently guideline.gov is going down around now, redirecting to ahrq.gov instead. My grab of the downloads is still running and retrieving content successfully though. XMLs were completed a few hours ago already, PDFs are almost there (10k of 11.3k documents done). I'll probably skip the DOCs which are broken anyway. [21:51] *** jschwart has quit IRC (Konversation terminated!) [21:57] It seems that requests are hit and miss for the moment. [21:58] Powering through with retries is likely to hit the majority of it. [21:59] Probably depends on the location. My download grab's not encountering any errors (except 500s for what I assume are inexistent documents). [21:59] Maybe... but archivebot is successfully downloading about half of all pages. [21:59] The other half get a 503. [22:00] Yeah, pages were broken beyond being usable already this afternoon as mentioned above. [22:00] For the record, the URLs I'm grabbing are https://www.guideline.gov/summaries/downloadcontent/ngc-XXX?contentType=TTT where X goes from 1 to 12000 and TTT is "xml", "pdf", or "word". [22:01] They are, but I'm not sure whether they are *consistently* broken. [22:01] But then I don't really know what is going on with the archivebot at the moment. [22:01] IDs on the website go to the 50k range, but there appear to be huge gaps. Not entirely sure what the relation between the IDs is. [22:03] The ArchiveBot job is almost certainly only grabbing redirects to arhq.gov now. [22:03] ahrq.gov* [22:03] Also, fun fact: https://guideline.gov/ is serving an invalid certificate. [22:06] My PDF grab is getting "403 Site Disabled" now. [22:07] I think it did finish *just* in time. [22:08] Last actual PDF grabbed at 21:58:18 UTC, first 403 Site Disabled at 22:05:25 UTC. Phew. [22:08] Oh, you're right -- archivebot is just grabbing nonsense now. [22:10] I've stopped my grab for obvious reasons. Nothing more to grab there. [22:10] Should we regex away most of that archivebot job? [22:11] Yep [22:11] *** flashfloo has joined #archiveteam-bs [22:11] Just adding the entire search page to the ignore list should do the job. [22:34] *** vectr0n_ has joined #archiveteam-bs [22:34] *** arkiver has quit IRC (Read error: Operation timed out) [22:34] *** nightpool has quit IRC (Read error: Operation timed out) [22:34] *** dxrt has quit IRC (Write error: Broken pipe) [22:34] *** tyzoid has quit IRC (Write error: Broken pipe) [22:34] *** Lord_Nigh has quit IRC (Write error: Broken pipe) [22:34] *** beardicus has quit IRC (Read error: Operation timed out) [22:34] *** C4K3 has quit IRC (Read error: Operation timed out) [22:34] *** nightpool has joined #archiveteam-bs [22:34] *** kiska has quit IRC (Read error: Operation timed out) [22:34] *** vectr0n has quit IRC (Read error: Operation timed out) [22:34] *** sep332 has quit IRC (Read error: Operation timed out) [22:35] *** decay_ has quit IRC (Read error: Operation timed out) [22:35] *** vectr0n_ is now known as vectr0n [22:35] *** dxrt has joined #archiveteam-bs [22:35] *** twigfoot has quit IRC (Write error: Broken pipe) [22:35] *** Mayonaise has quit IRC (Write error: Broken pipe) [22:35] *** decay_ has joined #archiveteam-bs [22:35] *** twigfoot has joined #archiveteam-bs [22:36] *** Mayonaise has joined #archiveteam-bs [22:36] *** unlobito has quit IRC (Read error: Operation timed out) [22:36] *** unlobito has joined #archiveteam-bs [22:36] *** REiN^ has quit IRC (Read error: Operation timed out) [22:37] Hey, a reporter (who I actually like) asks when the announcement about the NGC shutting down first came up [22:37] Any citation on that [22:37] *** Kenshin has quit IRC (Read error: Operation timed out) [22:37] *** Kenshin has joined #archiveteam-bs [22:38] *** PotcFdk has quit IRC (Read error: Operation timed out) [22:38] *** arkiver has joined #archiveteam-bs [22:38] *** Lord_Nigh has joined #archiveteam-bs [22:39] *** Dimtree has quit IRC (Read error: Operation timed out) [22:45] Found it. May 14, 2018. [22:45] Was just about to write that, yep. https://web.archive.org/web/20180518192547/https://www.guideline.gov/home/announcements [22:49] *** kiska has joined #archiveteam-bs [22:50] *** beardicus has joined #archiveteam-bs [22:53] *** REiN^ has joined #archiveteam-bs [22:56] *** sep332 has joined #archiveteam-bs [22:58] *** C4K3 has joined #archiveteam-bs [22:58] *** PotcFdk has joined #archiveteam-bs [22:58] *** tyzoid has joined #archiveteam-bs [23:12] *** RichardG_ is now known as RichardG [23:16] *** Dimtree has joined #archiveteam-bs [23:36] so wikipedia has some thing way off [23:36] Did you want me to try and run up an archiveteam facebook account? [23:36] 09:34 <+flashfure> that would honestly be a good idea. We can get the cookies off of it so we can archive more facebook accounts if needed [23:36] 09:35 <+flashfure> JAA what do you think? [23:36] 09:36 <+flashfure> Make it so that all the operators know the password to grab cookies when they need [23:36] Sketchcow what do you think of this idea? [23:37] tmnt 2 page is off [23:38] i got a trailer of tmnt 2 secret of the ooze on one of my tapes and it's saying it came out march 22 [23:40] fixed it [23:40] i was changed today it looks like based on history [23:44] I think an account on facebook might be good, the more people that know about us the better. [23:44] We´d need frequent updates though [23:45] we already have https://twitter.com/archiveteam [23:45] but the last tweet on our twitter page is from 2016 [23:45] arkiver: This was more about archiving Facebook accounts that require a login. The one we'd like to archive currently is the one of Maria Butina. [23:45] heh [23:45] I also agree we need a page though [23:46] Which is a much more complex topic. We can't use it in ArchiveBot anyway, plus there's also the issue to keep the cookie out of the archives until they're no longer valid. [23:47] but still it would be nice if we could post more on our twitter page [23:47] but I think tweets coming from me would be very boring [23:48] didn`t we give some other people access to the twitter page some time ago? [23:48] Yeah, at least that was the plan. Through TweetDeck, to be precise.