[00:02] https://twitter.com/joepie91/status/877676538019409926 -- "Gathering weak npm credentials -- how poor passwords and auth mechanisms put 52% of the Node.js ecosystem at risk https://github.com/ChALkeR/notes/blob/master/Gathering-weak-npm-credentials.md" [00:02] (since we don't have a titlebot here) [00:04] amazing [00:04] :p [00:04] From the community that brought you left-pad... [00:05] MrRadar: nope, from the developer community [00:05] yes, them [00:05] :p [00:05] MrRadar: I'm almost certain that you could run these numbers for basically every package registry that doesn't either outsource auth or centralize packaging, and end up with the same result [00:05] because the other registries *also* don't seem to have any measures against this [00:06] if it weren't for github being reasonably okay at finding leaked credentials and preventing bruteforcing etc., you could probably do this on github too [00:06] but nobody seems to have run the numbers for any of the other registries yet :) [00:06] Yeah, I was just taking a cheap shot :P [00:06] actually, I think somebody might have done this for CPAN at some point [00:06] but I'm not sure [00:10] *** BlueMaxim has joined #archiveteam-bs [00:14] *** antomatic has joined #archiveteam-bs [00:14] *** swebb sets mode: +o antomatic [00:24] MrRadar: anyway, be wary with the cheap shots like that, because it can very easily be interpreted seriously by people, and lead to people going "oh it's just Node being dumb" (ie. missing the point and leaving all the non-Node shit insecure) [00:24] I regularly speak to people who would interpret your statement totally seriously :) [00:26] Yeah, understood [00:28] thank you - i was poking around there [00:32] *** dan- has quit IRC (Ping timeout: 260 seconds) [00:37] arkiver: i paused all the other warriors, and the imzy ones are just happily exchanging http requests [00:37] i can't see why the script isn't doing anything [00:53] ah ha... [00:54] any idea why i'd be getting: "I give up... process wgetdownload returned exit code -6 [00:54] *** luckcolor has joined #archiveteam-bs [01:15] FYI: imgur has started serving up HTML + JS pages for direct image links [01:15] which for some reason requires JS and loads React, to show a fucking image [01:16] this means that any kind of referer-less archiving is very likely to fail if imgur does not recognize your client as a headless client [01:16] (also, what the fuck imgur?) [01:27] it's my opinion that theres an emerging evil-pattern to purposely have sites crap the bed w/o javascript to force you to enable it so they can get their analytics [01:27] *** dan- has joined #archiveteam-bs [01:28] w/o javascript, shopify will literally load the content of the website (as in, you can read the website in view-sources://) but the page renders as a blank white page [01:28] someone should write a plugin to decrapify this bullshit [01:31] joepie91: : direct links work for me [01:31] ex http://i.imgur.com/Nj0znc8.png [01:32] Same here, though I was seeing some very strange redirects when I was doing a bunch of imgur archiving last eek [01:32] It was redirecting from i.imgur.com/whatever.png to URLs along the lines of i.imgur.com/original/w/h/a/whatever ... which redirected to themselves [01:34] are you sure they were direct image links [01:34] i just downloaded that image using wget and it worked fine [01:34] http://i.imgur.com/aR6mdR1.png [01:34] Yes, or at least they should have been [01:34] And it wasn't consistent for me [01:34] Sometimes the direct links would work as expected, sometimes it would redirect [01:35] It only affected < 1 % of the grab, so I just re-ran the ones that got the redirect [01:38] so do you know a better free imagehost [01:39] most of the other ones i know of delete images after so long [01:45] honestly, they all turn to crap in time... [01:46] seems like the best thing atm is to use S3/B2 or spin up a VPS [01:46] and I get why those are not perfect replacements [01:50] *** bitBaron has quit IRC (Quit: My computer has gone to sleep. ZZZzzz…) [01:51] ndiddy: some people are reporting HTML pages, others are reporting direct links still work [01:51] ndiddy: but imgur is detecting headless clients and referers, so you can't reproduce with wget [01:51] yeah I started noticing it last week I think [01:51] they're definitely throwing up HTML+JS for *some* clients though [01:51] even right click, save as on an image from firefox produces an html file sometimes [01:51] joepie91: just set your referer to some random site [01:52] ndiddy: overriding referer globally will break a bunch of stuff :) [01:52] and that doesn't change that creators of archival tools need to be aware of this [02:03] there is no good free imagehost. it's a hopeless venture and they all fall eventually [02:03] I've long since switched to self-hosting images that I share [02:04] all free image hosts to date have either shut down, become unusable, or are in the process of doing either of those [02:10] https://i.imgflip.com/1rasb4.jpg [02:26] ha [02:26] where's that from? :P [02:33] I was slightly bored and had a few minutes [02:36] nice [02:37] *** thuban2 has joined #archiveteam-bs [02:55] *** Crusher has joined #archiveteam-bs [02:55] *** Crusher_ has quit IRC (Read error: Connection reset by peer) [02:55] Has anyone else noticed that twitter recently blocked access by non-logged-in users to the /with_replies pane; e.g. https://twitter.com/textfiles/with_replies [02:56] That's quite a big change, and I was surprised to not be able to find *any* mention of it anywhere. [02:56] It does? [02:57] I clicked on it an... Oh wait, I'm logged in... [02:58] kisspunch: regarding #internetarchive.bak -- probably good to discuss your idea of a Windows client in there, too. [03:09] Somebody2: I thought I saw that today, but I wasn't sure, so I checked, and it's definitely happening now [03:11] i've just realized it's faster for me to install ubuntu using the network installer than by disk... [03:12] *** crusher2 is now known as Crusher2 [03:34] *** th1x has quit IRC (Ping timeout: 633 seconds) [03:36] *** th1x has joined #archiveteam-bs [04:24] *** Crusher2 has quit IRC (Ping timeout: 268 seconds) [04:43] *** Sk1d has quit IRC (Ping timeout: 250 seconds) [04:50] *** Sk1d has joined #archiveteam-bs [07:04] *** schbirid has joined #archiveteam-bs [07:27] *** Jonison has joined #archiveteam-bs [07:59] *** SHODAN_UI has joined #archiveteam-bs [09:08] *** j08nY has joined #archiveteam-bs [11:41] *** lucysun has quit IRC (Quit: Page closed) [12:03] *** icedice has joined #archiveteam-bs [12:05] *** BlueMaxim has quit IRC (Read error: Operation timed out) [12:07] *** BlueMaxim has joined #archiveteam-bs [12:11] *** BlueMaxim has quit IRC (Client Quit) [12:38] *** Crusher has quit IRC (Ping timeout: 492 seconds) [12:42] arkiver: weffey had to kill API access because the DB was getting overloaded [12:47] *** user0815 has joined #archiveteam-bs [12:57] *** qw3rty has joined #archiveteam-bs [13:09] *** Hiccup has joined #archiveteam-bs [13:15] *** pizzaiolo has joined #archiveteam-bs [13:31] Hi. How would I go about adding downloads to a WARC file? I used https://webrecorder.io to record the non-download stuff, but there doesn't seem to be a way to add downloads. [13:35] *** SilSte has quit IRC (Quit: http://quassel-irc.org - Chat comfortably. Anywhere.) [13:40] *** SilSte has joined #archiveteam-bs [13:44] arkiver: Maybe try halving the rate on Imyz? [13:46] The current rate is hundreds of concurrent sessions and the DB connection is getting overloaded. [14:03] *** Odd0002 has joined #archiveteam-bs [14:05] *** Hiccup has left [14:06] *** icedice has quit IRC (Read error: Connection reset by peer) [14:08] Kaz: ^^^ [14:08] (In case you're around) [14:22] *** thuban3 has joined #archiveteam-bs [14:25] *** thuban2 has quit IRC (Read error: Operation timed out) [14:51] *** qw3rty has left [14:55] Hello [14:56] tracker limit 200 -> 50 [15:22] *** RichardG has quit IRC (Read error: Connection reset by peer) [15:22] *** qw3rty has joined #archiveteam-bs [15:23] *** RichardG has joined #archiveteam-bs [15:28] Ten days and 1.6 million URLs later, my Tilt grab is finally done. :-) [15:32] The website grab, that is. I'll set up the API grab later. [15:42] *** Crusher has joined #archiveteam-bs [16:05] *** Odd0002 has quit IRC (Remote host closed the connection) [16:17] *** kristian_ has joined #archiveteam-bs [16:23] *** SketchCo1 is now known as SketchCow [16:24] Is that who I think it is? [16:25] depends on who you think who they are [16:25] *Jason Scott's name echoes through the irc* [16:30] Ahem.. silliness aside, that is him if I'm not mistaken, right? [16:31] Yes [16:32] mhmm [16:33] *** pizzaiolo has quit IRC (Remote host closed the connection) [16:36] ? [16:40] I must thank you for your excellent talks, I found them online and that's what inspired me to help in whatever way I can [16:40] So thank you. [16:45] SketchCow: is my hero [16:45] I hope to get to see one of his talks, or even better meet him at a vendor booth or something :) [16:46] *** nyany has quit IRC (Ping timeout: 506 seconds) [16:47] I love talking to bombastic people, its great fun. I am working on my old BBS Archives, most of them are Japanese [16:47] lol: https://twitter.com/mike_hasarms/status/877779717935333377 [16:49] *** greycat has joined #archiveteam-bs [16:51] Hi there! I wanted to ask if anyone from Archive Team knows of Kaitai Struct project and if it's possible to discuss potential collaboration between our projects? [16:51] We've launched http://formats.kaitai.io/ recently, which is a site for formal specifications of various file formats [16:52] I guess it should be of interest to archival teams, as it provides a machine-readable, non-ambiguous specs which can be visualized as a diagram, [16:53] or interpreted with a given file to highlight file structure as a tree, [16:53] *** kristian_ has quit IRC (Read error: Operation timed out) [16:53] or compiled into a ready-made parser for that file format in many programming languages [16:54] hm, neato [16:55] I thought of whether we can interlink our projects i.e. http://fileformats.archiveteam.org/wiki/ICO <=> http://formats.kaitai.io/ico/ [16:57] i'm 100% in favor of this [16:58] *** pizzaiolo has joined #archiveteam-bs [16:58] I thought of adding `meta/xref` key into Kaitai Struct format [16:59] that could be used to add links to other projects into any .ksy file [17:00] i.e. ico.ksy would include something like [17:00] meta: [17:00] xref: [17:00] archiveteam: http://fileformats.archiveteam.org/wiki/ICO [17:00] would "archiveteam" be OK name to reference archiveteam wiki with, or would you prefer something else? [17:02] it's not really the archive team wiki it's just hosted on the same domain [17:02] *** brayden has quit IRC (Read error: Connection reset by peer) [17:02] *** brayden has joined #archiveteam-bs [17:02] *** swebb sets mode: +o brayden [17:04] yeah, it's a separate wiki [17:04] DFJustin: Ok, then how should I address it and who can I contact about it? [17:04] *** SHODAN_UI has quit IRC (Remote host closed the connection) [17:04] using the url is probably the best way to talk about it [17:06] *** bitBaron has joined #archiveteam-bs [17:06] for example, I wonder if it would be acceptable to modify FormatInfo template there to add a link to .ksy spec at formats.kaitai.io, if it exists? [17:06] It already includes other interlinks, such as `pronom={{PRONOM|fmt/978}}` [17:06] i approve [17:07] Shall we wait for some quorum / more opinions, or could I try to add it right away? [17:08] i wouldn't expect to see dissent about this. go ahead and add it if you feel comfortable :) [17:11] *** kristian_ has joined #archiveteam-bs [17:18] greycat: ohai, somebody pointed me at kaitai a while ago, quite an interesting project :) [17:18] greycat: one remark that's not really archiveteam-related: the format documentation is not very googleable [17:19] Ok, here we go: http://fileformats.archiveteam.org/wiki/ICO [17:19] googling for eg. datatypes (u1 and whatnot) yielded no useful results [17:19] What do you think about such a link? [17:19] joepie91: What could be done to improve that? [17:20] joepie91: Techincally, user guide fully describes that - http://doc.kaitai.io/user_guide.html#_basic_data_types [17:20] greycat: hmm. I think I'd call it "Kaitai Spec" instead of "Formal Spec" -- a lot of people will probably interpret "Formal" as "Official" (even if that's not what it means) [17:20] greycat: well, I'm not sure why it's not googleable in the first place [17:20] haven't looked into it too deeply [17:20] just registered it in the back of my mind :) [17:21] greycat: and yeah, I'm aware that the documentation exists, it just wasn't easy to find :p [17:22] oh bummer... [17:22] I wonder how many people encounter these problems... [17:22] I really quite like the project though [17:22] considering giving it a shot in terms of specifying the OpenTTD wire protocol [17:23] I have a hand-written implementation in JS right now, but it'd be nicer to have a proper declarative spec [17:23] about "Kaitai Spec" - technically, binary parsing project is called "Kaitai Struct", "Kaitai" is the name of team [17:23] right, "Kaitai Struct Spec" then :p [17:23] we have other projects beside "Kaitai Struct" ;) [17:24] "Kaitai Struct Spec" would be probably too long and it will stretch the box... Would be it alright? May be "Declarative Spec" then? [17:24] greycat: kaitai.io yields kaitai struct though :) [17:24] Or it will be misleading as well? [17:24] I wouldn't worry about stretching the box too much [17:24] and you can leave out the .ksy if you call it 'kaitai struct' [17:25] main goal is to have accurate information on the site, aesthetics comes second :P [17:26] Ok, then I'll make it "Kaitai Struct Spec" [17:27] greycat: also, fwiw, archiveteam is very much structured as "if you want to get something done, do it" so I wouldn't worry too much about doing something that somebody doesn't like - if they don't like it, they'll tell you and/or fix it, but there's no formal structure on who decides what [17:27] http://fileformats.archiveteam.org/wiki/ICO <= ok, here we go [17:28] well, at the very least, "Kaitai Struct" is a very new project compared to massive amount of data already gathered by archiveteam [17:28] so I thought it would be polite to ask :) [17:28] looks good :) [17:28] and yeah, no worries [17:31] ok, next step: link from formats.kaitai.io to Just Solve Format Problem wiki %) [17:31] how shall I name it? I've already got it that just "archiveteam" is a bad idea... [17:32] I saw imzy was at 50 items/min [17:32] not sure if I set that [17:32] did someone set it to that here? [17:32] greycat: "File Format wiki" is a fairly commonly used shorthand description, I think [17:32] arkiver: thoughts on a short name for the fileformats wiki? [17:32] arkiver: Kaz reduced it down from 200 earlier [17:32] why? [17:33] They said we were overloading their DB server at 200 [17:33] joepie91: ffwiki ? :P what do we need it for? [17:33] 08:44 < timmc> arkiver: Maybe try halving the rate on Imyz? [17:33] 08:46 < timmc> The current rate is hundreds of concurrent sessions and the DB connection is getting overloaded. [17:33] I can skip just a few more URLs, but not much more [17:33] for imzy [17:33] I'll do that [17:33] joepie91: "File Format wiki" it's *very* ambiguous :( [17:34] arkiver : any idea why an imzy worker would spit a wget -6 error? [17:34] no [17:34] do you have a log? [17:34] greycat: "just solve the problem" "justsolve" ? [17:34] technically, it's called "Just Solve the File Format Problem"... [17:34] Not on me, what's the best way to get one from a worker VM? [17:35] may be "jstffp"? or "justsolve" might be just fine and non-ambiguous... [17:35] arkiver: a reference from another file format spec site to the wiki [17:35] wget -6 is an authentication problem btw [17:35] arkiver: per format [17:38] greycat: idk if it's ambiguous, are there any other file format wikis? :P [17:41] i feel like there probably are ... [17:44] SketchCow: is there an "archival standard" for ripping vhs tapes? [17:52] *** bitBaron has quit IRC (Read error: Connection reset by peer) [17:52] arkiver : what's the best way to get a log out of the VM? [17:53] I remember what the error said [17:53] *** bitBaron has joined #archiveteam-bs [17:55] *** kristian_ has quit IRC (Read error: Operation timed out) [18:02] joepie91: There is http://fileformats.wikia.com, http://wiki.osdev.org/, http://www.fileformat.info/format/, etc, etc [18:03] joepie91: http://wiki.xentax.com/ is file format wiki mostly on game-related file formats [18:03] There are many of them [18:08] Archival Standard is either MPEG2, or a lossless AVI if you're an idiot [18:09] *** Crusher_ has joined #archiveteam-bs [18:09] *** Crusher has quit IRC (Bye) [18:29] There's better codecs, but MPEG-2 is "good enough" and you can be pretty sure it'll still be supported in future decades. [18:29] Makes sense. [18:33] Hmm. Something made my processor decide to clock down to 190 MHz, and I have no idea why... [18:33] >_> [18:34] Hold that thought, I just got a bluescreen printing line by line [18:35] What kind of a machine do you have? I don't think I even *could* clock down my processor to 190 MHz if I tried. [18:35] ♫ Tonight we're gonna party like it's 1999! ♫ [18:36] Slightly old mobile chips can do that, IIRC. [18:37] It's a Pentium G3258 [18:37] It *was* going happily at 4.2 GHz [18:38] What's "slightly old", pikhq? [18:38] LOL I think the Intel fan just HCF'd [18:39] It posted, so I probably didn't bake the poor chip [18:39] Chips these days are pretty good at thermal throttling before they release the magic smoke [18:40] IIRC the last mainstream desktop CPU that didn't have thermal protection was the AthlonXP [18:40] As seen in this classic video: https://www.youtube.com/watch?v=YYQSHXNFvUk [18:40] JAA: Haswell, apparently. :) [18:41] I know everyone was hanging on the edge of their seats on this one, but [18:41] https://archive.org/details/sega_sms_library?&sort=publicdate&page=1 [18:42] Well, I just found out Haswell will do whatever it takes to not die. [18:42] I have fixed all the screenshots so they're actually screenshots, and I'm now changing the right items to the Japanese model and re-rendering screenshots because not only does the SMS not boot, sometimes it WILL boot in the wrong region and show a default game. [18:42] Turns out a molex got tangled in the blade [18:43] Crusher_: Seems a good feature for a CPU to have. [18:43] Ot [18:43] It's only down to the Gs with the re-render, so there's a pile of the default game shots, now being redone. [18:44] pikhq : but then Intel doesn't get to charge me for a new one :P [18:45] *** SHODAN_UI has joined #archiveteam-bs [18:52] Well, it's back up to 4.2 GHz with no complaints, though I don't even want to think about what that frying electronics smell was [18:53] 42°C idle, Good I didn't bake the thermal paste out of existence [18:58] that was the smell of the "inject random errors" fuse melting open [18:59] Lol [19:01] I'm surprised how long it operated without the fan [19:01] It was down to convection cooling, there's no case fan circulating where the CPU is [19:02] Such is the downside of squeezing a full ATX motherboard in a cardboard box [19:04] http://imgur.com/a/DWjPu [19:06] this answers the question of the molex flopping around randomly near the fan [19:07] :-| [19:07] 1 star - shoebox contained GPU, not shoes [19:08] This is like the grown-up version of my home server, which is a raspberry pi in a cardboard box. [19:08] Lol. My pi-webserver is actually sitting about a foot away [19:09] *** nyany has joined #archiveteam-bs [19:09] It was acting as a webserver for pushing files to people and for grade 11 compsci [19:10] It was like the 90s again, a whole class of people with no stylistic taste making really cringy HTML pages [19:10] takes me back [19:11] timmc: Still better than that shoebox from Reddit though. 2/5 from my side. [19:11] Link? [19:11] *** th1x has quit IRC (Quit: Leaving) [19:11] I'd rather not. [19:11] haha [19:12] oh [19:12] *that* one [19:12] no link thank you [19:12] Yep, that one :-D [19:12] When it's closed, it has an additional fan taking in air to the 570 through that hole in the front [19:12] yeah, no link needed, that image is burned in [19:12] It's not pretty, but it works* [19:12] *so long as the molex doesn't flop into the fan [19:13] One of these days I'm going to take apart an old laptop and nail all the parts to a plywood board while keeping them connected. Maybe I can even make it look nice. [19:13] Lol [19:14] vivisection, but for Acers [19:14] In case anyone was worried about stability, this is NOT the machine I'm using for scraping :P [19:14] i had an bitcoin (well scrypt coin) mining setup with a motherboard and psu in the open and 4 GPUs hanging from a grill-like setup .. :) [19:14] "no, this is the one that hosts example.com" [19:15] "that one? yeah, thats just 8.8.8.8" [19:17] "The cloud." [19:22] SketchCow: i meant more time base correction and what sort of vcr should be used to play/digitize it [19:30] *** Crusher has joined #archiveteam-bs [19:30] *** Crusher_ has quit IRC (Read error: Connection reset by peer) [19:32] *** Crusher_ has joined #archiveteam-bs [19:32] *** Crusher has quit IRC (Read error: Connection reset by peer) [19:32] *** Crusher has joined #archiveteam-bs [19:32] *** Crusher_ has quit IRC (Read error: Connection reset by peer) [19:40] *** JAA has quit IRC (leaving) [19:41] *** JAA has joined #archiveteam-bs [20:26] hey sorry I havent been around, but wanted to remind folks that Imzy will be going entirely offline tonight, probably around midnight EST. Not sure if folks here got stuff they wanted to off the site. [20:28] We're only a third of the way [20:29] I'm updating the scripts to skip more URLs, hopefully we'll go faster then [20:29] Is there a non warrior script for imzy? [20:30] Seeing as the warrior hates me [20:30] You just run the script directly [20:31] There are instructions in the github repository's readme [20:31] It's on git? Ok. [20:31] https://github.com/archiveteam/imzy-grab [20:38] greenie: for me posts are returning 404 [20:38] or well, 301 and then redirected to the not found page [20:39] timmc ^ [20:40] hm yeah I'm not getting loads to happen on web (too lazy to open my console to see whats up). I'm no longer staff but will ping them and see whats up. There might not be much they can do at this point, I'm not sure. [20:41] I'm getting an error during get-wget-lua of Lua not found [20:41] I installed Lua 5.1 from repo but it's still getting stuck on it [20:45] Crusher: liblua5.1-dev [20:46] Thanks, installing [20:48] this is basically in the same boat as pop-unders (that is, load the real page in the popup, switch the main page to an add) in terms of filth, except that I'm less sure it's genuinely on purpose [20:48] (re: JS needed to load) [20:58] #$&$ Now it won't even tell me why it can't compile [21:00] *** C4K3_ has quit IRC (Read error: Operation timed out) [21:00] *** C4K3 has joined #archiveteam-bs [21:05] Ah ha [21:05] It's failing on all [21:06] all-am* [21:07] Specifically: "cat: css.c: no such file or directory" [21:10] Crusher: Based on my notes, here's how I installed it: packages needed: python3 gcc libssl-dev lua5.1 liblua5.1-dev make autoconf m4 flex [21:10] python3 get-pip.py --user [21:10] .local/bin/pip install --upgrade --user seesaw [21:10] I edited get-wget-lua.sh to force it to use OpenSSL. [21:11] This was on Debian Jessie. [21:12] My get-wget-lua.sh edit was CONFIGURE_SSL_OPT="--with-ssl=openssl" above the WGET_DOWNLOAD_URL line. [21:12] Kk [21:16] YESSS! Got it. Thanks JAA [21:17] :-) [21:18] *** thuban3 has quit IRC (Read error: Operation timed out) [21:19] Hmm. Any chance we could raise the rate limiter? [21:20] If I understood the message earlier correctly, it's now or never [21:20] Crusher: Last night the DB got overloaded, with hundreds of concurrent sessions. [21:20] *** thuban3 has joined #archiveteam-bs [21:21] So... running out of time, but there's only so much headroom in the app too. [21:21] Right now it looks like nobody is getting anything through... [21:22] The tracker is getting dusty at this point xD [21:23] Wait, does that mean nobody is requesting stuff right now? [21:23] or that people are, but are getting errors [21:23] I'm just getting placarded with rate limiting messages [21:24] And I haven't seen the tracker update in awhile now [21:25] Their site is still functioning (quickly) if anyone's Wondering [21:27] I don't have a single thread active [21:30] Maybe therate limiter is at 0. :-) [21:30] arkiver: ^ [21:30] hi [21:30] yes [21:30] posts were dead last time I checked [21:30] all 404 [21:30] ah [21:30] not dead anymore [21:31] I was about to say [21:31] restarted [21:31] I'll make an update to the scripts soon [21:31] I see a webpage full of cat posts xD [21:31] so be prepared to upgrade [21:31] Define upgrade? [21:31] *** SHODAN_UI has quit IRC (Remote host closed the connection) [21:31] No wait, it's working now [21:34] *** Crusher_ has joined #archiveteam-bs [21:35] Yeah, but he's adding more ignore patterns, I think. [21:35] yeah [21:36] *** Crusher has quit IRC (Read error: Connection reset by peer) [21:36] What's the error rate like at this point? [21:36] *** Crusher has joined #archiveteam-bs [21:36] *** Crusher_ has quit IRC (Read error: Connection reset by peer) [21:38] http://www.bbc.com/news/technology-40326544 [21:38] :) [21:39] Woot woot! [21:47] *** Crusher_ has joined #archiveteam-bs [21:49] Nice. [21:50] *** Crusher has quit IRC (Ping timeout: 492 seconds) [21:58] *** Crusher_ has quit IRC (Bye) [21:58] *** Crusher has joined #archiveteam-bs [22:16] *** kristian_ has joined #archiveteam-bs [22:23] Does anybody know of https://anarchivism.org/ ? [22:30] *** Jonison has quit IRC (Read error: Connection reset by peer) [22:36] *** schbirid has quit IRC (Quit: Leaving) [22:37] arkiver: Do you think imzy will finish in time at this rate? [22:37] no [22:38] but I'm trying to get the rate up [22:38] What do you think the rate would need to be? [22:38] hard to say [22:38] (assuming you don't filter out any more URLs) [22:38] a big problem is that many items seem to fail [22:38] so those need to be requeued again [22:38] and we still have to do communities [22:38] we're now only doing posts and users [22:39] but we're getting as much as possible [22:39] and might maybe just make it [22:39] * arkiver is afk for an hour [22:39] ok [22:39] *** Stiletto has quit IRC (Ping timeout: 260 seconds) [22:41] I wonder if just focusing on posts or on users would help, since otherwise every comment and post is covered twice. [23:01] *** Stilett0 has joined #archiveteam-bs [23:32] I don't suppose we can just ask for their hard drives :P [23:32] *** Crusher has quit IRC (Bye) [23:33] HELLO I FEEL BETTER [23:34] HELLO SketchCow [23:34] What happened? [23:36] Also SketchCow, if IA complains about 163.172.128.219 pegging the crap out of their Wayback API, its just the newsgrabber project [23:36] *** Crusher has joined #archiveteam-bs [23:44] arkiver: Imzy's servers appear overwhelmed again. Most of my jobs are failing due to connection timeouts of 5xx errors [23:50] *** bitBaron has quit IRC (Read error: Connection reset by peer) [23:50] And now they seem better [23:52] *** bitBaron has joined #archiveteam-bs