[00:01] *** schbirid2 has quit IRC (Read error: Operation timed out) [00:41] *** pizzaiolo has quit IRC (Ping timeout: 268 seconds) [00:49] *** pizzaiolo has joined #archiveteam-bs [01:34] *** schbirid2 has joined #archiveteam-bs [01:38] *** username1 has quit IRC (Read error: Operation timed out) [01:58] *** pizzaiolo has quit IRC (Read error: Operation timed out) [02:39] *** DFJustin has quit IRC (Remote host closed the connection) [02:43] *** DFJustin has joined #archiveteam-bs [02:43] *** swebb sets mode: +o DFJustin [03:25] I've noticed that some historically significant autism related stuff has seemingly disappeared and is no longer available to the public without a significant amount of effort and time, and that some content is likely going to have the same fate. I've started a discord server to coordinate, organize, and keep track of some autism related archiving projects. I chose discord over irc because of the organizational needs that will [03:25] likely be required. If you're interested in this, here's the link: https://discord.gg/d7WuhaH [04:50] *** Sk1d has quit IRC (Ping timeout: 194 seconds) [04:56] *** Sk1d has joined #archiveteam-bs [05:00] *** th1x has quit IRC (Read error: Operation timed out) [05:08] *** j08nY has joined #archiveteam-bs [05:18] *** mls_ is now known as mls [05:45] *** j08nY has quit IRC (Read error: Operation timed out) [06:47] i wonder what techknown is doing right that everybody else is doing wrong http://tracker.archiveteam.org/imzy/ :) [06:47] techknow* [06:55] *** icedice has joined #archiveteam-bs [07:53] *** SHODAN_UI has joined #archiveteam-bs [08:09] *** Honno has joined #archiveteam-bs [08:14] *** j08nY has joined #archiveteam-bs [08:21] *** Jonison has joined #archiveteam-bs [08:40] *** icedice2 has joined #archiveteam-bs [08:42] *** icedice has quit IRC (Ping timeout: 245 seconds) [08:47] *** j08nY has quit IRC (Quit: Leaving) [08:50] *** username1 has joined #archiveteam-bs [08:54] *** schbirid2 has quit IRC (Read 
error: Operation timed out) [09:11] *** TheLovina has quit IRC (Read error: Operation timed out) [09:12] *** SHODAN_UI has quit IRC (Quit: zzz) [09:13] *** schbirid2 has joined #archiveteam-bs [09:15] *** username1 has quit IRC (Read error: Operation timed out) [09:22] O.o 75575=200 https://www.imzy.com/api/accounts/profiles/thelizardqueen/comments?page=75526&per_page=25 [09:22] 75k pages of 25 comments each. How do you even post that much? [09:25] yeah, i got a 63k user here :p [09:26] *** icedice2 has quit IRC (Quit: Leaving) [09:27] Actually, that may be a bug. [09:27] Every page has the same contents. [09:27] diff -sq <(curl 'https://www.imzy.com/api/accounts/profiles/thelizardqueen/comments?page=75526&per_page=25') <(curl 'https://www.imzy.com/api/accounts/profiles/thelizardqueen/comments?page=175526&per_page=25') [09:28] arkiver: ^ [09:40] yeah, it is identical, you can replace page number with any random value [09:41] it would have been nice with a 404 if the page number is out of range [09:44] Or an empty result. That's what imzy.lua checks for before advancing to the next page. [09:49] bwhaha [09:49] ALL THE COMMENTS [09:50] ONE MILLION TIMES [09:58] *** username1 has joined #archiveteam-bs [10:02] *** schbirid2 has quit IRC (Read error: Operation timed out) [10:12] *** pizzaiolo has joined #archiveteam-bs [11:11] *** th1x has joined #archiveteam-bs [11:30] *** BlueMaxim has quit IRC (Read error: Operation timed out) [11:31] *** BlueMaxim has joined #archiveteam-bs [11:44] *** BlueMaxim has quit IRC (Quit: Leaving) [13:12] *** icedice has joined #archiveteam-bs [13:16] *** icedice has quit IRC (Client Quit) [13:36] JAA: strange [13:37] I paused imzy [13:37] I believe it did just send [] before [13:37] will recheck [14:24] *** Stiletto has quit IRC (Read error: Connection reset by peer) [14:25] *** Stilett0 has joined #archiveteam-bs [14:49] XBox E3 briefing on YouTube 1080P - 2.7GB. On Mixer (Beam) - 11GB. Anyone want the file? 
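Editor's sketch for the pagination problem discussed above (09:22 to 09:44): the comments endpoint answers 200 with the same contents for any page number, so a loop that stops only on an empty result never ends. The following is a minimal Python illustration, not the project's imzy.lua. The names `iter_pages`, `fetch_page`, `max_pages` and `buggy_api` are all hypothetical, and the stand-in API only imitates the behaviour described in the log. It stops on an empty page, on a page whose contents were already seen, or at a hard page cap.

```python
import hashlib
import json
from typing import Callable, Iterator, List


def iter_pages(fetch_page: Callable[[int], List[dict]],
               max_pages: int = 10_000) -> Iterator[List[dict]]:
    """Yield pages until one is empty, repeats an earlier page, or the cap is hit."""
    seen_digests = set()
    for page in range(1, max_pages + 1):
        items = fetch_page(page)
        if not items:
            # The "empty result" case that the log says imzy.lua checks for.
            return
        # Hash a canonical JSON form so identical pages compare equal.
        digest = hashlib.sha256(
            json.dumps(items, sort_keys=True).encode("utf-8")
        ).hexdigest()
        if digest in seen_digests:
            # Same contents as an earlier page: assume the server ignores
            # out-of-range page numbers and stop instead of looping forever.
            return
        seen_digests.add(digest)
        yield items


def buggy_api(page: int) -> List[dict]:
    """Hypothetical stand-in: out-of-range pages repeat the last real page."""
    real = {1: [{"id": 1}, {"id": 2}], 2: [{"id": 3}]}
    return real.get(page, real[2])


if __name__ == "__main__":
    for items in iter_pages(buggy_api):
        print(items)
```

One trade-off of this approach: two genuinely different pages with byte-identical contents would also end the loop early, so the `max_pages` cap is the more conservative of the two guards.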
[15:44] *** SHODAN_UI has joined #archiveteam-bs [15:54] Looks like the old OVH forums are all gone now. Well, at least I got .de, .co.uk, most of .com (French; still running), the dedicated Kimsufi/SoYouStart/Hubic ones, and some small ones (Senegal, Tunisia, and Morocco). I'll keep an eye on it, but with OVH I wouldn't be surprised if they just nuked it without an announcement. [15:56] Also, I'd be interested in explanations for why https://web.archive.org/web/20170613112823/https://forum.ovh.com/ is blocked by a 404ing robots.txt. [16:04] Oh, they're back up. So maybe they just moved them to another server, because the IP definitely changed. [16:08] *** SHODAN_UI has quit IRC (Remote host closed the connection) [16:09] LOL, one of my Imzy jobs was up to 165866 requests before I killed it due to that infinite comments page issue [16:13] Yeah, I also have some other... interesting jobs. One has been requesting https://www.imzy.com/api/accounts/profiles/metaxu?check=true 2400 times, for example. [16:14] Same for a few other accounts. [16:16] Any thoughts on grabbing the OVH forums twice? My idea would be to first run a quick --no-offsite-links grab to make sure the content isn't lost, and then run it again without that option to also grab external links if time permits. Leads to some duplication though, obviously (if the servers stay up). [16:30] very strange [16:30] JAA: do you have a log? [16:32] *** kristian_ has joined #archiveteam-bs [16:32] arkiver: Nothing meaningful. wget.log is just two lines, one for https://www.imzy.com/api/accounts/profiles/metaxu and one for https://images.imzy.com/prod/profiles/05tmjowq.jpg . The output on the pipeline is just countless lines of N=200 for that ?check=true URL. [16:35] Notably, there is no response record in the WARC for those 2k requests, only the requests. [16:36] what wget version? 
[16:36] *** j08nY has joined #archiveteam-bs [16:36] Wget/1.14.lua.20160530-955376b [16:36] they were returning 206 with empty body [16:36] it's not wget issue [16:36] klg: Shouldn't that appear in the WARC though? [16:36] ah 206 would make more sense [16:37] Oh, yeah, it's 206. [16:37] Sorry, missed that on the pipeline. [16:40] Regarding OVH, I've decided I'll follow that double-archiving strategy. The forum archives are really small (.ie was only 5 MiB at over 5k threads), so it's not really a problem. [17:01] nice [17:02] are they in wayback? [17:03] As mentioned above, at least forum.ovh.com is blocked in Wayback through robots.txt, although there is no robots.txt on the server (404). [17:03] right [17:03] Maybe they're using the last 200 robots.txt? [17:03] Which would obviously be completely negate the purpose of removing a robots.txt. [17:04] s/be // [17:13] *** fie has quit IRC (Read error: Operation timed out) [17:13] For the record, I'm following the same strategy on WoT as with the OVH forums; first with --no-offsite-links, then possibly without. [17:14] *** JensRex has quit IRC (Remote host closed the connection) [17:14] *** JensRex has joined #archiveteam-bs [17:20] *** icedice has joined #archiveteam-bs [17:23] *** fie has joined #archiveteam-bs [17:29] *** pukkie has joined #archiveteam-bs [17:37] Dear Archive Team, I just checked my warrior and there's still an item for imzy doing "user:anonymous". It looks like it's archiving anonymous's comments, but imzy's server simply throws an 404 and the warrior just requests the next page. It's currently on page 779709. Should I kill and restart the warrior? I'm scared and afraid. Please help me! Thank you! [17:38] it should be safe to kill and restart :) [17:43] Thank you very much xmc! Have a nice day everyone! 
:) [17:44] thanks for your computer time, pukkie :) [17:47] yeah, imzy is paused at the moment [17:47] has some serious problems [17:47] also with the 206 [17:59] *** pizzaiolo has quit IRC (Quit: pizzaiolo) [18:14] arkiver: lol [18:22] *** SHODAN_UI has joined #archiveteam-bs [19:00] *** ZexaronS has quit IRC (Read error: Connection reset by peer) [19:02] *** Ravenloft has quit IRC (Ping timeout: 260 seconds) [19:08] *** pukkie has quit IRC (Ping timeout: 260 seconds) [19:17] *** Ravenloft has joined #archiveteam-bs [19:24] *** GLaDOS has quit IRC (Ping timeout: 506 seconds) [19:27] *** GLaDOS has joined #archiveteam-bs [19:39] *** kristian_ has quit IRC (Quit: Leaving) [19:46] *** Ravenloft has quit IRC (Ping timeout: 260 seconds) [19:48] *** Panasonic has joined #archiveteam-bs [19:50] *** icedice has quit IRC (Quit: Leaving) [19:51] *** BartoCH has quit IRC (Ping timeout: 260 seconds) [19:57] *** BartoCH has joined #archiveteam-bs [20:01] *** Stilett0 has quit IRC (Ping timeout: 246 seconds) [20:02] I tried to hug Carl Malamud and ha ha ha ha he does not like hugs [20:02] NOTE FOR LATER [20:05] well, at least you tried [20:11] SketchCow: Wiki fix please :) [20:11] Still can't upload files [20:12] Yeah, I'll see if I can look today [20:12] I think it's a permissions issue since it's having issues moving from /tmp to main storage [20:12] It [20:12] It's likely disk space [20:13] Warning: This account has reached its disk usage quota. If you need more disk space, contact your service provider. 
[20:13] Lol [20:13] Oops, So where is it hosted [20:14] * jrwr puts in his notes that if he sees SketchCow that he might like a hug [20:16] Should work for now [20:16] I have to go ask for more space [20:16] Nice, I always suggest linodes for mediawiki installs of this size [20:17] I've run a few of them over the years, if you do upgrade the package i do suggest the new editor that mediawiki put out, it's very nice [20:17] makes editing and formatting easy as pie [20:18] By the way, 120,000 visitors on average a month to archiveteam wiki [20:18] Nice [20:19] On a related note: did anything ever happen regarding setting up TLS? I remember that there was some talk about that in here a few months ago. [20:19] On PCGamingWiki (the last big one I ran) we were getting 1-1.5 million a month for the first three months [20:19] Reddit Frontpage is a server killer, but with all the cache, it helps a ton [20:20] gotta override the session cookies and just ignore them [20:20] unless the user has a username set in the cookies, just strip out the sessions for easy caching [20:20] got a 95% hitrate after that change [20:20] I requested more disk space [20:20] Cool [20:21] If you ever need science done to the wiki, you're welcome to ask [20:23] jrwr: we have a project to archive news, NewsGrabber or NewsBuddy [20:23] it can be found in #newsgrabber [20:23] Cool [20:23] the project has been paused for quite some time due to a big update that needs to be done [20:23] and that will hopefully be done soon [20:23] *** MrRadar has left [20:23] *** MrRadar has joined #archiveteam-bs [20:33] *** ZexaronS has joined #archiveteam-bs [20:38] *** Stilett0 has joined #archiveteam-bs [20:39] Yes please [20:56] ArchiveTeam wiki being bumped from 2 GB space to 10 GB space. [20:57] I'm $50 poorer a year [20:59] Oh, SketchCow, while you're here: is there a recommended donation for the Ted Nelson Junk mail Archive Corps project? [20:59] How much archiving does a dollar get? 
[20:59] (I just notied the blog post today) [21:06] BEST JUNKMAIL [21:06] the spreadsheet they posted a few days ago said it averages twelve cents a page [21:06] all-in [21:07] Problem fixed [21:07] \o/ [21:07] (Disk space) [21:07] k@savetz.com paypal [21:07] if I was nearby to volunteer I could [21:07] Costs us $10/hr for scangrrl [21:37] *** th1x has quit IRC (Read error: Connection reset by peer) [21:39] *** Jonison has quit IRC (Read error: Connection reset by peer) [22:04] https://www.reddit.com/r/DnD/comments/6h3nv1/two_of_the_biggest_dd_boards_just_went_down/ [22:24] *** SHODAN_UI has quit IRC (Remote host closed the connection) [22:25] *** johnny5 has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** wp494 has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** PotcFdk has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** rocode has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** dboard has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** luckcolor has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** godane has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** robogoat has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** wabu has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** trs80 has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** SadDM has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** jspiros has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** Panasonic has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** j08nY has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** Odd0002 has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** dashcloud has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** antomati_ has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** SilSte has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** zerkalo has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** RedType has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** Nazca has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** pikhq 
has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** Xamayon has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** mgrytbak has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** acridAxid has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** joepie91 has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** cf has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** chfoo has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** eprillios has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** tapedrive has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** underscor has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** espes__ has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** kvieta has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** klg has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** username1 has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** Honno has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** brayden has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** superkuh has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** dcmorton has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** odemg has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** MrRadar has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** zenguy has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** xmc has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** slyphic has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** w0rp has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** zino has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** ranma_ has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** Selavi has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** Frogging has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** htw has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** dxrt has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** twigfoot has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** atlogbot has quit IRC (ircd.choopa.net 
hub.efnet.us) [22:25] *** chazchaz has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** atomicthu has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** TC01 has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** swebb has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** Darkstar has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** Somebody2 has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** Cameron_D has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** arkiver has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** Coderjo has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** Jonimoose has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** midas has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** yipdw has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** Baljem has quit IRC (ircd.choopa.net hub.efnet.us) [22:25] *** lainu has quit IRC (ircd.choopa.net hub.efnet.us) [22:48] *** bitBaron has joined #archiveteam-bs [23:06] *** johnny5 has joined #archiveteam-bs [23:06] *** wp494 has joined #archiveteam-bs [23:06] *** PotcFdk has joined #archiveteam-bs [23:06] *** rocode has joined #archiveteam-bs [23:06] *** dboard has joined #archiveteam-bs [23:06] *** luckcolor has joined #archiveteam-bs [23:06] *** godane has joined #archiveteam-bs [23:06] *** robogoat has joined #archiveteam-bs [23:06] *** wabu has joined #archiveteam-bs [23:06] *** trs80 has joined #archiveteam-bs [23:06] *** SadDM has joined #archiveteam-bs [23:06] *** jspiros has joined #archiveteam-bs [23:06] *** irc.colosolutions.net sets mode: +o SadDM [23:53] *** Odd0002 has joined #archiveteam-bs [23:53] *** ranma has quit IRC (Killed (ny.us.hub (Nick collision (new)))) [23:54] *** MrRadar has joined #archiveteam-bs [23:54] *** username1 has joined #archiveteam-bs [23:54] *** brayden has joined #archiveteam-bs [23:54] *** superkuh has joined #archiveteam-bs [23:54] *** dcmorton has joined #archiveteam-bs [23:54] *** odemg has joined #archiveteam-bs [23:54] *** zenguy has 
joined #archiveteam-bs [23:54] *** twigfoot has joined #archiveteam-bs [23:54] *** xmc has joined #archiveteam-bs [23:54] *** slyphic has joined #archiveteam-bs [23:54] *** w0rp has joined #archiveteam-bs [23:54] *** ranma has joined #archiveteam-bs [23:54] *** Selavi has joined #archiveteam-bs [23:54] *** Frogging has joined #archiveteam-bs [23:54] *** htw has joined #archiveteam-bs [23:54] *** dxrt has joined #archiveteam-bs [23:54] *** atlogbot has joined #archiveteam-bs [23:54] *** chazchaz has joined #archiveteam-bs [23:54] *** atomicthu has joined #archiveteam-bs [23:54] *** TC01 has joined #archiveteam-bs [23:54] *** swebb has joined #archiveteam-bs [23:54] *** Darkstar has joined #archiveteam-bs [23:54] *** Somebody2 has joined #archiveteam-bs [23:54] *** Cameron_D has joined #archiveteam-bs [23:54] *** arkiver has joined #archiveteam-bs [23:54] *** irc.servercentral.net sets mode: +oooo brayden xmc swebb arkiver [23:54] *** Coderjo has joined #archiveteam-bs [23:54] *** Jonimoose has joined #archiveteam-bs [23:54] *** midas has joined #archiveteam-bs [23:54] *** yipdw has joined #archiveteam-bs [23:54] *** Baljem has joined #archiveteam-bs [23:54] *** lainu has joined #archiveteam-bs [23:54] *** irc.servercentral.net sets mode: +oo Jonimoose yipdw [23:54] *** swebb sets mode: +o SadDM [23:54] *** swebb sets mode: +o DFJustin [23:54] *** swebb sets mode: +o SketchCow [23:54] *** swebb sets mode: +o balrog [23:54] *** ranma_ has joined #archiveteam-bs