[00:24] *** ris has quit IRC () [00:38] *** BlueMaxim has joined #archiveteam-bs [00:48] *** j08nY has quit IRC (Quit: Leaving) [01:00] *** JesseW has joined #archiveteam-bs [01:47] *** schbirid has quit IRC (Ping timeout: 258 seconds) [01:58] https://twitter.com/keewa/status/747928622107889664 [02:01] *** schbirid has joined #archiveteam-bs [02:07] i'm half way done with 1966 nasa docs [02:28] *** Asparagir has joined #archiveteam-bs [02:31] hi Asparagir! [02:31] Howdy JesseW! [02:31] Long time no see [02:31] btw, one of your archivebot piplines is stuck, I think. [02:31] Aw, poop. Lemme take a look. [02:35] Running a ton of updates and then will restart it. [02:53] *** VADemon has quit IRC (Quit: left4dead) [02:56] archiveteam-saves-the-day has risen phoenix-like from teh ashes [04:07] Oh great. Wayback Machine is in some kind of redirect look [04:07] *loop [04:07] I try to save an image and Firefox tells me that "the server is redirecting the request for this address in a way that will never complete." [04:08] So everytime I refresh the page, I see the page cycle through /save/, /save/_embed/ and the "lastest crawl" URL [04:08] DoomTay: remove all your cookies for archive.org [04:09] Oh wow, I thought that was just happening to me! I tried using the javascript bookmarklet to save some Imgur pages tonight and got trapped in loops. [04:10] because IA hasn't separated the wayback display functionality from the rest of the site (e.g. the save functionality, and the accounts), it often gets gummed up with random cookies from across the web [04:10] Ohhh [04:12] wow [04:14] *** Start_ has joined #archiveteam-bs [04:14] *** Start has quit IRC (Ping timeout: 260 seconds) [04:25] Hmm...clearing cookies doesn't seem to have helped completely [04:25] I also noticed it also kinda happens with pages [04:26] Like, I have WM save a page. I see a thing saying "Saving...", then when it's done I should get redirected to the crawled page, except I am redirected to squaore one with "this page has not been archived. Save this link here" [04:53] *** Sk1d has quit IRC (Ping timeout: 194 seconds) [04:54] *** tomwsmf-a has quit IRC (Read error: Operation timed out) [04:59] *** Sk1d has joined #archiveteam-bs [05:00] *** JesseW has quit IRC (Ping timeout: 370 seconds) [05:15] Yeah, we should really fix that cookie overload thing. However, DoomTay, please send your buggy save url to info@archive.org [05:16] That's the thing; there's a few, and it goes away after a while [05:17] Please send it in, and mention that. If clearing web.archive.org cookies is not the fix, then it's a bug we don't know about. [05:17] * wumpus is not here to beg. Really. Just do it. [05:18] *** vitzli has joined #archiveteam-bs [05:18] Right then. I'll finish up this batch, copy any afflicted urls, then bring this up with that address [05:18] Then I'm going to bed [05:18] Thanks. [05:29] There are actually a few other bugs I want to bring attention to, but I'm not sure if I should lump them all in one -emai lor not [05:29] My gut says yes [06:12] *** JesseW has joined #archiveteam-bs [06:13] *** DoomTay has quit IRC (Quit: Page closed) [06:18] *** rolfb has joined #archiveteam-bs [06:18] *** rolfb has quit IRC (Client Quit) [06:37] Stressing about whether to send 1 email or 10 is not productive! Just send it. I'm sure it'll end up in my lap either way. [06:41] *** Aranje has quit IRC (Quit: Three sheets to the wind) [06:43] *** dashcloud has quit IRC (Ping timeout: 244 seconds) [06:48] *** dashcloud has joined #archiveteam-bs [07:01] *** dashcloud has quit IRC (Read error: Operation timed out) [07:09] *** dashcloud has joined #archiveteam-bs [07:14] *** JesseW has quit IRC (Ping timeout: 370 seconds) [07:29] *** ohhdemgir has quit IRC (Ping timeout: 633 seconds) [08:14] *** RichardG_ has joined #archiveteam-bs [08:15] *** RichardG has quit IRC (Read error: Operation timed out) [08:20] *** RichardG_ has quit IRC (Ping timeout: 370 seconds) [08:31] *** RichardG has joined #archiveteam-bs [08:33] https://ch.linkedin.com/jobs/view/170173493 [08:46] uh, is it real? [08:58] *** vitzli has quit IRC (Quit: Leaving) [09:06] *** Honno has joined #archiveteam-bs [09:08] *** Fusl has quit IRC (Read error: Operation timed out) [09:12] Hey any resources (ala large files of images) for gesture drawing practice offline? [09:13] I see websites who offer gestures for a short time then go to the next one, wonder how they have so many gesture images [09:20] schbirid: what was that? [09:20] it's gone... [09:21] damn [09:22] https://twitter.com/tanayj/status/748313095877980160 [09:22] schbirid: why didn't you throw it into archivebot [09:22] :p [09:22] because that channel annoys the fuck out of me [09:22] why? [09:22] i tried web.archive.oprg but was robots.txt blocked [09:22] because so much clutter [09:23] so.. disable notifications...? [09:23] no! [09:23] that seems like a poor reason not to archive a thing [09:23] :p [09:23] let me be [09:23] i like to archive my notifications >:P [10:44] *** Atluxity has quit IRC (Ping timeout: 260 seconds) [10:46] *** Atluxity has joined #archiveteam-bs [10:51] *** REiN^ has quit IRC () [11:43] *** schbirid has quit IRC (Quit: Leaving) [11:53] *** schbirid has joined #archiveteam-bs [11:59] *** Fusl has joined #archiveteam-bs [12:28] *** dashcloud has quit IRC (Read error: Operation timed out) [12:32] *** dashcloud has joined #archiveteam-bs [12:44] Who uses Heritix to crawl sites? I'm only getting 2 threads active after tweaking to be a little more intrusive [13:29] *** BlueMaxim has quit IRC (Quit: Leaving) [13:41] *** dashcloud has quit IRC (Read error: Operation timed out) [13:45] *** dashcloud has joined #archiveteam-bs [13:50] *** DoomTay has joined #archiveteam-bs [14:06] http://www.friendsreunited.co.uk still no news on the so called "export function" [14:18] *** luckcolor has quit IRC (Remote host closed the connection) [14:19] *** luckcolor has joined #archiveteam-bs [14:24] *** luckcolor has quit IRC (Read error: Connection reset by peer) [14:25] *** luckcolor has joined #archiveteam-bs [14:27] Coursera's tracker shows data as MB/u. What's u? Unit? [14:27] yes [14:27] that would make sense anyway [14:31] And that's basically the same as "item"? If so, why isn't it i? [14:33] *** luckcolor has quit IRC (Read error: Connection reset by peer) [14:33] *** luckcolor has joined #archiveteam-bs [14:34] beacuse noone cares [14:43] pretty much [14:43] *** luckcolor has quit IRC (Read error: Connection reset by peer) [14:44] I think it's average MB/item basically [14:55] *** luckcolor has joined #archiveteam-bs [15:01] *** Honno has quit IRC (Quit: Leaving) [15:18] *** j08nY has joined #archiveteam-bs [15:29] *** SilSte has quit IRC (Read error: Operation timed out) [15:30] *** JesseW has joined #archiveteam-bs [15:54] *** vitzli has joined #archiveteam-bs [15:55] *** JesseW has quit IRC (Read error: Operation timed out) [16:08] I've uploaded .djvu file to the IA (opensource collection, texts mediatype), but it was not processed/derived? - no pdfs/dervied formats were made, is it the way IA works or I did something wrong with this item? [16:09] FOS basically cleared out of the buffer drive, yay [16:10] *** Start_ is now known as Start [16:10] *** Asparagir has quit IRC (Asparagir) [16:15] *** metalcamp has joined #archiveteam-bs [16:53] Unbelievable. I can't see http://web.archive.org/web/*/foxbox.tv supposedly because of robots.txt, the robots.txt file itself appears to be down [16:55] *** fie has joined #archiveteam-bs [17:08] wayback uses the last known robots.txt [17:09] vitzli: what's the item url? [17:09] i'll take a look [17:09] *** tfgbd_znc has quit IRC (Read error: Connection reset by peer) [17:11] https://archive.org/details/ComputerPress-199306djvu - but it could be because I uploaded .jpeg cover there [17:12] but it wasn't derived when it was the djvu-only item, iirc [17:12] well from the first derivation job, when there was only a djvu ... https://catalogd.archive.org/log/471342476 [17:12] it says NonSource : "DjVu" [17:12] so maybe they can't derive from djvu [17:12] has it worked elsewhere for you? [17:14] no, they were my first uploaded djvus, i think [17:16] hrm [17:16] I dunno. For a time, Oocities was down and I could browse stuff when I couldn't before [17:16] what is the source format that you're getting? [17:16] djvu [17:16] * xmc nods [17:18] i can't get the original scan and it's the only format available [17:18] hm, sorry, i can't help you much ... maybe if you can find a way to convert to pdf, and then upload both? [17:19] but that should be the deriver's job [17:20] np, thank you, I'll try to do the conversion but djvu's aren't very good as sources [17:21] *** tfgbd_znc has joined #archiveteam-bs [17:22] *** ris has joined #archiveteam-bs [17:22] yeah [17:23] what would be a better solution - to keep all original djvu's in one item (in .tar or .zip) and .pdf in individual items or keep then together as .pdf/.djvu pairs? [17:24] each publication should be one item. you can upload multiple formats but they should be the same thing. [17:24] if you want to be extra careful, you can note in the description that the djvu is the most-senior format that you have access to [17:29] thank you, will try to do it with the new items [17:47] *** goekesmi has quit IRC (Remote host closed the connection) [17:51] *** goekesmi has joined #archiveteam-bs [18:06] *** VADemon has joined #archiveteam-bs [18:07] *** SilSte has joined #archiveteam-bs [18:21] *** metalcamp has quit IRC (Ping timeout: 244 seconds) [18:21] *** metalcamp has joined #archiveteam-bs [18:37] *** Asparagir has joined #archiveteam-bs [19:18] *** vitzli has quit IRC (Leaving) [19:36] *** tomwsmf-a has joined #archiveteam-bs [19:53] *** Asparagir has quit IRC (Asparagir) [20:24] *** Aranje has joined #archiveteam-bs [20:35] *** mutoso has quit IRC (Ping timeout: 250 seconds) [20:58] *** JW_work has joined #archiveteam-bs [20:59] *** JW_work1 has quit IRC (Read error: Operation timed out) [21:29] *** metalcamp has quit IRC (Ping timeout: 244 seconds) [21:40] *** DoomTay has quit IRC (Quit: Page closed) [22:17] *** SilSte has quit IRC (Read error: Operation timed out) [22:26] *** SilSte has joined #archiveteam-bs [22:50] *** Asparagir has joined #archiveteam-bs [23:26] *** Asparagir has quit IRC (Asparagir) [23:28] *** BlueMaxim has joined #archiveteam-bs [23:44] *** DoomTay has joined #archiveteam-bs