#archiveteam-bs 2016-06-30,Thu

↑back Search

Time Nickname Message
00:24 🔗 ris has quit IRC ()
00:38 🔗 BlueMaxim has joined #archiveteam-bs
00:48 🔗 j08nY has quit IRC (Quit: Leaving)
01:00 🔗 JesseW has joined #archiveteam-bs
01:47 🔗 schbirid has quit IRC (Ping timeout: 258 seconds)
01:58 🔗 SketchCow https://twitter.com/keewa/status/747928622107889664
02:01 🔗 schbirid has joined #archiveteam-bs
02:07 🔗 godane i'm half way done with 1966 nasa docs
02:28 🔗 Asparagir has joined #archiveteam-bs
02:31 🔗 JesseW hi Asparagir!
02:31 🔗 Asparagir Howdy JesseW!
02:31 🔗 Asparagir Long time no see
02:31 🔗 JesseW btw, one of your archivebot piplines is stuck, I think.
02:31 🔗 Asparagir Aw, poop. Lemme take a look.
02:35 🔗 Asparagir Running a ton of updates and then will restart it.
02:53 🔗 VADemon has quit IRC (Quit: left4dead)
02:56 🔗 Asparagir archiveteam-saves-the-day has risen phoenix-like from teh ashes
04:07 🔗 DoomTay Oh great. Wayback Machine is in some kind of redirect look
04:07 🔗 DoomTay *loop
04:07 🔗 DoomTay I try to save an image and Firefox tells me that "the server is redirecting the request for this address in a way that will never complete."
04:08 🔗 DoomTay So everytime I refresh the page, I see the page cycle through /save/, /save/_embed/ and the "lastest crawl" URL
04:08 🔗 JesseW DoomTay: remove all your cookies for archive.org
04:09 🔗 Asparagir Oh wow, I thought that was just happening to me! I tried using the javascript bookmarklet to save some Imgur pages tonight and got trapped in loops.
04:10 🔗 JesseW because IA hasn't separated the wayback display functionality from the rest of the site (e.g. the save functionality, and the accounts), it often gets gummed up with random cookies from across the web
04:10 🔗 Asparagir Ohhh
04:12 🔗 Frogging wow
04:14 🔗 Start_ has joined #archiveteam-bs
04:14 🔗 Start has quit IRC (Ping timeout: 260 seconds)
04:25 🔗 DoomTay Hmm...clearing cookies doesn't seem to have helped completely
04:25 🔗 DoomTay I also noticed it also kinda happens with pages
04:26 🔗 DoomTay Like, I have WM save a page. I see a thing saying "Saving...", then when it's done I should get redirected to the crawled page, except I am redirected to squaore one with "this page has not been archived. Save this link here"
04:53 🔗 Sk1d has quit IRC (Ping timeout: 194 seconds)
04:54 🔗 tomwsmf-a has quit IRC (Read error: Operation timed out)
04:59 🔗 Sk1d has joined #archiveteam-bs
05:00 🔗 JesseW has quit IRC (Ping timeout: 370 seconds)
05:15 🔗 wumpus Yeah, we should really fix that cookie overload thing. However, DoomTay, please send your buggy save url to info@archive.org
05:16 🔗 DoomTay That's the thing; there's a few, and it goes away after a while
05:17 🔗 wumpus Please send it in, and mention that. If clearing web.archive.org cookies is not the fix, then it's a bug we don't know about.
05:17 🔗 * wumpus is not here to beg. Really. Just do it.
05:18 🔗 vitzli has joined #archiveteam-bs
05:18 🔗 DoomTay Right then. I'll finish up this batch, copy any afflicted urls, then bring this up with that address
05:18 🔗 DoomTay Then I'm going to bed
05:18 🔗 wumpus Thanks.
05:29 🔗 DoomTay There are actually a few other bugs I want to bring attention to, but I'm not sure if I should lump them all in one -emai lor not
05:29 🔗 DoomTay My gut says yes
06:12 🔗 JesseW has joined #archiveteam-bs
06:13 🔗 DoomTay has quit IRC (Quit: Page closed)
06:18 🔗 rolfb has joined #archiveteam-bs
06:18 🔗 rolfb has quit IRC (Client Quit)
06:37 🔗 wumpus Stressing about whether to send 1 email or 10 is not productive! Just send it. I'm sure it'll end up in my lap either way.
06:41 🔗 Aranje has quit IRC (Quit: Three sheets to the wind)
06:43 🔗 dashcloud has quit IRC (Ping timeout: 244 seconds)
06:48 🔗 dashcloud has joined #archiveteam-bs
07:01 🔗 dashcloud has quit IRC (Read error: Operation timed out)
07:09 🔗 dashcloud has joined #archiveteam-bs
07:14 🔗 JesseW has quit IRC (Ping timeout: 370 seconds)
07:29 🔗 ohhdemgir has quit IRC (Ping timeout: 633 seconds)
08:14 🔗 RichardG_ has joined #archiveteam-bs
08:15 🔗 RichardG has quit IRC (Read error: Operation timed out)
08:20 🔗 RichardG_ has quit IRC (Ping timeout: 370 seconds)
08:31 🔗 RichardG has joined #archiveteam-bs
08:33 🔗 schbirid https://ch.linkedin.com/jobs/view/170173493
08:46 🔗 vitzli uh, is it real?
08:58 🔗 vitzli has quit IRC (Quit: Leaving)
09:06 🔗 Honno has joined #archiveteam-bs
09:08 🔗 Fusl has quit IRC (Read error: Operation timed out)
09:12 🔗 Honno Hey any resources (ala large files of images) for gesture drawing practice offline?
09:13 🔗 Honno I see websites who offer gestures for a short time then go to the next one, wonder how they have so many gesture images
09:20 🔗 joepie91 schbirid: what was that?
09:20 🔗 joepie91 it's gone...
09:21 🔗 schbirid damn
09:22 🔗 schbirid https://twitter.com/tanayj/status/748313095877980160
09:22 🔗 joepie91 schbirid: why didn't you throw it into archivebot
09:22 🔗 joepie91 :p
09:22 🔗 schbirid because that channel annoys the fuck out of me
09:22 🔗 joepie91 why?
09:22 🔗 schbirid i tried web.archive.oprg but was robots.txt blocked
09:22 🔗 schbirid because so much clutter
09:23 🔗 joepie91 so.. disable notifications...?
09:23 🔗 schbirid no!
09:23 🔗 joepie91 that seems like a poor reason not to archive a thing
09:23 🔗 joepie91 :p
09:23 🔗 schbirid let me be
09:23 🔗 schbirid i like to archive my notifications >:P
10:44 🔗 Atluxity has quit IRC (Ping timeout: 260 seconds)
10:46 🔗 Atluxity has joined #archiveteam-bs
10:51 🔗 REiN^ has quit IRC ()
11:43 🔗 schbirid has quit IRC (Quit: Leaving)
11:53 🔗 schbirid has joined #archiveteam-bs
11:59 🔗 Fusl has joined #archiveteam-bs
12:28 🔗 dashcloud has quit IRC (Read error: Operation timed out)
12:32 🔗 dashcloud has joined #archiveteam-bs
12:44 🔗 Igloo Who uses Heritix to crawl sites? I'm only getting 2 threads active after tweaking to be a little more intrusive
13:29 🔗 BlueMaxim has quit IRC (Quit: Leaving)
13:41 🔗 dashcloud has quit IRC (Read error: Operation timed out)
13:45 🔗 dashcloud has joined #archiveteam-bs
13:50 🔗 DoomTay has joined #archiveteam-bs
14:06 🔗 HCross http://www.friendsreunited.co.uk still no news on the so called "export function"
14:18 🔗 luckcolor has quit IRC (Remote host closed the connection)
14:19 🔗 luckcolor has joined #archiveteam-bs
14:24 🔗 luckcolor has quit IRC (Read error: Connection reset by peer)
14:25 🔗 luckcolor has joined #archiveteam-bs
14:27 🔗 DoomTay Coursera's tracker shows data as MB/u. What's u? Unit?
14:27 🔗 Frogging yes
14:27 🔗 Frogging that would make sense anyway
14:31 🔗 DoomTay And that's basically the same as "item"? If so, why isn't it i?
14:33 🔗 luckcolor has quit IRC (Read error: Connection reset by peer)
14:33 🔗 luckcolor has joined #archiveteam-bs
14:34 🔗 schbirid beacuse noone cares
14:43 🔗 Frogging pretty much
14:43 🔗 luckcolor has quit IRC (Read error: Connection reset by peer)
14:44 🔗 Igloo I think it's average MB/item basically
14:55 🔗 luckcolor has joined #archiveteam-bs
15:01 🔗 Honno has quit IRC (Quit: Leaving)
15:18 🔗 j08nY has joined #archiveteam-bs
15:29 🔗 SilSte has quit IRC (Read error: Operation timed out)
15:30 🔗 JesseW has joined #archiveteam-bs
15:54 🔗 vitzli has joined #archiveteam-bs
15:55 🔗 JesseW has quit IRC (Read error: Operation timed out)
16:08 🔗 vitzli I've uploaded .djvu file to the IA (opensource collection, texts mediatype), but it was not processed/derived? - no pdfs/dervied formats were made, is it the way IA works or I did something wrong with this item?
16:09 🔗 SketchCow FOS basically cleared out of the buffer drive, yay
16:10 🔗 Start_ is now known as Start
16:10 🔗 Asparagir has quit IRC (Asparagir)
16:15 🔗 metalcamp has joined #archiveteam-bs
16:53 🔗 DoomTay Unbelievable. I can't see http://web.archive.org/web/*/foxbox.tv supposedly because of robots.txt, the robots.txt file itself appears to be down
16:55 🔗 fie has joined #archiveteam-bs
17:08 🔗 Frogging wayback uses the last known robots.txt
17:09 🔗 xmc vitzli: what's the item url?
17:09 🔗 xmc i'll take a look
17:09 🔗 tfgbd_znc has quit IRC (Read error: Connection reset by peer)
17:11 🔗 vitzli https://archive.org/details/ComputerPress-199306djvu - but it could be because I uploaded .jpeg cover there
17:12 🔗 vitzli but it wasn't derived when it was the djvu-only item, iirc
17:12 🔗 xmc well from the first derivation job, when there was only a djvu ... https://catalogd.archive.org/log/471342476
17:12 🔗 xmc it says NonSource : "DjVu"
17:12 🔗 xmc so maybe they can't derive from djvu
17:12 🔗 xmc has it worked elsewhere for you?
17:14 🔗 vitzli no, they were my first uploaded djvus, i think
17:16 🔗 xmc hrm
17:16 🔗 DoomTay I dunno. For a time, Oocities was down and I could browse stuff when I couldn't before
17:16 🔗 xmc what is the source format that you're getting?
17:16 🔗 vitzli djvu
17:16 🔗 * xmc nods
17:18 🔗 vitzli i can't get the original scan and it's the only format available
17:18 🔗 xmc hm, sorry, i can't help you much ... maybe if you can find a way to convert to pdf, and then upload both?
17:19 🔗 xmc but that should be the deriver's job
17:20 🔗 vitzli np, thank you, I'll try to do the conversion but djvu's aren't very good as sources
17:21 🔗 tfgbd_znc has joined #archiveteam-bs
17:22 🔗 ris has joined #archiveteam-bs
17:22 🔗 xmc yeah
17:23 🔗 vitzli what would be a better solution - to keep all original djvu's in one item (in .tar or .zip) and .pdf in individual items or keep then together as .pdf/.djvu pairs?
17:24 🔗 xmc each publication should be one item. you can upload multiple formats but they should be the same thing.
17:24 🔗 xmc if you want to be extra careful, you can note in the description that the djvu is the most-senior format that you have access to
17:29 🔗 vitzli thank you, will try to do it with the new items
17:47 🔗 goekesmi has quit IRC (Remote host closed the connection)
17:51 🔗 goekesmi has joined #archiveteam-bs
18:06 🔗 VADemon has joined #archiveteam-bs
18:07 🔗 SilSte has joined #archiveteam-bs
18:21 🔗 metalcamp has quit IRC (Ping timeout: 244 seconds)
18:21 🔗 metalcamp has joined #archiveteam-bs
18:37 🔗 Asparagir has joined #archiveteam-bs
19:18 🔗 vitzli has quit IRC (Leaving)
19:36 🔗 tomwsmf-a has joined #archiveteam-bs
19:53 🔗 Asparagir has quit IRC (Asparagir)
20:24 🔗 Aranje has joined #archiveteam-bs
20:35 🔗 mutoso has quit IRC (Ping timeout: 250 seconds)
20:58 🔗 JW_work has joined #archiveteam-bs
20:59 🔗 JW_work1 has quit IRC (Read error: Operation timed out)
21:29 🔗 metalcamp has quit IRC (Ping timeout: 244 seconds)
21:40 🔗 DoomTay has quit IRC (Quit: Page closed)
22:17 🔗 SilSte has quit IRC (Read error: Operation timed out)
22:26 🔗 SilSte has joined #archiveteam-bs
22:50 🔗 Asparagir has joined #archiveteam-bs
23:26 🔗 Asparagir has quit IRC (Asparagir)
23:28 🔗 BlueMaxim has joined #archiveteam-bs
23:44 🔗 DoomTay has joined #archiveteam-bs

irclogger-viewer