#archiveteam 2012-10-29,Mon


Time Nickname Message
03:48 🔗 SketchCow Supernap
03:55 🔗 SketchCow alard:
03:55 🔗 SketchCow While we'd like nothing more than to ride out the storm by transcribing Waldorf-Astoria menus, it looks like that's no longer an option.
03:55 🔗 SketchCow Wait
03:55 🔗 SketchCow I lost hope of retrieving my tabblo pictures when I found the Tabblo lifeboat thread.
03:55 🔗 SketchCow The lifeboat does not work for me so maybe you can help
03:55 🔗 SketchCow My username is teodorapopa
04:48 🔗 SketchCow --------------
04:48 🔗 SketchCow So, just to prepare. Chance my power might go out (Hurricane)
04:49 🔗 SketchCow I'll use my cell to say hi and check on mail, but I might be iffy for the next few days.
04:50 🔗 chronomex noted
04:50 🔗 chronomex maybe the problem will be solved when you come back
04:50 🔗 * chronomex ducks
04:53 🔗 underscor hahaha
04:54 🔗 * BlueMax throws chronomex out a window
05:14 🔗 godane SketchCow: make sure the cube is wet proof
05:14 🔗 godane maybe space bag some of the stuff
05:18 🔗 SketchCow Yeah, already on that.
05:19 🔗 SketchCow The cube itself is not, the stuff inside is a foot higher or even higher than that, as required
05:21 🔗 godane i was thinking the space bag thing because they're supposed to keep water out
05:22 🔗 godane i'm in nh so we may lose power too
05:22 🔗 godane i backed up most of the gbtv stuff last night
05:46 🔗 SketchCow http://sphotos-b.xx.fbcdn.net/hphotos-prn1/547044_430246950357496_951372999_n.jpg
05:47 🔗 chronomex hahaha
05:47 🔗 underscor bahahaha
05:53 🔗 SketchCow I'm proposing an Internet Archive kickstarter. Let's see how that flies.
05:53 🔗 SketchCow Done right, instant $500k
05:53 🔗 SketchCow That would be good
06:26 🔗 SketchCow http://justsolve.archiveteam.org/index.php/FAQ
10:48 🔗 alard SketchCow: http://ia601202.us.archive.org/3/items/test-memac-index-test/tabblo.html#teodorapopa
10:53 🔗 SketchCow Thanks much.
15:15 🔗 dragondon Just started my warrior, getting nothing but "Tracker rate limiting is in effect. Retrying after 30 seconds..."
15:18 🔗 ersi dragondon: It's okay. It's intended. We're slowing down/pausing the Webshots archival project for the moment
15:18 🔗 dragondon ah, guess I'll switch to something else.
15:19 🔗 alard Change it to "ArchiveTeam's Choice"!
15:19 🔗 ersi You can leave it on if you'd like, it'll get work to do - just not as often for the time being. Or you might switch to one of the other projects, like AT's choice
15:19 🔗 alard I've just pointed that to the URLTeam project and will switch it back to Webshots when we continue that.
15:19 🔗 dragondon yeah, just switched to AT Choice.
15:19 🔗 flaushy \o/ my nas is dominating the recent stats ... slowest thing to turn in work late ;)
15:20 🔗 alard dragondon: Great.
15:21 🔗 flaushy alard: after about 15 mins / 800 pages wikipediareview gives me HTTP 400s
15:21 🔗 flaushy could a fresh cookie help at that point?
15:21 🔗 alard flaushy: Ah, yes, I saw your message.
15:22 🔗 alard I don't know. You could try, or you could try with more time between requests.
15:22 🔗 alard (Problem is: how do you get Wget to ask for a fresh cookie?)
15:23 🔗 flaushy overwrite it in a second process?
15:23 🔗 flaushy but i am probably too naive, i ll try :)
15:24 🔗 alard I'm not sure if it reads the cookie file.
15:24 🔗 alard So it isn't an IP-based block?
15:25 🔗 flaushy it wasn't
15:25 🔗 flaushy at least i could browse the forums
15:25 🔗 alard Does it give any browsable error messages?
15:25 🔗 alard (Error messages you could search for on Google, I mean.)
15:26 🔗 flaushy checking
15:30 🔗 flaushy google suggests using sane user agents
15:31 🔗 alard I think just going slower might help. Invision Power Board seems to have a lot of ways to limit the number of X per second.
15:32 🔗 flaushy ok slowly crawling :=
15:32 🔗 alard It's not going away soon, is it?
15:33 🔗 flaushy na, it was more a "we should get it sometime" i think
15:47 🔗 flaushy ok running with wait 10 and random-wait
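A rough sketch of what those two settings mean for pacing, assuming they map to Wget's `--wait=10` and `--random-wait` options. Wget documents `--random-wait` as varying the pause between 0.5 and 1.5 times the `--wait` value; the function names below are made up for illustration and are not part of Wget.

```python
import random


def random_wait_delay(wait_seconds, rng=random):
    """Pick one pause in [0.5 * wait, 1.5 * wait] seconds.

    This mirrors the range Wget documents for --random-wait; it is a
    model of the pacing, not Wget's own code.
    """
    return rng.uniform(0.5 * wait_seconds, 1.5 * wait_seconds)


def crawl_wait_bounds(pages, wait_seconds):
    """Return (shortest, longest) total pause time in seconds for `pages` requests."""
    return (0.5 * wait_seconds * pages, 1.5 * wait_seconds * pages)


if __name__ == "__main__":
    # 800 pages is the point where the HTTP 400s reportedly started.
    low, high = crawl_wait_bounds(800, 10)
    print(f"800 pages at wait=10: between {low / 3600:.1f} and {high / 3600:.1f} hours of pauses")
```

Under those assumptions the same 800 pages that previously took about 15 minutes would now spread over one to three hours or so, which is the "go slower" approach suggested above.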
16:44 🔗 soultcer That's a lot of URLTeam users: http://tracker.tinyarchive.org/v1/
17:01 🔗 ersi I stopped my workers when the tracker kept resetting every few days
17:15 🔗 soultcer Oh, it's not resetting, I'm just draining it
17:17 🔗 soultcer I think I'll have to add an all-time leaderboard that saves the number of tasks done by each user, even when I remove the finished tasks
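One way the all-time leaderboard idea could look, as a minimal sketch only: a per-user counter is bumped when a task finishes and is left alone when finished tasks are drained. This is not the tinyarchive tracker's actual code; the `Leaderboard` class and its method names are hypothetical.

```python
from collections import Counter


class Leaderboard:
    """Keeps per-user totals separate from the list of finished tasks."""

    def __init__(self):
        self.finished = {}         # task_id -> username, removed on drain
        self.all_time = Counter()  # username -> tasks done, never drained

    def finish(self, task_id, username):
        """Record a finished task and credit the user who did it."""
        self.finished[task_id] = username
        self.all_time[username] += 1

    def drain(self):
        """Remove finished tasks; the all-time totals are untouched."""
        removed = len(self.finished)
        self.finished.clear()
        return removed

    def top(self, n=10):
        """Return the n users with the most tasks done, highest first."""
        return self.all_time.most_common(n)


if __name__ == "__main__":
    board = Leaderboard()
    board.finish("t1", "flaushy")
    board.finish("t2", "flaushy")
    board.finish("t3", "ersi")
    board.drain()
    print(board.top())
```

Keeping the totals in their own structure means draining the tracker no longer looks like a reset to people watching the stats.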
17:26 🔗 flaushy oh 2 of my workers stopped -.-
17:27 🔗 flaushy soultcer: running into no buffer space available on my vps
17:27 🔗 soultcer Can you paste the exact error message?
18:48 🔗 SketchCow > x-archive-meta-title:Mirror of SAMPLES.MPLAYERHQ.HU - Multimedia Samples Archive
18:48 🔗 SketchCow > Content-Length: 57073367040
18:48 🔗 SketchCow So that's happening.
18:55 🔗 SmileyG soultcer: plz do, i asked for that long ago
18:56 🔗 SmileyG statswhore me!
19:14 🔗 bsmith094 any other projects i could help with? webshots is rate limited apparently
19:14 🔗 bsmith094 remote server so not earrior
19:14 🔗 bsmith094 warrior
19:16 🔗 flaushy urlteam :)
19:21 🔗 SketchCow just solve the problem
19:22 🔗 SketchCow archiveteam wiki
19:27 🔗 ersi bsmith094: Yeah, like flaushy and SketchCow said: 1) help add content to http://justsolve.archiveteam.org 2) help update and pretty up http://archiveteam.org 3) urlteam or AT's choice
19:28 🔗 SketchCow We're dealing with a small slowdown, please be patient about that.
19:29 🔗 ersi ie. take the ADHD meds
19:29 🔗 ersi and possibly a beer
19:29 🔗 SketchCow At the same time?
19:33 🔗 ersi mayhapples
19:33 🔗 ersi most likely; no
21:48 🔗 bsmith094 does urlteam have a script?
21:50 🔗 ersi Do you mean a pipeline script? Yes
21:51 🔗 bsmith094 where?
21:52 🔗 bsmith094 i dont think i can run the warrior on cli, so i need a pipeline script
21:57 🔗 ersi I don't know where. But soultcer does, I think. Or you run the GUI
21:57 🔗 ersi s/GUI/Warrior/
21:57 🔗 ersi the warrior has an API, you can send HTTP commands to it
21:59 🔗 alard bsmith094: https://github.com/soult/tinyback/
21:59 🔗 alard But if you want to run it yourself, you might be better off running ./run.py directly.
22:04 🔗 bsmith094 alard: how, the instructions are vague
22:04 🔗 alard bsmith094: I have not tried it, but the pipeline.py gives an example.
22:05 🔗 alard ./run.py -h
22:08 🔗 bsmith094 alard: well i feel stupid for not realizing that, thanks :)
22:12 🔗 alard bsmith094: But I have to agree with you that the readme instructions under "How to run TinyBack" aren't exactly helpful. :) Perhaps, once you figure out what to do, you should send soultcer a patch.
22:13 🔗 bsmith094 alard: run this screen -SL grab ./run.py --tracker=http://tracker.tinyarchive.org/v1/ --sleep=20 --one-task --temp-dir=./data --username=bsmith093 -d -c
22:14 🔗 alard Thank you (I have a warrior).
