Time |
Nickname |
Message |
03:48
🔗
|
SketchCow |
Supernap |
03:55
🔗
|
SketchCow |
alard: |
03:55
🔗
|
SketchCow |
While we'd like nothing more than to ride out the storm by transcribing Waldorf-Astoria menus, it looks like that's no longer an option. |
03:55
🔗
|
SketchCow |
Wait |
03:55
🔗
|
SketchCow |
I lost hope of retrieving my tabblo pictures when I found the Tabblo lifeboat thread. |
03:55
🔗
|
SketchCow |
The lifeboat does not work for me so myabe you can help |
03:55
🔗
|
SketchCow |
My username is teodorapopa |
04:48
🔗
|
SketchCow |
-------------- |
04:48
🔗
|
SketchCow |
So, just to prepare. Chance my power might go out (Hurricaine) |
04:49
🔗
|
SketchCow |
I'll use my cell to say hi and check on mail, but I might be iffy for the next few days. |
04:50
🔗
|
chronomex |
noted |
04:50
🔗
|
chronomex |
maybe the problem will be solved when you come back |
04:50
🔗
|
* |
chronomex ducks |
04:53
🔗
|
underscor |
hahaha |
04:54
🔗
|
* |
BlueMax throws chronomex out a window |
05:14
🔗
|
godane |
SketchCow: make sure the cube is wet proof |
05:14
🔗
|
godane |
maybe if space bag some of the stuff |
05:18
🔗
|
SketchCow |
Yeah, already on that. |
05:19
🔗
|
SketchCow |
The cube itself is not, the stuff inside is a foot higher or even higher than than as required |
05:21
🔗
|
godane |
i was thinking the space bag thing cause there suppose to keep water out |
05:22
🔗
|
godane |
i'm in nh so we may lose power too |
05:22
🔗
|
godane |
i backed up most the gbtv stuff last night |
05:46
🔗
|
SketchCow |
http://sphotos-b.xx.fbcdn.net/hphotos-prn1/547044_430246950357496_951372999_n.jpg |
05:47
🔗
|
chronomex |
hahaha |
05:47
🔗
|
underscor |
bahahaha |
05:53
🔗
|
SketchCow |
I'm proposing an Internet Archive kickstarter. Let's see how that flies. |
05:53
🔗
|
SketchCow |
Done right, instant $500k |
05:53
🔗
|
SketchCow |
That would be good |
06:26
🔗
|
SketchCow |
http://justsolve.archiveteam.org/index.php/FAQ |
10:48
🔗
|
alard |
SketchCow: http://ia601202.us.archive.org/3/items/test-memac-index-test/tabblo.html#teodorapopa |
10:53
🔗
|
SketchCow |
Thanks much. |
15:15
🔗
|
dragondon |
Just started my warrior, getting nothing but "Tracker rate limiting is in effect. Retrying after 30 seconds..." |
15:18
🔗
|
ersi |
dragondon: It's okay. It's intended. We're slowing down/pausing the Webshots archival project for the moment |
15:18
🔗
|
dragondon |
ah, guess I'll switch to something else. |
15:19
🔗
|
alard |
Change it to "ArchiveTeam's Choice"! |
15:19
🔗
|
ersi |
You can leave it on if you'd like, it'll get work to do - just not as often for the time being. Or you might switch to one of the other projects, like AT's choice |
15:19
🔗
|
alard |
I've just pointed that to the URLTeam project and will switch it back to Webshots when we continue that. |
15:19
🔗
|
dragondon |
yeah, just switched to AT Choice. |
15:19
🔗
|
flaushy |
\o/ my nas is dominating the recent stats ... slowest thing to turn in work late ;) |
15:20
🔗
|
alard |
dragondon: Great. |
15:21
🔗
|
flaushy |
alard: after about 15 mins / 800 pages wikipediareview gives me HTTP 400s |
15:21
🔗
|
flaushy |
could a fresh cookie help at that point? |
15:21
🔗
|
alard |
flaushy: Ah, yes, I saw your message. |
15:22
🔗
|
alard |
I don't know. You could try, or you could try with more time between requests. |
15:22
🔗
|
alard |
(Problem is: how do you get Wget to ask for a fresh cookie?) |
15:23
🔗
|
flaushy |
overwrite it in a second process? |
15:23
🔗
|
flaushy |
but i am probably too naive, i ll try :) |
15:24
🔗
|
alard |
I'm not sure if it reads the cookie file. |
15:24
🔗
|
alard |
So it isn't an IP-based block? |
15:25
🔗
|
flaushy |
it wasn't |
15:25
🔗
|
flaushy |
at least i could browse the forums |
15:25
🔗
|
alard |
Does it give any browsable error messages? |
15:25
🔗
|
alard |
(Error messages you could search for on Google, I mean.) |
15:26
🔗
|
flaushy |
checking |
15:30
🔗
|
flaushy |
google suggest using sane user agents |
15:31
🔗
|
alard |
I think just going slower might help. Invision power board seems to have a lot of ways to limit the number of X per second. |
15:32
🔗
|
flaushy |
ok slowly crawling := |
15:32
🔗
|
alard |
It's not going away soon, is it? |
15:33
🔗
|
flaushy |
na, it was more a "we should get it sometime" i think |
15:47
🔗
|
flaushy |
ok running with with wait 10 and random-wait |
16:44
🔗
|
soultcer |
That's a lot of URLTeam users: http://tracker.tinyarchive.org/v1/ |
17:01
🔗
|
ersi |
I stopped my workers when the tracker kept resetting every few days |
17:15
🔗
|
soultcer |
Oh, it's not resetting, I'm just draining it |
17:17
🔗
|
soultcer |
I think I'll have to add an all-time leaderboard that saves the number of tasks done by each user, even when I remove the finished tasks |
17:26
🔗
|
flaushy |
oh 2 of my workers stopped -.- |
17:27
🔗
|
flaushy |
soultcer: running into no buffer space available on my vps |
17:27
🔗
|
soultcer |
Can you paste the exact error message? |
18:48
🔗
|
SketchCow |
> x-archive-meta-title:Mirror of SAMPLES.MPLAYERHQ.HU - Multimedia Samples Archive |
18:48
🔗
|
SketchCow |
> Content-Length: 57073367040 |
18:48
🔗
|
SketchCow |
So that's happening. |
18:55
🔗
|
SmileyG |
soultcer: plz do, i asked for that long ago |
18:56
🔗
|
SmileyG |
statswhore me! |
19:14
🔗
|
bsmith094 |
any other projects i could help with? webshots is rate limited apparently |
19:14
🔗
|
bsmith094 |
remote server so not earrior |
19:14
🔗
|
bsmith094 |
warrior |
19:16
🔗
|
flaushy |
urlteam :) |
19:21
🔗
|
SketchCow |
just solve the problem |
19:22
🔗
|
SketchCow |
archiveteam wiki |
19:27
🔗
|
ersi |
bsmith094: Yeah, like flaushy and SketchCow said: 1) help add content to http://justsolve.archiveteam.org 2) help update and pretty up http://archiveteam.org 3) urlteam or AT's choice |
19:28
🔗
|
SketchCow |
We're dealing with a small slowdown, please be patient about that. |
19:29
🔗
|
ersi |
ie. take the ADHD meds |
19:29
🔗
|
ersi |
and possibly a beer |
19:29
🔗
|
SketchCow |
At the same time? |
19:33
🔗
|
ersi |
mayhapples |
19:33
🔗
|
ersi |
most likely; no |
21:48
🔗
|
bsmith094 |
does urlteam have a script? |
21:50
🔗
|
ersi |
Do you mean a pipeline script? Yes |
21:51
🔗
|
bsmith094 |
where? |
21:52
🔗
|
bsmith094 |
i dont think i can run the warrior on cli, so i need a pipeline script |
21:57
🔗
|
ersi |
I don't know where. But soultcer does, I think. Or you run the GUI |
21:57
🔗
|
ersi |
s/GUI/Warrior/ |
21:57
🔗
|
ersi |
the warrior has an API, you can HTTP commands to it |
21:59
🔗
|
alard |
bsmith094: https://github.com/soult/tinyback/ |
21:59
🔗
|
alard |
But if you want to run it yourself, you might be better of running ./run.py directly. |
22:04
🔗
|
bsmith094 |
alard: how, the instructions are vague |
22:04
🔗
|
alard |
bsmith094: I have not tried it, but the pipeline.py gives an example. |
22:05
🔗
|
alard |
./run.py -h |
22:08
🔗
|
bsmith094 |
alard: well i feel stupid for not realizing that, thank:) |
22:12
🔗
|
alard |
bsmith094: But I have to agree with you that the readme instructions under "How to run TinyBack" aren't exactly helpful. :) Perhaps, once you figure out what to do, you should send soultcer a patch. |
22:13
🔗
|
bsmith094 |
alard: run this screen -SL grab ./run.py --tracker=http://tracker.tinyarchive.org/v1/ --sleep=20 --one-task --temp-dir=./data --username=bsmith093 -d -c |
22:14
🔗
|
alard |
Thank you (I have a warrior). |