Time |
Nickname |
Message |
00:32
🔗
|
|
fredgido has quit IRC (Ping timeout: 612 seconds) |
00:37
🔗
|
|
bitBaron has quit IRC (Quit: My computer has gone to sleep. 😴😪ZZZzzz…) |
00:45
🔗
|
|
bitBaron has joined #internetarchive |
01:01
🔗
|
|
benjinsmi has joined #internetarchive |
01:04
🔗
|
|
benjins has quit IRC (Read error: Operation timed out) |
01:20
🔗
|
Phoen1x |
Is there a recommended limit for connections per server beyond the usual 2 connections per host from the RFCs? Trying to find a good compromise between 'server asplode' and 1 MB/s/server |
01:30
🔗
|
Somebody2 |
I don't know about the connection limit -- but there's various unofficial documentation on https://www.archiveteam.org/index.php?title=IA |
01:30
🔗
|
Somebody2 |
that should help with identifying where the files are |
01:31
🔗
|
Somebody2 |
But I'm not sure I understand why you care -- the only cost of using archive.org/download/{identifier}/{path} is one extra redirect per file... |
01:32
🔗
|
Phoen1x |
I'm trying to improve download speeds by being a dick and using multiple connections per server, not eliminate a roundtrip |
01:32
🔗
|
Somebody2 |
you can get a list of files from archive.org/metadata/{identifier} (in JSON format) or archive.org/download/{identifier}/{identifier}_files.xml (in XML format) |
01:32
🔗
|
Phoen1x |
Yep, already got a script that parses all that. Now I just need to know if I can parallelize per server without getting banned or causing problems |
01:33
🔗
|
Somebody2 |
*ah*, I see. |
01:33
🔗
|
Somebody2 |
ehhh... probably better to keep it to 2 -- but you might consider using the torrent links, also. |
01:34
🔗
|
Somebody2 |
that way other people who download can benefit from your copy automatically |
01:35
🔗
|
Somebody2 |
what's your hurry on downloading? do you have a deadline on needing to populate the space you've got, or something else? |
01:35
🔗
|
|
bitBaron has quit IRC (Quit: My computer has gone to sleep. 😴😪ZZZzzz…) |
01:35
🔗
|
Phoen1x |
Unfortunately the torrent files for the items I'm downloading are outdated and are missing several of the files, and I can't find the original uploader to ask him to rederive |
01:35
🔗
|
Phoen1x |
My hurry is that I'm downloading a lot of data and would like it to not take 3 weeks |
01:35
🔗
|
Phoen1x |
So I guess not a major hurry, really |
01:39
🔗
|
Somebody2 |
Yeah, 3 weeks isn't that long. :-) |
01:40
🔗
|
Somebody2 |
You can rederive the torrent files yourself -- in a number of ways; probably the easiest is to just add a review to the item. (You can even delete it again afterward if you wish) |
03:00
🔗
|
|
benjinsmi has quit IRC (Leaving) |
04:28
🔗
|
|
qw3rty119 has joined #internetarchive |
04:34
🔗
|
|
qw3rty118 has quit IRC (Read error: Operation timed out) |
07:37
🔗
|
|
deevious has joined #internetarchive |
08:56
🔗
|
|
dtm has quit IRC (Read error: Operation timed out) |
09:02
🔗
|
|
atomotic has joined #internetarchive |
09:14
🔗
|
|
dtm has joined #internetarchive |
11:39
🔗
|
|
atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) |
12:20
🔗
|
|
bitBaron has joined #internetarchive |
12:35
🔗
|
|
sknebel has quit IRC (Read error: Connection reset by peer) |
12:36
🔗
|
|
sknebel has joined #internetarchive |
16:13
🔗
|
|
deevious has quit IRC (Remote host closed the connection) |
17:19
🔗
|
|
bitBaron has quit IRC (Quit: My computer has gone to sleep. 😴😪ZZZzzz…) |
17:20
🔗
|
|
bitBaron has joined #internetarchive |
19:22
🔗
|
|
bitBaron has quit IRC (Quit: My computer has gone to sleep. 😴😪ZZZzzz…) |
19:22
🔗
|
|
bitBaron has joined #internetarchive |
19:32
🔗
|
|
bitBaron has quit IRC (Quit: My computer has gone to sleep. 😴😪ZZZzzz…) |
19:58
🔗
|
|
bitBaron has joined #internetarchive |
22:11
🔗
|
|
wise_flow has joined #internetarchive |
22:49
🔗
|
|
sivoais has quit IRC (Remote host closed the connection) |
23:00
🔗
|
|
thewisefl has joined #internetarchive |
23:02
🔗
|
|
wise_flow has quit IRC (Read error: Operation timed out) |