Time |
Nickname |
Message |
15:34
🔗
|
SketchCow |
Back from dropping the lady off on her flight. |
15:34
🔗
|
SketchCow |
Now let's blow this popstand out on the archive.org side. |
15:34
🔗
|
SketchCow |
(Adding things into WARC format is taking a bit of time.) |
15:39
🔗
|
alard |
SketchCow: Do you have the latest version of the megawarc script (last update was four days ago)? That probably faster than the previous version. |
15:40
🔗
|
SketchCow |
Yes. |
15:41
🔗
|
SketchCow |
But remember, I'm doing between 200-300tb a swath here. |
15:41
🔗
|
SketchCow |
Like, 20 50gb items, etc. |
15:41
🔗
|
alard |
True, it's not fast, its fastER. : |
15:42
🔗
|
alard |
Are the webshots warcs also on the list for the wayback machine update? |
15:44
🔗
|
SketchCow |
All the new ones are. If I can get them in before this upcoming week, they'll get in. |
15:44
🔗
|
SketchCow |
I refer again to the google doc. I can link again if you want. |
15:46
🔗
|
alard |
Well, webshots is not on your list. However, most of the items already are megawarcs, so those probably don't need any work? |
15:46
🔗
|
alard |
(There's also a bunch of webshots items that aren't uploaded to the webshots collection. I don't have access to it, so they end up as 'opensource'.) |
15:47
🔗
|
alard |
https://archive.org/metamgr.php?&w_identifier=webshots*&w_collection=opensource&mode=more |
15:48
🔗
|
SketchCow |
In theory, all megawarcs will just work. |
15:48
🔗
|
SketchCow |
We have a small amount not megawarcs. |
15:48
🔗
|
SketchCow |
To be honest, webshots is super low priority. It's not dead yet, and some amount will live on |
15:49
🔗
|
SketchCow |
Whereas getting Tabblo into the thing, which we've done! is going to be huge. |
15:50
🔗
|
SketchCow |
About to start shoving in Anyhub. |
15:50
🔗
|
SketchCow |
And going to fix City of Heroes, now that we have a better version of megawarc. |
15:58
🔗
|
alard |
Would it be an idea to make some of the tar-to-warc tasks distributed? |
15:59
🔗
|
alard |
"Download an item with tars, megawarc, upload to an item with warcs" isn't too difficult. |
16:00
🔗
|
SketchCow |
It's the pipe. |
16:00
🔗
|
SketchCow |
You just described two 50gb operations. |
16:02
🔗
|
alard |
There must be some people with fast connections. (Could run one on heroku.) |
16:02
🔗
|
SketchCow |
I'm seriously not over-worried about this. |
16:03
🔗
|
SketchCow |
I'm more concerned about the Fix the File Format Problem Month, which could use your help. |
16:03
🔗
|
alard |
You're patient. :) That's very good. |
16:03
🔗
|
SketchCow |
http://archive.org/details/archiveteam-anyhub-00000000-warc --- first anyhub loading in! |
16:16
🔗
|
SketchCow |
Downloading Splinder from the items to convert into megawarcs. |
16:16
🔗
|
SketchCow |
It's going 15mb-20mb a second, it's hard to get dependable faster hits. |
20:06
🔗
|
godane |
DFJustin: I'm uploading sserc graphics 1993 cdrom |
20:07
🔗
|
godane |
i'm also add the rar file to this one |
21:04
🔗
|
godane |
DFJustin: http://archive.org/details/cdrom-sserc-graphics-1993 |