Time |
Nickname |
Message |
02:09
π
|
Wyatt|Wor |
Found another mobileme user recursing on homepage.mac.com |
02:11
π
|
Wyatt|Wor |
Username is tauran; haven't looked at why; busy with the CGI thing. |
02:41
π
|
Zuu- |
Hello |
02:42
π
|
Zuu- |
Was someone here archiving revision 3 shows? |
04:59
π
|
Coderjoe |
mm |
05:00
π
|
Coderjoe |
a bunch of curl connection failures. i suspect either my network or the s3 endpoint hiccupped |
05:35
π
|
chronomex |
christ, mobileme user has been uploading for a day now |
05:47
π
|
Wyatt|Wor |
chronomex: How big is it? |
05:47
π
|
Wyatt|Wor |
(And it's not something that got into a big loop or mirrored some other users?) |
05:48
π
|
chronomex |
-rw-r--r-- 1 duncan duncan 2.3G Nov 3 11:48 data/c/cr/cra/craig.schmidt/public.me.com/public.me.com-craig.schmidt.warc.gz |
05:49
π
|
Wyatt|Wor |
Ah, I see. Sorry, thinking in terms of 100Mb connections. ^^;; |
11:09
π
|
Schbirid |
shaqfu: ah, shame. :) |
11:09
π
|
Schbirid |
runs pretty well here |
11:11
π
|
Schbirid |
how could we get the fileplanet stuff nicely to archive.org? i guess it would be 100k+ items. |
11:11
π
|
Schbirid |
so it might be a bad idea to upload them individually |
11:22
π
|
Schbirid |
so far i have an average of <3MB per item. but then i am still <50000 and they will just get bigger and bigger |
12:07
π
|
Schbirid |
http://www.gamefront.com/breaking-ign-to-close-fileplanet/ |
12:07
π
|
Schbirid |
"We have decided to archive FilePlanet and will eventually stop operating the site" |
12:07
π
|
Schbirid |
so "archived" is a misleading term |
12:08
π
|
Schbirid |
"While the site will no longer be updated," IGN told us, "for now users can still use the site as a repository of file content. If/when we remove all site content completely, we'll be sure to communicate that to users before it happens." |
12:29
π
|
Nemo_bis |
Schbirid, why will they get bigger and bigger? you're downloading them chronologically and recent files are bigger? |
12:29
π
|
Nemo_bis |
10k files per item should be ok anyway |
12:31
π
|
Nemo_bis |
soo, I'm at about 1700 wikis downloaded for #wikiteam, but nobody is working on the uploading script |
12:32
π
|
Nemo_bis |
aka https://code.google.com/p/wikiteam/source/browse/trunk/uploader.py |
12:38
π
|
Schbirid |
yeah, i go IDs upwards and that means later files are bigger (guessing but i am 100% sure) |
12:39
π
|
Nemo_bis |
yep |
12:40
π
|
Schbirid |
ok, 10k parts sounds like a good idea |
12:40
π
|
Nemo_bis |
tarred maybe |
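[Editor's sketch, not part of the log] The "10k parts" idea above could be expressed as splitting the numeric file ID space into fixed-size ranges, one range per archive.org item. The function name, the starting ID, and the inclusive-range convention are assumptions made for illustration; the log does not say how the batches were actually cut.

```python
from typing import Iterator, Tuple


def batch_ranges(first_id: int, last_id: int, batch_size: int = 10_000) -> Iterator[Tuple[int, int]]:
    """Yield inclusive (start, end) ID ranges holding at most batch_size IDs each."""
    for start in range(first_id, last_id + 1, batch_size):
        # The last batch may be shorter than batch_size.
        yield start, min(start + batch_size - 1, last_id)


# Hypothetical ID span, chosen only to show a short final batch.
ranges = list(batch_ranges(1, 25_000))
for start, end in ranges:
    print(f"item covers file IDs {start}-{end}")
```

Each range could then become one item (or one tar inside an item), which keeps the per-item file count bounded as Nemo_bis suggests.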
12:41
π
|
SketchCow |
Hey, people. |
12:41
π
|
SketchCow |
Anything I need to know about? |
12:42
π
|
Nemo_bis |
That I'm eager to see ISO images flooding archive.org? |
12:42
π
|
Schbirid |
tar would mean that the files would not be accessible easily |
12:43
π
|
Nemo_bis |
Well, if the tar archive is under 5-10 GB there's the tar viewer. |
12:43
π
|
Nemo_bis |
But if you have to load an item description with 10k elements the HTML will take forever. |
12:43
π
|
Schbirid |
tar viewer! i never heard of that |
12:43
π
|
Schbirid |
oh, that is true |
12:43
π
|
Nemo_bis |
Probably it's better to download everything and then ask SketchCow with real data |
12:44
π
|
Nemo_bis |
take the /download link and add a / at the end of the URL |
12:44
π
|
Schbirid |
is that indexed by crawlers? |
12:45
π
|
Nemo_bis |
only if you put links I guess; or maybe not even in that case because there's nofollow even for internal links? |
12:45
π
|
Nemo_bis |
anyway, for instance: http://archive.org/download/mobileme-hero-1335947007/mobileme-full-1335947007.tar/ |
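[Editor's sketch, not part of the log] The tar viewer tip above amounts to a one-line URL transformation: take the /download link for the tar file and make sure it ends in a slash. The helper name is hypothetical, and the example assumes the un-slashed link is the same URL Nemo_bis gives, minus the trailing slash.

```python
def tar_listing_url(download_url: str) -> str:
    """Return the /download link with a trailing slash, which opens the tar viewer listing."""
    # Avoid doubling the slash if the caller already added one.
    return download_url if download_url.endswith("/") else download_url + "/"


listing = tar_listing_url(
    "http://archive.org/download/mobileme-hero-1335947007/mobileme-full-1335947007.tar"
)
print(listing)
```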
12:46
π
|
dnova |
that is awesome |
12:46
π
|
Schbirid |
nice |
12:47
π
|
Coderjoe |
ugh |
12:47
π
|
Coderjoe |
I fear I am uploading a bunch of tv show episodes to IA now >_< |
12:47
π
|
SketchCow |
I do agree that fos is getting a little slow. |
12:48
π
|
SketchCow |
Not sure why. |
12:57
π
|
SketchCow |
I'm cleaning up uploaded mobileme sets right now. |
12:57
π
|
Nemo_bis |
wow |
12:57
π
|
Nemo_bis |
what about splinder? |
12:57
π
|
SketchCow |
What about it? |
12:57
π
|
Nemo_bis |
is fos slowed down by the splinder tidying up? |
12:57
π
|
SketchCow |
No, no. |
12:57
π
|
Nemo_bis |
or did it finish |
12:57
π
|
SketchCow |
It's at a halt point, has been. |
13:03
π
|
SketchCow |
Just verified and removed 1.7tb of mobileme from the machine. |
13:04
π
|
SketchCow |
Another 2tb is being uploaded now. |
18:03
π
|
chronomex |
e.g. by me |
18:07
π
|
emijrp |
do you know how to change the spotlight item (on the left sidebar)? http://archive.org/details/spanishrevolution |
18:46
π
|
chronomex |
there's either a thing in the metadata for the collection or the item |
18:47
π
|
chronomex |
I think the collection |
20:27
π
|
jojo56 |
hello everyone |
20:38
π
|
Schbirid |
hi |
20:39
π
|
jojo56 |
why |
20:49
π
|
shaqfu |
...why? |
20:49
π
|
Schbirid |
WHY! |
20:52
π
|
shaqfu |
Schbirid: Is there any useful metadata coming down with the FP files? |
20:52
π
|
Schbirid |
yeah, i save the download page too |
20:52
π
|
Schbirid |
and the url the file comes from |
20:52
π
|
shaqfu |
Awesome |
20:52
π
|
Schbirid |
eg http://www.fileplanet.com/224884/download |
20:53
π
|
Schbirid |
has a full title "Gas Guzzlers: Combat Carnage Beta Client" |
20:53
π
|
Schbirid |
and their category "Home / Gaming / RPG / Massively Multiplayer / Gas Guzzlers: Combat Carnage / Game Clients" |
20:54
π
|
Schbirid |
perfect would be to save http://www.fileplanet.com/224884/220000/fileinfo/Gas-Guzzlers:-Combat-Carnage-Beta-Client too i guess. but i could not find a way to easily find those URLs so i just do the numeric id increments |
20:54
π
|
Schbirid |
their older files have informative download urls like http://download.direct2drive.com/ftp2/bgchronicles/agportraits/vance/celeb.zip |
20:55
π
|
shaqfu |
So long as the script grabs the page with basic metadata, it should be good |
20:55
π
|
Schbirid |
yeah |
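[Editor's sketch, not part of the log] Schbirid describes walking FilePlanet by numeric ID and saving each download page. A minimal way to generate those page URLs, assuming every ID follows the `http://www.fileplanet.com/<id>/download` pattern shown in the example above, might look like this. The function name and the ID range are hypothetical, and no fetching is done here; the actual script used is not shown in the log.

```python
from typing import Iterator

# Pattern taken from the example URL in the log; assumed to hold for all IDs.
DOWNLOAD_PAGE = "http://www.fileplanet.com/{file_id}/download"


def download_page_urls(first_id: int, last_id: int) -> Iterator[str]:
    """Yield the download-page URL for each numeric ID in the inclusive range."""
    for file_id in range(first_id, last_id + 1):
        yield DOWNLOAD_PAGE.format(file_id=file_id)


urls = list(download_page_urls(224884, 224886))
for url in urls:
    print(url)
```

Saving the page at each of these URLs alongside the file would capture the title and category metadata discussed above, even without the longer fileinfo URLs.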