Time |
Nickname |
Message |
00:49
🔗
|
godane |
SketchCow: looks some g4tv.com-videos are in g4video |
00:49
🔗
|
godane |
there should be in g4video-web |
00:59
🔗
|
chronomex |
alard: is there a reasonable, compatible way to pad a warc file? I want each response body to start at the beginning of a disk block, so zfs block-level dedup can work with it |
01:00
🔗
|
chronomex |
I'm thinking a blank 'metadata' block would be the ticket |
01:00
🔗
|
chronomex |
oh, nevermind, readers are supposed to skip unknown blocks |
01:00
🔗
|
chronomex |
perfect |
01:41
🔗
|
db48x |
chronomex: interesting idea |
01:52
🔗
|
db48x |
chronomex: have you seen how much memory that takes though? |
01:53
🔗
|
chronomex |
I have not |
01:54
🔗
|
chronomex |
is it horrendous? |
01:55
🔗
|
db48x |
yes |
01:56
🔗
|
db48x |
320 bytes per block allocated in your filesystem |
02:01
🔗
|
db48x |
the block size is variable, but assume 64k as a half-way point between the extremes |
02:01
🔗
|
db48x |
how big do you expect your dataset to be? |
02:01
🔗
|
db48x |
actually, if it's one giant warc file, then the block size will be 128k |
02:02
🔗
|
db48x |
http://constantin.glez.de/blog/2011/07/zfs-dedupe-or-not-dedupe is an article I hadn't seen before, but it summarizes the data very well |
02:25
🔗
|
Brenry |
are you guys awake yet |
02:32
🔗
|
db48x |
Brenry: nope. what's up? |
02:33
🔗
|
Brenry |
did they ever scrape user data for geocities ? |
02:34
🔗
|
Brenry |
or was it just those neighborhoods or commercial sites |
02:34
🔗
|
db48x |
what do you mean by user data? |
02:35
🔗
|
Brenry |
geocities.com/~username |
02:35
🔗
|
Brenry |
so i can get my fkn .jpg pics |
02:36
🔗
|
Brenry |
none of those were password protected.. user dirs.. just open files like a directory list unless it had an index.html file |
02:37
🔗
|
db48x |
ah, hrm |
02:40
🔗
|
db48x |
I don't really know |
02:40
🔗
|
db48x |
the geocities project wasn't very organized, I'm afraid |
02:41
🔗
|
db48x |
have you tried one of the mirrors? |
02:42
🔗
|
Brenry |
one of those sites said the .jp geocities was still attached.. but that wouldn't have data from other regions eh ? tried that and said no longer exists |
02:42
🔗
|
Brenry |
yeah spent alot of time like a month after that crap in 2009.. and a year later |
02:42
🔗
|
Brenry |
and trying back |
02:43
🔗
|
Brenry |
db48x: keep at it ok.. i'll be back in like 2 years.. and i would really like my pictures |
02:43
🔗
|
db48x |
what was your username? |
02:43
🔗
|
Brenry |
oplazzz |
02:43
🔗
|
db48x |
let me see |
02:43
🔗
|
Brenry |
it was geocities.com/oplazzz or geocities.com/~oplazzz |
02:43
🔗
|
Brenry |
i tried wayback machine.. but doesnt seem to have users |
02:45
🔗
|
db48x |
hmm. you're not in the username list on oocities.org |
02:47
🔗
|
db48x |
nor is it available on reocities.com |
02:47
🔗
|
db48x |
but they'll email you if they come across it |
02:47
🔗
|
Brenry |
k.. l8r |
02:51
🔗
|
db48x |
it occurs to me that I probably should have said that they _can_ email him, if he goes and puts in his email address |
02:53
🔗
|
DFJustin |
welp |
03:04
🔗
|
godane |
found another broken video |
03:04
🔗
|
godane |
not cause of errors but cause it slowed down for some reason at the 11min mark |
03:11
🔗
|
godane |
so the 9200, 9559, and 9717 have some sort of bad encoding |
03:12
🔗
|
db48x |
videos can have variable frame rates |
03:12
🔗
|
db48x |
although it's rarely used, so it more often a mistake |
03:12
🔗
|
godane |
the frames was move like 1 every 5 seconds |
03:12
🔗
|
db48x |
does the video run on past the audio? |
03:13
🔗
|
godane |
there is no audio when this starts happen |
03:20
🔗
|
dashcloud |
got a sample godane? |
03:20
🔗
|
godane |
https://archive.org/details/g4tv.com-video9200 |
03:36
🔗
|
godane |
so i found some g4 underground clips |
03:36
🔗
|
godane |
its better then the episodes that i have found |
03:36
🔗
|
godane |
the episodes are all croped |
03:37
🔗
|
godane |
so top and bottom are cut |
03:46
🔗
|
godane |
i found a microsoft key note from tgs 2008 |
03:46
🔗
|
godane |
tgs = tokyo game show |
03:54
🔗
|
DFJustin |
I uploaded this last night https://archive.org/details/osaka-game-show-2009 |
08:12
🔗
|
SketchCow |
godane - Stop uploading until I tell you to. |
08:15
🔗
|
SketchCow |
We need to give you direct access to the g4 collections because you successfully killed out opensource_videos, which is frankly amazing. |
08:36
🔗
|
godane |
SetchCow: your joking right? |
08:38
🔗
|
godane |
you aways tell me to upload stuff then we will deal with it |
08:38
🔗
|
SketchCow |
It won't last more than a day. |
08:38
🔗
|
godane |
also your still puting them into g4video |
08:38
🔗
|
godane |
not g4video-web |
08:38
🔗
|
SketchCow |
But it's midnight at Internet Archive, I need to have the privs modified during the busy day. |
08:38
🔗
|
SketchCow |
Dude, one day |
08:38
🔗
|
godane |
ok |
08:39
🔗
|
SketchCow |
The most recent set wasn't put in there by me. |
08:39
🔗
|
godane |
oh |
08:39
🔗
|
SketchCow |
It was put in my a desperate jeff trying to stop g4 related video from completely choking our RSS feed |
08:39
🔗
|
SketchCow |
And other things |
08:39
🔗
|
godane |
oh |
08:39
🔗
|
db48x |
lol |
08:39
🔗
|
SketchCow |
Normally, my scooping up your uploads every once in a while was fine. |
08:40
🔗
|
SketchCow |
But you turned up the heat. |
08:40
🔗
|
godane |
yes but this 35k+ videos |
08:40
🔗
|
db48x |
godane: make them beg! |
08:40
🔗
|
SketchCow |
So soon you will have the ability to declare g4video and g4video-web as the collection, and upload that way. |
08:40
🔗
|
godane |
i hope i will get the twit collections access too |
08:41
🔗
|
SketchCow |
But we need a day, it's all timeshifted now. It's 9:40 here and 12:40 in California. |
08:41
🔗
|
SketchCow |
I'll get you ALL the collections you need. |
08:41
🔗
|
godane |
ok |
08:41
🔗
|
SketchCow |
I am surprised you're not aware you're one of the single largest non-institution uploaders |
08:42
🔗
|
godane |
wow |
08:43
🔗
|
SketchCow |
35,000 videos is a lot of videos, sir. |
08:43
🔗
|
SketchCow |
Anyway, like I said, one day, and we'll get this shored up. |
08:43
🔗
|
godane |
thats ok |
08:56
🔗
|
ersi |
godane: Bwahaha, you're TOO GOOD :) |
08:56
🔗
|
ersi |
That's awesome |
08:59
🔗
|
chronomex |
:) |
09:04
🔗
|
godane |
also its about 255gb now |
09:43
🔗
|
godane |
there looks to be alot of first 15 mins previews of games |
09:43
🔗
|
godane |
:-D |
10:28
🔗
|
ersi |
What? How hard is tar? :| tar -xf <file> for extraction, tar -cf file <targets> for creation and |
10:28
🔗
|
ersi |
tar -tf <file> to look at it without extracting ;o |
11:11
🔗
|
godane |
i'm back |
11:12
🔗
|
godane |
my internet wifi when out |
11:12
🔗
|
godane |
*went out |
11:12
🔗
|
Schbirid |
SketchCow: dont miss going to http://www.computerspielemuseum.de/ ! |
11:13
🔗
|
Schbirid |
hm, their english site is incomplete |
12:25
🔗
|
Cameron_D |
I prefer `tar -xvf <file>` so you can watch stuff scroll past |
12:46
🔗
|
ersi |
I do that with tars from unknowns, if I made it myself - I just -xf it |
15:59
🔗
|
godane |
uploaded: http://archive.org/details/BBV.Customer.Service.VHSCap-CG |
23:16
🔗
|
Coderjoe |
wow. I just found the world's first website. In the footer: "There have been [counter] hits to this site since noon GMT, Jan 1st, 4713 BC." |
23:20
🔗
|
chronomex |
lol |
23:20
🔗
|
chronomex |
technically true on all levels |
23:41
🔗
|
db48x |
hah |