Time |
Nickname |
Message |
01:20
🔗
|
|
ravetcofx has quit IRC (Ping timeout: 506 seconds) |
01:29
🔗
|
|
ravetcofx has joined #archiveteam-bs |
01:44
🔗
|
|
tfgbd_znc has quit IRC (Ping timeout: 633 seconds) |
01:45
🔗
|
|
tfgbd_znc has joined #archiveteam-bs |
01:48
🔗
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
01:55
🔗
|
|
BartoCH has joined #archiveteam-bs |
02:24
🔗
|
kanzure |
hi could i have somebody look at my metadata/construction here and tell me if i am missing any important semantic things i should be doing? https://archive.org/details/hplusroadmap-logs-2016-09-18 |
02:27
🔗
|
|
tfgbd_znc has quit IRC (Ping timeout: 633 seconds) |
02:54
🔗
|
Somebody |
kanzure: will look |
02:56
🔗
|
Somebody |
kanzure: er, when I try to download the ots file, I don't get any data... |
02:57
🔗
|
|
tfgbd_znc has joined #archiveteam-bs |
02:59
🔗
|
Somebody |
it was my mistake, forgot to add -L to curl |
03:02
🔗
|
Somebody |
the metadata looks fine to me |
03:02
🔗
|
Somebody |
I can't verify the ots without downloading the client, which I don't feel like doing right now |
03:36
🔗
|
nicolas17 |
kanzure: ae_g_i_s: Glacier is for long-term infrequent-access storage, Snowball is for transfer (into Glacier or S3) |
03:37
🔗
|
nicolas17 |
also, my question was about archiving as in, archiveteam, I guess it would go into the Internet Archive |
03:37
🔗
|
nicolas17 |
(that's your usual destination for rescued data, right?) |
03:41
🔗
|
Somebody |
200TB would require coordinating with the Internet Archive, yes |
03:48
🔗
|
kanzure |
Somebody: the timestamps are done cooking for the large hash indexes. i'll throw together a file tomorrow, especially if someone highlights me and harasses me. |
03:50
🔗
|
|
VADemon has quit IRC (Quit: left4dead) |
03:56
🔗
|
Somebody |
kanzure: great! |
03:58
🔗
|
kanzure |
Somebody: https://github.com/bjornedstrom/timmy -- there was another tool like this (but not explicitly for timestamping). would you happen to remember what it was? |
04:19
🔗
|
|
jrwr has quit IRC (Remote host closed the connection) |
04:21
🔗
|
|
Frogging has quit IRC (Read error: Operation timed out) |
04:21
🔗
|
|
Frogging has joined #archiveteam-bs |
04:22
🔗
|
|
SadDM has quit IRC (Read error: Operation timed out) |
04:22
🔗
|
|
rduser has quit IRC (Read error: Operation timed out) |
04:23
🔗
|
|
ranma has quit IRC (Read error: Operation timed out) |
04:23
🔗
|
|
rduser has joined #archiveteam-bs |
04:23
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
04:23
🔗
|
|
swebb has quit IRC (Read error: Operation timed out) |
04:23
🔗
|
|
Start has quit IRC (Quit: Disconnected.) |
04:23
🔗
|
|
swebb has joined #archiveteam-bs |
04:23
🔗
|
|
ndiddy has quit IRC (Read error: Connection reset by peer) |
04:24
🔗
|
|
SadDM has joined #archiveteam-bs |
04:24
🔗
|
|
yakfish has quit IRC (Read error: Operation timed out) |
04:24
🔗
|
|
yakfish has joined #archiveteam-bs |
04:24
🔗
|
|
Start has joined #archiveteam-bs |
04:25
🔗
|
|
Start has quit IRC (Client Quit) |
04:25
🔗
|
|
ranma has joined #archiveteam-bs |
04:26
🔗
|
|
dashcloud has joined #archiveteam-bs |
04:26
🔗
|
|
Start has joined #archiveteam-bs |
04:26
🔗
|
|
Start has quit IRC (Client Quit) |
04:27
🔗
|
|
Start has joined #archiveteam-bs |
04:27
🔗
|
|
Start has quit IRC (Client Quit) |
04:27
🔗
|
|
SadDM has quit IRC (Read error: Operation timed out) |
04:28
🔗
|
|
Start has joined #archiveteam-bs |
04:31
🔗
|
|
SadDM has joined #archiveteam-bs |
04:37
🔗
|
Somebody |
kanzure: I don't, sorry. |
05:33
🔗
|
|
joepie91 has quit IRC (Read error: Operation timed out) |
05:40
🔗
|
|
joepie91 has joined #archiveteam-bs |
05:42
🔗
|
|
Sk1d has quit IRC (Ping timeout: 250 seconds) |
05:43
🔗
|
|
jsp12345 has quit IRC (Ping timeout: 633 seconds) |
05:46
🔗
|
|
ivan has quit IRC (Leaving) |
05:46
🔗
|
|
marvinw has joined #archiveteam-bs |
05:49
🔗
|
|
Sk1d has joined #archiveteam-bs |
05:52
🔗
|
|
Stiletto has joined #archiveteam-bs |
06:06
🔗
|
nicolas17 |
Somebody: I'm just wondering what the logistics would be |
06:06
🔗
|
nicolas17 |
I don't think AWS Snowball could be used, because the data is in Europe and IA is in the US |
06:19
🔗
|
|
nicolas17 has quit IRC (Quit: zzz) |
06:20
🔗
|
|
Start has quit IRC (Remote host closed the connection) |
06:21
🔗
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
06:22
🔗
|
|
Start has joined #archiveteam-bs |
06:27
🔗
|
|
BartoCH has joined #archiveteam-bs |
06:36
🔗
|
|
marvinw has quit IRC (west.us.hub irc.colosolutions.net) |
06:36
🔗
|
|
SadDM has quit IRC (west.us.hub irc.colosolutions.net) |
06:36
🔗
|
|
dashcloud has quit IRC (west.us.hub irc.colosolutions.net) |
06:36
🔗
|
|
yakfish has quit IRC (west.us.hub irc.colosolutions.net) |
06:36
🔗
|
|
rduser has quit IRC (west.us.hub irc.colosolutions.net) |
06:36
🔗
|
|
kvieta has quit IRC (west.us.hub irc.colosolutions.net) |
06:36
🔗
|
|
jspiros has quit IRC (west.us.hub irc.colosolutions.net) |
06:36
🔗
|
|
remsen has quit IRC (west.us.hub irc.colosolutions.net) |
06:38
🔗
|
|
marvinw_ has joined #archiveteam-bs |
06:49
🔗
|
|
vitzli has joined #archiveteam-bs |
06:52
🔗
|
|
dashcloud has joined #archiveteam-bs |
06:53
🔗
|
|
rduser has joined #archiveteam-bs |
06:53
🔗
|
|
kvieta has joined #archiveteam-bs |
06:54
🔗
|
|
SadDM has joined #archiveteam-bs |
06:54
🔗
|
|
yakfish has joined #archiveteam-bs |
06:54
🔗
|
|
rduser has quit IRC (Ping timeout: 246 seconds) |
06:56
🔗
|
|
remsen has joined #archiveteam-bs |
06:57
🔗
|
|
jspiros has joined #archiveteam-bs |
06:57
🔗
|
|
rduser has joined #archiveteam-bs |
06:59
🔗
|
Somebody |
nicolas17: (if you read the logs) -- At the scale of 200TB, and IA being pretty well known and respected, my guess would be a specific deal between Amazon and IA, in which Amazon would load up a set of drives themselves, then deliver them to IA directly. But I am not affiliated with either, so IDK. If you want a better answer, email info@archive . org |
07:04
🔗
|
|
kristian_ has quit IRC (Quit: Leaving) |
07:05
🔗
|
|
rduser has quit IRC (west.us.hub irc.colosolutions.net) |
07:05
🔗
|
|
SadDM has quit IRC (west.us.hub irc.colosolutions.net) |
07:05
🔗
|
|
yakfish has quit IRC (west.us.hub irc.colosolutions.net) |
07:05
🔗
|
|
jspiros has quit IRC (west.us.hub irc.colosolutions.net) |
07:05
🔗
|
|
kvieta has quit IRC (west.us.hub irc.colosolutions.net) |
07:07
🔗
|
|
rduser has joined #archiveteam-bs |
07:07
🔗
|
|
jspiros has joined #archiveteam-bs |
07:07
🔗
|
|
yakfish has joined #archiveteam-bs |
07:07
🔗
|
|
SadDM has joined #archiveteam-bs |
07:07
🔗
|
|
kvieta has joined #archiveteam-bs |
07:46
🔗
|
|
Somebody has quit IRC (Ping timeout: 370 seconds) |
08:09
🔗
|
|
ravetcofx has quit IRC (Read error: Operation timed out) |
08:46
🔗
|
|
GE has joined #archiveteam-bs |
08:51
🔗
|
|
Somebody has joined #archiveteam-bs |
09:07
🔗
|
|
Somebody has quit IRC (Ping timeout: 370 seconds) |
10:46
🔗
|
godane |
looks like WFMU has more audio then even KPFA |
10:46
🔗
|
|
BlueMaxim has quit IRC (Quit: Leaving) |
10:46
🔗
|
godane |
i'm finding audio going back to the late 1970s on there site |
10:47
🔗
|
godane |
these files are going to be mp4 files that i'm renaming m4a |
10:48
🔗
|
godane |
only for derive reasons so we can get a mp3 and ogg file |
11:21
🔗
|
|
GE has quit IRC (Remote host closed the connection) |
12:05
🔗
|
|
whopper has joined #archiveteam-bs |
13:12
🔗
|
|
GE has joined #archiveteam-bs |
13:37
🔗
|
|
schbirid has joined #archiveteam-bs |
13:55
🔗
|
|
krazedkat has joined #archiveteam-bs |
14:11
🔗
|
joepie91 |
recommended watch (11 mins), on using analytics for targeted manipulation of individual people: https://www.youtube.com/watch?v=n8Dd5aVXLCc (in the context of elections, in this case) |
14:14
🔗
|
schbirid |
someone leak their db please |
15:04
🔗
|
|
schbirid has quit IRC (Quit: Leaving) |
15:20
🔗
|
|
Sanqui has quit IRC (Ping timeout: 260 seconds) |
15:32
🔗
|
kanzure |
ah, i was thinking of https://tlsnotary.org/ not https://github.com/bjornedstrom/timmy |
15:32
🔗
|
|
Sanqui has joined #archiveteam-bs |
15:34
🔗
|
kanzure |
https://github.com/tlsnotary/pagesigner-browserless |
15:39
🔗
|
|
VADemon has joined #archiveteam-bs |
15:43
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
15:43
🔗
|
kanzure |
oh. the model is a little weird. what? it's using some kind of 'oracle' living on amazon aws. https://tlsnotary.org/pagesigner.html |
15:46
🔗
|
|
dashcloud has joined #archiveteam-bs |
16:14
🔗
|
kanzure |
have any of you goons been able to get a full copy of libgen from their torrents? |
16:14
🔗
|
kanzure |
and also sicmag |
16:14
🔗
|
kanzure |
scimag.. |
16:14
🔗
|
|
Sanqui has quit IRC (Ping timeout: 260 seconds) |
16:16
🔗
|
|
Sanqui has joined #archiveteam-bs |
16:26
🔗
|
SketchCow |
Hi, so. |
16:27
🔗
|
SketchCow |
When someone comes in, goes "I've spent a whole day at the Internet Archive, I think it runs this way, I therefore have a pile of changes for them because I, person, have discovered the One True Way" |
16:27
🔗
|
SketchCow |
It's... not beneficial |
16:27
🔗
|
SketchCow |
But I always suggest sending mailing to info@archive.org |
16:27
🔗
|
SketchCow |
Even if the response will be "you read it wrong, we actually do ___" |
16:28
🔗
|
SketchCow |
Did I miss anything in here? lots of -bs when I came back from my weekend (which is good, #archiveteam kept clean) |
16:34
🔗
|
SketchCow |
(I've been uploading PILES of stuff to Archive, as well as doing TONS of cleanup (45,000 items shifted around)) |
16:39
🔗
|
kanzure |
is this from my backlog, or other unrelated backlog? |
16:39
🔗
|
SketchCow |
Oh, everyone got to share in it |
16:40
🔗
|
kanzure |
like, is there some sort of disagreement about using multiple hash functions that i'm not aware of? |
16:40
🔗
|
SketchCow |
Here is what happened. |
16:40
🔗
|
SketchCow |
You went "Oh, they do ___" |
16:40
🔗
|
SketchCow |
Then went from there |
16:40
🔗
|
SketchCow |
In each item at the archive is something called a _meta.xml file |
16:41
🔗
|
SketchCow |
That has multiple hashes. MD5, SHA1, etc. |
16:41
🔗
|
kanzure |
what about it? |
16:41
🔗
|
kanzure |
well, i guess i should first think about the difficulty of hash collisions for both md5 and sha1 |
16:41
🔗
|
kanzure |
there are some free starts in sha1 i think |
16:42
🔗
|
SketchCow |
Cool story, I work for the Internet Archive |
16:42
🔗
|
kanzure |
but not much else? i'm not entirely sure. |
16:42
🔗
|
SketchCow |
So much backlog, I should get something to eat and read it. |
16:42
🔗
|
SketchCow |
Obviously cool stuff went by and by cool stuff I mean text |
16:42
🔗
|
kanzure |
i wasn't aware that one's employment situation would change hash function integrity :P |
16:42
🔗
|
SketchCow |
Ha ha. |
16:42
🔗
|
kanzure |
i'm very confused so i'm also going to get food |
16:42
🔗
|
kanzure |
this is a good idea |
16:43
🔗
|
SketchCow |
So, are you used to people walking away from you while you "explain" things to them? |
16:43
🔗
|
SketchCow |
Does that happen a lot? |
16:43
🔗
|
kanzure |
my confusion continues to grow |
16:43
🔗
|
SketchCow |
I'm sure that's your normal state. |
16:43
🔗
|
kanzure |
honestly most of my time is spent confused, yes |
16:43
🔗
|
SketchCow |
Well, let's go this way. |
16:44
🔗
|
SketchCow |
I know a lot of the lore of WHY the Internet Archive does certain things. Wrongly and rightly. |
16:44
🔗
|
SketchCow |
I can answer those. |
16:44
🔗
|
SketchCow |
You might go "But wait, I've nerd-hotboxed Something Better" but that's not relevant to the questions here. |
16:44
🔗
|
kanzure |
btw i was also curious if internet archive considers reharshing to be extremely burdensome? like imagine sha1 was broken. and some other hash function was selected. |
16:44
🔗
|
SketchCow |
People dream up better ways of doing things, lacking 20% of the factors involved, all the time. |
16:45
🔗
|
kanzure |
.. excuse me, s/reharshing/rehashing |
16:45
🔗
|
SketchCow |
Internet Archive has several, many times in the past where it has done "something" to every item on the Archive in the name of integrity, improvement, or management. So no, not burdonsome. |
16:45
🔗
|
SketchCow |
Doing it because you sounded your gabriel horn is less likely. |
16:45
🔗
|
kanzure |
cool. |
16:45
🔗
|
kanzure |
what? |
16:45
🔗
|
SketchCow |
toot |
16:45
🔗
|
kanzure |
are you making a chuck tingle joke or something? |
16:47
🔗
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
16:48
🔗
|
yipdw |
pretty sure that's more of a reference to the archangel Gabriel |
16:48
🔗
|
kanzure |
well, i'm illiterate or something, so i wouldn't know that. |
16:49
🔗
|
SketchCow |
Yeah, it's one of those buried references |
16:49
🔗
|
* |
SketchCow has uploaded 1,400 magazines this week |
16:49
🔗
|
SketchCow |
And sorted several hundred RPG items, hundreds of zines, and killed thousands of spam items |
16:50
🔗
|
SketchCow |
This has taken the open inbox from 645,000 to 615,000 items |
16:50
🔗
|
SketchCow |
But still! Progress |
16:50
🔗
|
|
vitzli has quit IRC (Leaving) |
16:52
🔗
|
SketchCow |
I'm focusing on progress these days. |
16:54
🔗
|
SketchCow |
http://kotaku.com/why-some-video-games-are-in-danger-of-disappearing-fore-1789609791 (Got interviewed) |
16:58
🔗
|
|
BartoCH has joined #archiveteam-bs |
17:08
🔗
|
ZizzyDizz |
SketchCow: how can I upload my own archive into the Internet Archive |
17:13
🔗
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
17:21
🔗
|
|
chazchaz has quit IRC (Read error: Operation timed out) |
17:21
🔗
|
|
chazchaz has joined #archiveteam-bs |
17:24
🔗
|
|
FluffyFox has joined #archiveteam-bs |
17:27
🔗
|
|
ranma_ has joined #archiveteam-bs |
17:27
🔗
|
|
MrRadar has quit IRC (Read error: Operation timed out) |
17:27
🔗
|
|
Frogging has quit IRC (Read error: Operation timed out) |
17:27
🔗
|
|
ranma has quit IRC (Read error: Operation timed out) |
17:27
🔗
|
|
FluffyFox is now known as Frogging |
17:28
🔗
|
|
Laverne has quit IRC (Ping timeout: 370 seconds) |
17:28
🔗
|
|
Cameron_D has quit IRC (Ping timeout: 370 seconds) |
17:28
🔗
|
|
atlogbot has quit IRC (Read error: Operation timed out) |
17:28
🔗
|
|
ranma_ is now known as ranma |
17:31
🔗
|
|
swebb_ has joined #archiveteam-bs |
17:34
🔗
|
|
hawc145 has joined #archiveteam-bs |
17:35
🔗
|
SketchCow |
What is your archive of |
17:37
🔗
|
|
nicolas17 has joined #archiveteam-bs |
17:44
🔗
|
|
swebb has quit IRC (ny.us.hub irc.servercentral.net) |
17:44
🔗
|
|
HCross has quit IRC (ny.us.hub irc.servercentral.net) |
17:44
🔗
|
|
slyphic_ has quit IRC (ny.us.hub irc.servercentral.net) |
17:44
🔗
|
|
Flierp has quit IRC (ny.us.hub irc.servercentral.net) |
17:44
🔗
|
|
ZizzyDizz has quit IRC (ny.us.hub irc.servercentral.net) |
17:50
🔗
|
|
atlogbot has joined #archiveteam-bs |
17:50
🔗
|
|
Flierp has joined #archiveteam-bs |
17:50
🔗
|
|
ZizzyDizz has joined #archiveteam-bs |
17:51
🔗
|
|
MrRadar has joined #archiveteam-bs |
17:51
🔗
|
|
Cameron_D has joined #archiveteam-bs |
17:56
🔗
|
|
slyphic has joined #archiveteam-bs |
18:00
🔗
|
|
swebb_ is now known as swebb |
18:05
🔗
|
|
Somebody has joined #archiveteam-bs |
18:38
🔗
|
|
BartoCH has joined #archiveteam-bs |
18:44
🔗
|
hawc145 |
https://www.reddit.com/r/DataHoarder/comments/5ggujk/ipernitycom_shutting_down_end_of_january_2017/ arkiver we should get this |
18:55
🔗
|
|
Aranje has joined #archiveteam-bs |
18:59
🔗
|
|
hawc145 is now known as HCross |
19:00
🔗
|
arkiver |
hawc145: looks not too hard to grab |
19:00
🔗
|
HCross |
yea |
19:00
🔗
|
|
compu_85 has joined #archiveteam-bs |
20:06
🔗
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
20:20
🔗
|
|
dashcloud has quit IRC (Remote host closed the connection) |
20:44
🔗
|
|
BartoCH has joined #archiveteam-bs |
20:49
🔗
|
|
Somebody has quit IRC (Ping timeout: 370 seconds) |
21:24
🔗
|
|
jrwr has joined #archiveteam-bs |
21:37
🔗
|
|
VADemon has quit IRC (Quit: left4dead) |
21:49
🔗
|
arkiver |
HCross: what quality did you get pewdiepie in? |
21:49
🔗
|
HCross |
max |
21:49
🔗
|
HCross |
arkiver, its all sat on newsbuddy atm |
21:51
🔗
|
arkiver |
cool |
21:51
🔗
|
arkiver |
it nice that he has some 4K videos |
21:59
🔗
|
|
BlueMaxim has joined #archiveteam-bs |
22:00
🔗
|
|
Cameron_D has quit IRC (Ping timeout: 370 seconds) |
22:01
🔗
|
|
MrRadar has quit IRC (Ping timeout: 370 seconds) |
22:02
🔗
|
HCross |
SketchCow, can I have FOS FTP login details please so I can get this sent over |
22:02
🔗
|
HCross |
its 500GB |
22:05
🔗
|
|
Somebody has joined #archiveteam-bs |
22:08
🔗
|
|
slyphic_ has joined #archiveteam-bs |
22:08
🔗
|
|
slyphic has quit IRC (Read error: Operation timed out) |
22:09
🔗
|
|
Cameron_D has joined #archiveteam-bs |
22:12
🔗
|
|
MrRadar has joined #archiveteam-bs |
23:04
🔗
|
|
GE has quit IRC (Remote host closed the connection) |
23:18
🔗
|
|
dashcloud has joined #archiveteam-bs |
23:37
🔗
|
|
ravetcofx has joined #archiveteam-bs |
23:47
🔗
|
|
Somebody has quit IRC (Ping timeout: 370 seconds) |