Time |
Nickname |
Message |
00:17
🔗
|
lennier1 |
Maybe they could have separated, but I don't see how they'd do it after the alleged infringement. As far as I know, it was all run by a single nonprofit organization at the time. |
00:34
🔗
|
icedice |
Yeah |
00:37
🔗
|
icedice |
But if they didn't transfer any of Internet Archive's money into the new legal entity and instead made all future donations go there then it could maybe be shielded to allow them to continue to operate the Wayback Machine and the downloads that aren't books in case the court case doesn't go well |
00:38
🔗
|
|
jason0597 has quit IRC (Read error: Operation timed out) |
02:04
🔗
|
Frogging |
I'm sure there are some very knowledgeable and experienced people working on this that are considering all the legal options, we'll just have to wait for them to do their thing ^^ |
02:11
🔗
|
|
ivan has quit IRC (Quit: Leaving) |
02:12
🔗
|
|
ivan has joined #archiveteam-bs |
02:13
🔗
|
|
synm0nger has quit IRC (Quit: Wait, what?) |
02:14
🔗
|
|
SynMonger has joined #archiveteam-bs |
02:14
🔗
|
|
ivan_ has joined #archiveteam-bs |
02:25
🔗
|
|
ivan has quit IRC (Ping timeout: 745 seconds) |
02:37
🔗
|
|
ivan_ is now known as ivan |
02:42
🔗
|
|
Raccoon has quit IRC (Ping timeout: 622 seconds) |
03:00
🔗
|
|
Stiletto has joined #archiveteam-bs |
03:18
🔗
|
|
Stiletto has quit IRC () |
03:29
🔗
|
|
wp494 has quit IRC (Read error: Operation timed out) |
03:29
🔗
|
|
wp494 has joined #archiveteam-bs |
03:30
🔗
|
|
Stiletto has joined #archiveteam-bs |
03:46
🔗
|
|
qw3rty_ has joined #archiveteam-bs |
03:54
🔗
|
|
qw3rty__ has quit IRC (Read error: Operation timed out) |
04:07
🔗
|
|
wp494 has quit IRC (Read error: Operation timed out) |
04:09
🔗
|
|
wp494 has joined #archiveteam-bs |
04:35
🔗
|
godane |
SketchCow: i noticed you have not uploaded the last 4 vhs rips i uploaded to FOS |
04:36
🔗
|
godane |
i'm surprised that you have not uploaded them cause they was upload back on may 23 |
04:36
🔗
|
godane |
anyways letting you know |
04:49
🔗
|
|
BlueMax has quit IRC (Read error: Connection reset by peer) |
04:49
🔗
|
|
BlueMax has joined #archiveteam-bs |
06:54
🔗
|
icedice |
Frogging: Yeah, true |
07:38
🔗
|
|
lennier2 has joined #archiveteam-bs |
07:39
🔗
|
|
Ctrl has quit IRC (Read error: Operation timed out) |
07:41
🔗
|
|
lennier1 has quit IRC (Read error: Operation timed out) |
07:41
🔗
|
|
lennier2 is now known as lennier1 |
07:42
🔗
|
|
ats has quit IRC (Read error: Operation timed out) |
07:42
🔗
|
|
ats has joined #archiveteam-bs |
07:43
🔗
|
|
Meli has quit IRC (Read error: Operation timed out) |
07:46
🔗
|
|
lunik132 has joined #archiveteam-bs |
07:46
🔗
|
|
lunik13 has quit IRC (Ping timeout: 265 seconds) |
07:46
🔗
|
|
lunik132 is now known as lunik13 |
07:46
🔗
|
|
legoktm has quit IRC (Read error: Connection reset by peer) |
07:47
🔗
|
|
Meli has joined #archiveteam-bs |
07:47
🔗
|
|
legoktm has joined #archiveteam-bs |
07:50
🔗
|
|
Ctrl has joined #archiveteam-bs |
08:25
🔗
|
|
BlueMax has quit IRC (Read error: Connection reset by peer) |
08:33
🔗
|
|
Lord_Nigh has quit IRC (ZNC - http://znc.in) |
08:34
🔗
|
|
Lord_Nigh has joined #archiveteam-bs |
08:41
🔗
|
|
icedice has quit IRC (Leaving) |
10:06
🔗
|
|
wessel152 has joined #archiveteam-bs |
13:05
🔗
|
|
jason0597 has joined #archiveteam-bs |
13:06
🔗
|
|
dashcloud has quit IRC (Ping timeout: 745 seconds) |
13:11
🔗
|
|
jason0597 has quit IRC (Remote host closed the connection) |
13:12
🔗
|
|
jason0597 has joined #archiveteam-bs |
13:15
🔗
|
|
MaximeleG has joined #archiveteam-bs |
13:20
🔗
|
|
jason0597 has quit IRC (Remote host closed the connection) |
13:21
🔗
|
|
jason0597 has joined #archiveteam-bs |
13:27
🔗
|
|
Raccoon has joined #archiveteam-bs |
13:27
🔗
|
|
Raccoon has quit IRC (Remote host closed the connection!) |
13:27
🔗
|
|
Raccoon has joined #archiveteam-bs |
14:27
🔗
|
|
dashcloud has joined #archiveteam-bs |
14:31
🔗
|
|
Arcorann has quit IRC (Read error: Connection reset by peer) |
14:43
🔗
|
|
Datechnom has quit IRC (Read error: Operation timed out) |
14:43
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
16:32
🔗
|
|
katocala has quit IRC () |
16:53
🔗
|
|
katocala has joined #archiveteam-bs |
16:59
🔗
|
|
dashcloud has joined #archiveteam-bs |
17:19
🔗
|
|
dashcloud has quit IRC (Ping timeout: 265 seconds) |
17:29
🔗
|
|
SmileyG has joined #archiveteam-bs |
17:30
🔗
|
pie_ |
is there an outline of infrastructure somewhere |
17:30
🔗
|
pie_ |
or are most things largely ad-hoc |
17:32
🔗
|
JAA |
pie_: https://www.archiveteam.org/index.php?title=Dev/Infrastructure gives an overview at least for DPoS projects, but everything else is pretty much undocumented. |
17:33
🔗
|
|
Smiley has quit IRC (Read error: Operation timed out) |
17:34
🔗
|
pie_ |
on one hand thanks, on the other hand, eek |
17:34
🔗
|
pie_ |
i love the diagram |
17:34
🔗
|
pie_ |
only thing missing is the dog |
17:38
🔗
|
pie_ |
unrelated, is there any way to browse this data https://tracker.archiveteam.org/tumblr/ |
17:39
🔗
|
JAA |
It's all in the Wayback Machine. |
17:40
🔗
|
JAA |
Other than that, not really. |
17:46
🔗
|
Frogging |
pie_: A "hidden" reality of ArchiveTeam is that archiving (downloading and storing data) is really only half the battle, the other half is making the data useful |
17:46
🔗
|
Frogging |
:p |
17:46
🔗
|
Frogging |
There's the Wayback machine, but it has its limitations. |
17:48
🔗
|
Frogging |
All the data from ArchiveTeam grabs is accessible through Internet Archive items though (right, JAA?), so Wayback is not the only way to access it, but it's the easiest when its limitations are acceptable |
17:48
🔗
|
Frogging |
https://archive.org/details/archiveteam_tumblr |
17:48
🔗
|
Frogging |
from https://www.archiveteam.org/index.php?title=Tumblr |
18:03
🔗
|
Frogging |
making custom viewers for data sets like this would take some engineering, but I believe it's possible |
18:13
🔗
|
JAA |
Yes, all our stuff should be accessible. |
18:14
🔗
|
|
jason0597 has quit IRC (Read error: Operation timed out) |
18:15
🔗
|
JAA |
There were indices for some old projects so that you could easily download the data corresponding to a particular tracker item, but when Jason(?) asked about this here a couple years ago, nobody remembered how they were generated. And that hasn't existed for anything recent. |
18:15
🔗
|
JAA |
Custom viewers should be feasible but are definitely a lot of work. |
18:25
🔗
|
pie_ |
Frogging: right |
18:25
🔗
|
pie_ |
data you cant meaningfully access is not really helpful |
18:26
🔗
|
pie_ |
(including some manner of search) |
18:27
🔗
|
pie_ |
so IIUC there should be 8pb of tumbler on IA? |
18:30
🔗
|
JAA |
That's true for most of the WBM though. You need to know the URL to find any content, basically. |
18:30
🔗
|
pie_ |
right |
18:30
🔗
|
pie_ |
its correct but it doesnt help xd |
18:31
🔗
|
JAA |
They've been talking about a full-text index + search for years, but at the scale IA is operating on, it's ... not easy. |
18:31
🔗
|
pie_ |
yeah definitely would be problematic |
18:31
🔗
|
pie_ |
even file name dumps would be something for starters, or idk :/ |
18:32
🔗
|
JAA |
That can be done for our stuff. Not for the other data in the WBM though, most of which isn't publicly accessible. |
18:32
🔗
|
JAA |
It would still be quite large though. |
18:35
🔗
|
pie_ |
more manaable than 8pb :( |
18:35
🔗
|
JAA |
We certainly didn't archive 8 PB of Tumblr, and I doubt IA did either. |
18:39
🔗
|
pie_ |
then IDK how to read the list I linked :D |
18:39
🔗
|
pie_ |
doesnt it say the top guy did >7000GB |
18:39
🔗
|
pie_ |
wait im dumb |
18:40
🔗
|
pie_ |
so 70TB |
18:40
🔗
|
pie_ |
thats much better but still problematic |
19:05
🔗
|
|
jason0597 has joined #archiveteam-bs |
19:45
🔗
|
|
Pixi__ has joined #archiveteam-bs |
19:48
🔗
|
|
Pixi` has quit IRC (Ping timeout: 255 seconds) |
19:56
🔗
|
|
phirephly has quit IRC (Ping timeout: 255 seconds) |
20:00
🔗
|
|
bsmith093 has quit IRC (Read error: Operation timed out) |
20:17
🔗
|
|
bsmith093 has joined #archiveteam-bs |
20:17
🔗
|
|
phirephly has joined #archiveteam-bs |
20:26
🔗
|
|
Raccoon has quit IRC (Ping timeout: 745 seconds) |
20:28
🔗
|
|
jason0597 has quit IRC (Read error: Operation timed out) |
20:36
🔗
|
|
c0mpass has quit IRC (Read error: Connection reset by peer) |
20:44
🔗
|
|
tchaypo_ has quit IRC (Read error: Connection reset by peer) |
20:46
🔗
|
|
tchaypo_ has joined #archiveteam-bs |
20:49
🔗
|
|
fallenoak has quit IRC (Read error: Connection reset by peer) |
20:51
🔗
|
|
HP_Archiv has joined #archiveteam-bs |
20:54
🔗
|
|
jesse-s has quit IRC (Read error: Connection reset by peer) |
21:13
🔗
|
|
jason0597 has joined #archiveteam-bs |
21:13
🔗
|
|
tchaypo_ has quit IRC (Read error: Connection timed out) |
21:18
🔗
|
|
dashcloud has joined #archiveteam-bs |
21:20
🔗
|
|
MaximeleG has quit IRC (Quit: MaximeleG) |
21:26
🔗
|
|
godane has quit IRC (Read error: Connection reset by peer) |
21:35
🔗
|
|
fallenoak has joined #archiveteam-bs |
21:39
🔗
|
|
jesse-s has joined #archiveteam-bs |
21:43
🔗
|
|
godane has joined #archiveteam-bs |
21:47
🔗
|
|
tchaypo_ has joined #archiveteam-bs |
21:50
🔗
|
|
Lilpea has joined #archiveteam-bs |
22:06
🔗
|
|
HP_Archiv has quit IRC (Quit: Leaving) |
22:11
🔗
|
|
Lilpea has quit IRC (Ping timeout: 265 seconds) |
22:12
🔗
|
|
c0mpass has joined #archiveteam-bs |
22:47
🔗
|
|
Arcorann has joined #archiveteam-bs |
23:12
🔗
|
|
Datechnom has joined #archiveteam-bs |
23:19
🔗
|
|
Arcorann has quit IRC (Ping timeout: 265 seconds) |
23:33
🔗
|
|
Arcorann has joined #archiveteam-bs |
23:43
🔗
|
|
jason0597 has quit IRC (Read error: Operation timed out) |