| Time |
Nickname |
Message |
|
00:00
🔗
|
godane |
its a rca home theater vcr |
|
00:01
🔗
|
godane |
it powers one but the machine will not take the tape |
|
00:09
🔗
|
|
Atom has quit IRC (Read error: Connection reset by peer) |
|
00:10
🔗
|
|
icedice has quit IRC (Read error: Operation timed out) |
|
00:14
🔗
|
godane |
so i now have a tape hitting the 10000k mark |
|
00:14
🔗
|
godane |
not of your tapes didn't for some reason |
|
00:19
🔗
|
godane |
anyways i'm only putting this tape at 6000k now |
|
00:20
🔗
|
godane |
most cause i only capture tv stuff around 5000k |
|
00:38
🔗
|
godane |
btw i got the money from patreon now |
|
00:40
🔗
|
|
BlueMaxim has quit IRC (Quit: Leaving) |
|
00:43
🔗
|
|
pizzaiolo has joined #archiveteam-bs |
|
01:15
🔗
|
|
BlueMaxim has joined #archiveteam-bs |
|
01:41
🔗
|
dashcloud |
glad to hear you got the payment situation fixed godane |
|
01:49
🔗
|
|
schbirid2 has joined #archiveteam-bs |
|
01:50
🔗
|
godane |
btw i found a youtube channel with all Sightings epsiodes |
|
01:50
🔗
|
godane |
i'm grabbing it for the archive and myspleen cause they been looking for all episodes of it |
|
01:54
🔗
|
|
username1 has quit IRC (Read error: Operation timed out) |
|
01:55
🔗
|
|
dashcloud has quit IRC (Remote host closed the connection) |
|
01:56
🔗
|
godane |
so got sci-fi airing of star trek |
|
02:05
🔗
|
atrocity |
why does sci-fi airing it matter? |
|
02:05
🔗
|
atrocity |
just for commercials and stuff? |
|
02:07
🔗
|
second |
mundus: what's up with your server? |
|
02:11
🔗
|
godane |
it has intro with william shatner talking about the episode |
|
02:12
🔗
|
godane |
so its for commericals and stuff |
|
02:12
🔗
|
godane |
some times bad edits on stations |
|
02:13
🔗
|
godane |
anyways i got 14 tapes from the guy for $10.01 |
|
02:19
🔗
|
|
VerifiedJ has joined #archiveteam-bs |
|
02:36
🔗
|
VerifiedJ |
godane: I did some more digging and found a way to get full PDFs. Details here https://verifiedjoseph.com/f68qUv7lqs/archiveteam/pagesuite-pdfs.txt (i hope it makes sense) |
|
02:42
🔗
|
|
r3c0d3x has quit IRC (Ping timeout: 260 seconds) |
|
02:44
🔗
|
|
VerifiedJ has left |
|
02:46
🔗
|
|
r3c0d3x has joined #archiveteam-bs |
|
03:13
🔗
|
|
Asparagir has quit IRC (Asparagir) |
|
03:22
🔗
|
|
Stilett0 has joined #archiveteam-bs |
|
03:22
🔗
|
|
Stilett0 is now known as Stiletto |
|
03:24
🔗
|
mundus |
second, can't afford it |
|
03:24
🔗
|
|
pizzaiolo has quit IRC (Quit: pizzaiolo) |
|
03:25
🔗
|
second |
mundus: how much was it costing? |
|
03:25
🔗
|
mundus |
$7/mo |
|
03:28
🔗
|
second |
hmm |
|
03:28
🔗
|
second |
Bandwidth cost? |
|
03:28
🔗
|
mundus |
I know it's not much |
|
03:29
🔗
|
mundus |
But I don't have much money |
|
03:29
🔗
|
mundus |
Unlimited bw |
|
04:02
🔗
|
second |
Does anyone know where I can find the old imdb database? |
|
04:04
🔗
|
second |
Or a movie database dataset? |
|
04:07
🔗
|
second |
Can someone archive this? ftp://ftp.fu-berlin.de/pub/misc/movies/database/temporaryaccess/ |
|
04:07
🔗
|
second |
https://sourceforge.net/p/imdbpy/mailman/message/35922484/ |
|
04:07
🔗
|
second |
And perhaps this ftp://ftp.funet.fi/pub/mirrors/ftp.imdb.com/pub/ |
|
04:08
🔗
|
second |
IMDB got rid of their database dumps and got a new format but it is missing a lot of data |
|
04:08
🔗
|
second |
a lot of cast / crew meta |
|
04:10
🔗
|
Somebody2 |
second: how big are those? |
|
04:11
🔗
|
second |
A few gigs |
|
04:12
🔗
|
second |
Post said it was Old files: 49 files, 1.9 GB |
|
04:12
🔗
|
Somebody2 |
nods |
|
04:12
🔗
|
second |
New files: 6 files, 361 MB on S3 |
|
04:12
🔗
|
second |
I can download the S3 stuff and pay for it just need to know where I can upload it for archiveteam to take it and how they want it |
|
04:13
🔗
|
second |
Or I can pay someone to get it and they give me a copy ;D |
|
04:13
🔗
|
Somebody2 |
second: you can upload it to the Internet Archive. |
|
04:14
🔗
|
Somebody2 |
Just make an account (all you need is an email address, which will be permanently and (not very) publically attached to whatever you upload). |
|
04:14
🔗
|
Somebody2 |
You can download each file and upload them before downloading the next, which should avoid you needing to hold on to larger amounts of space. |
|
04:15
🔗
|
second |
How is the IA doing with space? |
|
04:15
🔗
|
Somebody2 |
second: they've got plenty. |
|
04:16
🔗
|
Somebody2 |
a few gigs won't even be noticed. |
|
04:16
🔗
|
second |
I hope they have backups, also california is on fire, I hope the IA doesn't burn down too |
|
04:16
🔗
|
Somebody2 |
a few *terabytes* wouldn't be noticed |
|
04:16
🔗
|
Somebody2 |
once you get up to a petabyte, it's polite to ask first. |
|
04:16
🔗
|
Somebody2 |
(I'm somewhat exagerating, but only somewhat) |
|
04:17
🔗
|
Somebody2 |
They have backups; they are working on a backup in Canada, although I haven't heard much about it lately. |
|
04:19
🔗
|
|
Sk1d has quit IRC (Ping timeout: 250 seconds) |
|
04:25
🔗
|
Somebody2 |
second: I'm grabbing the fu-berlin one now. |
|
04:26
🔗
|
|
Sk1d has joined #archiveteam-bs |
|
04:27
🔗
|
kisspunch |
second: also grabbing |
|
04:42
🔗
|
Somebody2 |
Up to 2.8G so far |
|
04:45
🔗
|
Somebody2 |
second: it looks like the other address, funet.fi, is a mirror; are you sure the data is different? |
|
04:46
🔗
|
kisspunch |
Somebody2: IMDB releases snapshots with diffs--compare something outside the diffs folder |
|
04:46
🔗
|
kisspunch |
They will either be the same data at 2 points in time or the same point in time is what I was trying to convey |
|
04:47
🔗
|
Somebody2 |
kisspunch: not sure what you mean? |
|
04:47
🔗
|
Somebody2 |
the reported file sizes are identical |
|
04:47
🔗
|
Somebody2 |
the timestamps are a few hours different |
|
04:49
🔗
|
kisspunch |
Somebody2: I am saying, compare a file that's not a diff if you're going to do that check. In any case, I'm just grabbing both and running a deduplicator after |
|
04:49
🔗
|
Somebody2 |
sounds good. |
|
04:50
🔗
|
Somebody2 |
Let me know if your deduplication finds that they are different, and I'll grab the second one. |
|
04:56
🔗
|
kisspunch |
Pretty sure they're the same (actors.list.gz is the same size) but I'll double check tomorrow or so |
|
04:58
🔗
|
kisspunch |
Expect it to be 14G |
|
04:58
🔗
|
kisspunch |
My internet's not that fast, I just have an old dump :) |
|
05:11
🔗
|
yipdw |
JAA: send me an SSH public key over query or email or whatnot, I can grant you access to archivebot@archivebot-proto2 and then you can register new pipelines |
|
05:13
🔗
|
|
wp494 has quit IRC (Ping timeout: 506 seconds) |
|
05:19
🔗
|
Somebody2 |
YAYAYAY! New pipeline energy! |
|
05:20
🔗
|
Somebody2 |
https://blog.archive.org/2017/10/10/books-from-1923-to-1941-now-liberated/ |
|
05:21
🔗
|
Somebody2 |
One of the points about this focus on whether copies can be bought for a fair price. |
|
05:21
🔗
|
Somebody2 |
If there are only a few copies, can someone buy them, and announce that they are no longer for sale, and thereby trigger section 108(h)? |
|
05:23
🔗
|
pikhq |
How shockingly reasonable of US copyright law. |
|
05:25
🔗
|
Somebody2 |
pikhq: yeah, ain't it? |
|
05:44
🔗
|
|
wp494 has joined #archiveteam-bs |
|
05:48
🔗
|
|
BlueMaxim has quit IRC (Quit: Leaving) |
|
06:10
🔗
|
Somebody2 |
second: I've now got the fu-berlin one; it's 13G in size. I'll wait to hear from kisspunch about whether the funet.fi one is different before going after that. |
|
06:24
🔗
|
|
BlueMaxim has joined #archiveteam-bs |
|
06:35
🔗
|
|
loadup has quit IRC (Read error: Operation timed out) |
|
07:02
🔗
|
|
Honno has joined #archiveteam-bs |
|
08:01
🔗
|
|
atrocity has quit IRC () |
|
08:23
🔗
|
|
BlueMaxim has quit IRC (Ping timeout: 255 seconds) |
|
08:23
🔗
|
|
BlueMaxim has joined #archiveteam-bs |
|
08:44
🔗
|
|
wp494 has quit IRC (Ping timeout: 492 seconds) |
|
08:51
🔗
|
|
wp494 has joined #archiveteam-bs |
|
09:26
🔗
|
|
tfgbd_znc has quit IRC (Read error: Connection reset by peer) |
|
09:46
🔗
|
|
wabu has quit IRC (Read error: Operation timed out) |
|
09:56
🔗
|
|
wabu has joined #archiveteam-bs |
|
09:56
🔗
|
|
kepler45 has joined #archiveteam-bs |
|
10:05
🔗
|
kisspunch |
ugh, there are so many versions of fdupes |
|
10:15
🔗
|
|
Honno has quit IRC (Read error: Operation timed out) |
|
10:22
🔗
|
kisspunch |
Somebody2: It's the same. |
|
10:28
🔗
|
|
atrocity has joined #archiveteam-bs |
|
10:29
🔗
|
|
ivan has quit IRC (Leaving) |
|
10:40
🔗
|
|
marvinw has joined #archiveteam-bs |
|
10:54
🔗
|
|
Mateon1 has quit IRC (Ping timeout: 250 seconds) |
|
11:02
🔗
|
|
midas has quit IRC (Read error: Connection reset by peer) |
|
11:03
🔗
|
|
midas has joined #archiveteam-bs |
|
11:04
🔗
|
JAA |
yipdw: Excellent, will do in a bit. |
|
11:27
🔗
|
|
pizzaiolo has joined #archiveteam-bs |
|
12:00
🔗
|
|
qw3rty3 has joined #archiveteam-bs |
|
12:25
🔗
|
|
wabu has quit IRC (Read error: Operation timed out) |
|
12:28
🔗
|
|
Atom has joined #archiveteam-bs |
|
12:35
🔗
|
|
wabu has joined #archiveteam-bs |
|
12:43
🔗
|
|
BlueMaxim has quit IRC (Quit: Leaving) |
|
13:32
🔗
|
second |
Thank you Somebody2 |
|
13:33
🔗
|
second |
Did anyone by chance download the aws bucket for the imdb data? |
|
13:39
🔗
|
JAA |
Since you have to pay S3's exorbitant bandwidth fees (it's a Requester-Pays bucket), I kind of doubt it. I believe IMDB is still working on an HTTP interface without those fees. |
|
13:39
🔗
|
JAA |
See: https://getsatisfaction.com/imdb/topics/imdb-data-now-available-in-amazon-s3 |
|
13:40
🔗
|
JAA |
Them not having the HTTP interface up seems to be the reason why the FTP servers are still online. |
|
13:46
🔗
|
second |
Does anyone know where I can find a last.fm dump? |
|
13:47
🔗
|
|
Mateon1 has joined #archiveteam-bs |
|
14:12
🔗
|
|
Pixi has quit IRC (Quit: Pixi) |
|
14:12
🔗
|
|
Pixi has joined #archiveteam-bs |
|
14:14
🔗
|
|
icedice has joined #archiveteam-bs |
|
14:17
🔗
|
qw3rty3 |
Is there a channel for Amazon Forum archival? |
|
14:37
🔗
|
|
sep332 has joined #archiveteam-bs |
|
15:06
🔗
|
|
Asparagir has joined #archiveteam-bs |
|
15:16
🔗
|
|
icedice has quit IRC (Quit: Leaving) |
|
15:21
🔗
|
|
Stiletto has quit IRC (Ping timeout: 260 seconds) |
|
15:28
🔗
|
|
ZexaronS- has joined #archiveteam-bs |
|
15:30
🔗
|
|
ZexaronS has quit IRC (Ping timeout: 260 seconds) |
|
15:34
🔗
|
JAA |
qw3rty3: No, there isn't. |
|
15:58
🔗
|
|
Asparagir has quit IRC (Asparagir) |
|
16:15
🔗
|
schbirid2 |
re that new order forum. 125$ for a forum that would run on a 5$ host... wtf |
|
16:16
🔗
|
JAA |
I don't know the story behind this case, but I've seen similar setups before, and there it was a matter of "never change a running system" mixed with "I'm too lazy to do anything about it". |
|
16:18
🔗
|
|
Stilett0 has joined #archiveteam-bs |
|
16:24
🔗
|
|
Stilett0 is now known as Stiletto |
|
17:22
🔗
|
|
Asparagir has joined #archiveteam-bs |
|
17:39
🔗
|
|
pa has joined #archiveteam-bs |
|
17:50
🔗
|
dd0a13f37 |
VerifiedJ: that's basically a slower version of pdfcat though, it's not pristine so to speak |
|
17:52
🔗
|
|
pa has quit IRC (Quit: pa) |
|
17:54
🔗
|
|
pa has joined #archiveteam-bs |
|
17:56
🔗
|
dd0a13f37 |
second: a quick google search gives me https://www.demonforums.net/Thread-Last-fm-Dump-Re-upload https://leakninja.com/39243-lastfm-1-8gb-dump-12.html |
|
17:56
🔗
|
dd0a13f37 |
oh hey, https://btdig.com/85f39f1d94917d61277725e7da85d8177a5c12eb/ |
|
17:57
🔗
|
dd0a13f37 |
/last.fm/lastfm.txt.gz |
|
17:59
🔗
|
dd0a13f37 |
Any way to upload a torrent larger than 100gb to internetarchive? |
|
18:44
🔗
|
|
Stiletto has quit IRC () |
|
19:16
🔗
|
|
Asparagir has quit IRC (Asparagir) |
|
20:07
🔗
|
|
schbirid2 has quit IRC (Quit: Leaving) |
|
20:09
🔗
|
|
schbirid has joined #archiveteam-bs |
|
20:32
🔗
|
|
pa has quit IRC (Quit: pa) |
|
20:33
🔗
|
|
pa has joined #archiveteam-bs |
|
20:34
🔗
|
|
pa has quit IRC (Client Quit) |
|
20:39
🔗
|
JAA |
What's the best way to archive different source code repositories? I know about svnrdump for SVN repos, but what about other softwares? git, Mercurial, Bazaar, CVS, etc. |
|
20:41
🔗
|
JAA |
In particular, what to do if the repository itself is not public but only accessible through a web frontend? (There's an ArchiveBot job currently grabbing a CVSweb instance; that's the immediate trigger for these questions, though I've been wondering about it for longer.) |
|
20:44
🔗
|
yipdw |
git clone |
|
20:44
🔗
|
yipdw |
etc |
|
20:44
🔗
|
yipdw |
github-backup for stuff on github that might have other useful things like issues, wiki pages, etc |
|
20:53
🔗
|
kisspunch |
For git clone the harder part is keeping your mirror up to date--the initial clone yeah, git clone works fine |
|
21:38
🔗
|
|
Stilett0 has joined #archiveteam-bs |
|
22:33
🔗
|
|
Asparagir has joined #archiveteam-bs |
|
22:39
🔗
|
|
kepler45 has quit IRC (Quit: Leaving) |
|
22:49
🔗
|
wp494 |
that guy that I asked to PM me about the whole deal involving NCIX got back to me and he straight up refused despite having PMed someone else already |
|
22:49
🔗
|
wp494 |
/shrug |
|
23:15
🔗
|
|
BlueMaxim has joined #archiveteam-bs |
|
23:37
🔗
|
|
Soni has quit IRC (Ping timeout: 272 seconds) |
|
23:53
🔗
|
|
Asparagir has quit IRC (Asparagir) |