Time |
Nickname |
Message |
00:03
🔗
|
|
octothorp has joined #archiveteam-bs |
00:41
🔗
|
|
Asparagir has joined #archiveteam-bs |
01:13
🔗
|
voltagex |
444gb torrent site dump |
01:13
🔗
|
voltagex |
ah, godane beat me to it |
01:14
🔗
|
|
Asparagir has quit IRC (Asparagir) |
01:14
🔗
|
voltagex |
robogoat: I have the infohash-only dump which is 300mb gzipped |
01:22
🔗
|
godane |
i have the infohash-only dump too |
01:24
🔗
|
godane |
maybe some here will want to archive this: https://www.youtube.com/user/mikeziegler/videos |
01:34
🔗
|
SketchCow |
I HAVE A SUTPID IDEA |
01:34
🔗
|
SketchCow |
AS SUTPID AS IT GETS |
01:34
🔗
|
SketchCow |
But I was thinking of it today. |
01:35
🔗
|
SketchCow |
Maybe we should have some page on the Wiki, some obvious name, which is the HOT BUTTON topics, the ones we're getting a lot about, so people aren't dependent on the IRC scroll. |
01:35
🔗
|
SketchCow |
Optional. Mostly reflective that I've been really busy with non IRC stuff, but if people think we're keeping up, great. |
01:36
🔗
|
voltagex |
godane: lol, just sampled 100 odd hashes - porn. |
01:38
🔗
|
godane |
so i'm now capturing a bad tape of mst3k |
01:38
🔗
|
godane |
thats going be 6 hours |
01:38
🔗
|
godane |
SketchCow: btw one tape i will have to fix cause there is a bit of plastic in it |
01:39
🔗
|
godane |
also the tape guard will have be replace |
01:40
🔗
|
godane |
left side goes up more the the right side |
01:44
🔗
|
|
jacketcha has quit IRC (Ping timeout: 252 seconds) |
01:53
🔗
|
|
Dimtree has joined #archiveteam-bs |
01:57
🔗
|
Stilett0 |
just found out YouTube deprecated video location fields: https://developers.google.com/youtube/v3/revision_history#release_notes_06_01_2017 |
01:57
🔗
|
* |
Stilett0 has been living under a rock apparently |
01:58
🔗
|
Stilett0 |
obligatory "SketchCow: DOOMED" |
01:58
🔗
|
|
Stilett0 is now known as Stiletto |
02:01
🔗
|
godane |
SketchCow: we really need to get the show called 'The Site' to have some digitize full episodes |
02:01
🔗
|
godane |
there is no full episodes of it out there |
02:02
🔗
|
godane |
its that and digitizing any zdtv stuff from 1998/1999 |
02:03
🔗
|
SketchCow |
Over time a lot will come out, I'm sure. |
02:05
🔗
|
godane |
after this box i have to digitize the new box of stuff i bought |
02:06
🔗
|
godane |
i think that new box will take about 7 to 10 days to digitize |
02:18
🔗
|
|
bithippo has joined #archiveteam-bs |
02:19
🔗
|
|
Petri152 has quit IRC (Ping timeout: 246 seconds) |
02:19
🔗
|
godane |
btw i'm at 37,887 items now for this month |
02:19
🔗
|
godane |
36,716 are the dtic docs |
02:20
🔗
|
|
Petri152 has joined #archiveteam-bs |
02:21
🔗
|
|
yuitimoth has quit IRC (Read error: Operation timed out) |
02:21
🔗
|
|
yuitimoth has joined #archiveteam-bs |
03:20
🔗
|
|
bithippo has quit IRC (Ping timeout: 260 seconds) |
03:25
🔗
|
|
Mateon1 has quit IRC (Remote host closed the connection) |
03:26
🔗
|
|
Mateon1 has joined #archiveteam-bs |
03:33
🔗
|
|
bithippo has joined #archiveteam-bs |
03:40
🔗
|
bithippo |
Is explicit permission require to add an item to the "Archive Team" collection? |
03:40
🔗
|
bithippo |
s/require/required |
04:02
🔗
|
|
bithippo has quit IRC (Quit: Page closed) |
04:04
🔗
|
|
Jens has quit IRC (Remote host closed the connection) |
04:05
🔗
|
|
Jens has joined #archiveteam-bs |
04:23
🔗
|
|
Sanqui has quit IRC (Ping timeout: 260 seconds) |
04:35
🔗
|
|
Sanqui has joined #archiveteam-bs |
04:39
🔗
|
|
qw3rty119 has joined #archiveteam-bs |
04:44
🔗
|
|
qw3rty118 has quit IRC (Ping timeout: 600 seconds) |
04:51
🔗
|
robogoat |
voltagex: I got the infohashes as well, are you just sampling by querying the DHT for the infohash? |
04:53
🔗
|
voltagex |
Yes |
04:54
🔗
|
voltagex |
Just be aware there's some fucked up stuff in there |
04:56
🔗
|
voltagex |
Left 10k hashes running, but I've got to get some paid work done today haha |
05:01
🔗
|
voltagex |
robogoat: ping me if you're interested in this |
05:13
🔗
|
|
Stilett0 has joined #archiveteam-bs |
05:14
🔗
|
|
Stilett0 has quit IRC (Client Quit) |
05:30
🔗
|
|
BlueMax has quit IRC (Leaving) |
05:35
🔗
|
voltagex |
If all goes well I'll have the 444gb dump tomorrow |
05:57
🔗
|
|
zyphlar_ has joined #archiveteam-bs |
07:40
🔗
|
|
Valentine has quit IRC (Ping timeout: 506 seconds) |
07:43
🔗
|
|
Valentine has joined #archiveteam-bs |
08:07
🔗
|
|
zyphlar_ has quit IRC (Quit: Connection closed for inactivity) |
09:42
🔗
|
|
ranavalon has quit IRC (Read error: Connection reset by peer) |
09:44
🔗
|
|
ranavalon has joined #archiveteam-bs |
10:32
🔗
|
|
Mateon1 has quit IRC (Read error: Operation timed out) |
10:32
🔗
|
|
Mateon1 has joined #archiveteam-bs |
12:07
🔗
|
JAA |
https://www.troyhunt.com/ive-just-launched-pwned-passwords-version-2/ |
12:08
🔗
|
JAA |
Over 500 million hashes now, up from 320M last August. |
12:52
🔗
|
|
klondike has joined #archiveteam-bs |
16:01
🔗
|
godane |
SketchCow: so i think someone here could hack this to get a local wayback machine on rpi project going: https://github.com/alard/warc-proxy |
16:02
🔗
|
godane |
to me it would be the most simple way to get a jump start one |
16:02
🔗
|
godane |
i'm not good at python though so its up to some else to do the hard work |
16:20
🔗
|
JAA |
Goddammit, why is URL parsing so damn complicated? |
18:26
🔗
|
|
Pixi has quit IRC (Quit: Pixi) |
18:27
🔗
|
|
Pixi has joined #archiveteam-bs |
18:35
🔗
|
joepie91 |
JAA: use a library for it? :P |
18:38
🔗
|
|
bitBaron has joined #archiveteam-bs |
18:54
🔗
|
|
MrDignity has quit IRC (Remote host closed the connection) |
18:54
🔗
|
|
MrDignity has joined #archiveteam-bs |
19:00
🔗
|
|
bitBaron has quit IRC (Quit: My computer has gone to sleep. ZZZzzz…) |
19:02
🔗
|
|
bitBaron has joined #archiveteam-bs |
19:12
🔗
|
|
jschwart has joined #archiveteam-bs |
20:19
🔗
|
godane |
SketchCow: tape 24 is getting digitize |
20:20
🔗
|
godane |
tape 23 has no video signal for the last 40 minutes |
20:21
🔗
|
godane |
most likely going named like this: random-tv-mtv-letterman-hard-copy-making-of-oz-1990.mpg |
20:21
🔗
|
godane |
for full tape |
20:27
🔗
|
|
RichardG has quit IRC (Read error: Connection reset by peer) |
20:29
🔗
|
|
RichardG has joined #archiveteam-bs |
20:36
🔗
|
|
ola_norsk has joined #archiveteam-bs |
20:37
🔗
|
ola_norsk |
the webrecorder.io guys seem quite nice. They upped my storage from 1.5Gb to 7Gb when i said i used it to warc to IA |
20:38
🔗
|
ola_norsk |
bytes even* |
20:43
🔗
|
|
WubTheCap has joined #archiveteam-bs |
20:49
🔗
|
ola_norsk |
some claimed webrecorder was 'shady', but i forget the reason. I don't want to fall under some spell for 7GB of online storage.. |
20:49
🔗
|
ola_norsk |
_why_ are they shady? |
21:02
🔗
|
|
atlogbot has quit IRC (Remote host closed the connection) |
21:02
🔗
|
|
swebb has quit IRC (Quit: badcheese.com - where crap sometimes gets done) |
21:14
🔗
|
|
atlogbot has joined #archiveteam-bs |
21:14
🔗
|
|
svchfoo1 sets mode: +v atlogbot |
21:23
🔗
|
|
WubTheCap has quit IRC (Read error: Connection reset by peer) |
21:47
🔗
|
JAA |
joepie91: Yeah. I'm working with wpull currently, which already has that code. I guess I'll look into replacing it with urllib or something, but that's more of a long-term goal. I'm just trying to fix bugs right now to get wpull 2 into usable shape. |
21:51
🔗
|
JAA |
NB, it already uses urllib for some things, but not for everything. |
21:52
🔗
|
JAA |
I'm not sure what the reasoning behind that is, but I'm sure I'll find out about all the subtleties when I try to replace it. |
22:08
🔗
|
|
ola_norsk has quit IRC (It's all goblins and frogs! https://pastebin.com/raw/jeEdHUQC) |
23:01
🔗
|
|
jtn2 has quit IRC (Ping timeout: 492 seconds) |
23:04
🔗
|
|
jschwart has quit IRC (Quit: Konversation terminated!) |
23:08
🔗
|
|
Famicoman has joined #archiveteam-bs |
23:25
🔗
|
|
jtn2 has joined #archiveteam-bs |
23:26
🔗
|
SketchCow |
https://archive.org/details/Popular_Science_1984-06_June_600_dpi |
23:27
🔗
|
SketchCow |
So, unfortunately, his scanner was dirty so streaks on the left pages. I've asked him to rescan. |
23:27
🔗
|
SketchCow |
But the point is there. Lovely. |
23:30
🔗
|
SketchCow |
He's willing to do it all "right" |
23:31
🔗
|
SketchCow |
And we've been working back and forth, lots of mail, and he's debinding magazines and off he goes. He's doing Popular Science and Byte |
23:46
🔗
|
arbin |
how well do the debind scans come out? |
23:47
🔗
|
arbin |
ive considered debinding some stuff to get better scans |
23:47
🔗
|
arbin |
dunno if i really have the right tools for it though (xacto, cutting mat, and metal ruler) |