Time |
Nickname |
Message |
00:04
|
|
RichardG has quit IRC (Read error: Connection reset by peer) |
00:05
|
|
squires has joined #archiveteam-bs |
00:06
|
|
RichardG has joined #archiveteam-bs |
01:03
|
|
vectr0n` has quit IRC (Remote host closed the connection) |
01:23
|
|
pikhq has quit IRC (Ping timeout: 268 seconds) |
02:14
|
|
ta9le has quit IRC (Quit: Connection closed for inactivity) |
02:34
|
|
pikhq has joined #archiveteam-bs |
03:31
|
|
Frogging has joined #archiveteam-bs |
03:44
|
|
odemg has quit IRC (Ping timeout: 268 seconds) |
03:47
|
|
archodg_ has quit IRC (Read error: Operation timed out) |
03:54
|
|
archodg_ has joined #archiveteam-bs |
03:56
|
|
odemg has joined #archiveteam-bs |
05:26
|
|
godane has quit IRC (Read error: Connection reset by peer) |
05:32
|
|
Pixi has quit IRC (Quit: Pixi) |
05:32
|
|
Pixi has joined #archiveteam-bs |
05:41
|
|
godane has joined #archiveteam-bs |
05:57
|
|
BlueMax has quit IRC (Leaving) |
06:06
|
|
BlueMax has joined #archiveteam-bs |
06:23
|
|
wp494 has quit IRC (Ping timeout: 633 seconds) |
06:23
|
|
wp494 has joined #archiveteam-bs |
06:34
|
|
achip has quit IRC (west.us.hub irc.Prison.NET) |
06:40
|
|
achip has joined #archiveteam-bs |
06:41
|
|
schbirid has joined #archiveteam-bs |
06:44
|
|
achip has quit IRC (west.us.hub irc.Prison.NET) |
06:50
|
|
achip has joined #archiveteam-bs |
07:59
|
|
schbirid has quit IRC (Remote host closed the connection) |
08:22
|
|
flashfure is now known as Flashfire |
09:02
|
omglolbah |
anything special to do with the warrior to get it working on wikispaces? Just seems to hang on 'begining work on a project' |
09:19
|
redlizard |
omglolbah: I think wikispaces is being worked on by more people than the site can sustain, so you may need to wait until the tracker gives you a job, subject to rate limits. |
09:19
|
omglolbah |
aha |
09:19
|
eientei95 |
SketchCow: [20:52:56] <schbird> yo, i am in contact with someone on twitter who would love to offload about 1tb of political geospatial data into IA. they say they wrote to info@ but have not heard back |
09:19
|
eientei95 |
[20:53:27] <schbird> is there someone at IA interested in that sort of thing? if so, i'd like to provide the link between both :) |
09:20
|
eientei95 |
(#internetarchive) |
10:02
|
|
BlueMax has quit IRC (Leaving) |
10:20
|
JAA |
As mentioned in #archivebot by BiggerJ: Comic Genesis, an old comic hosting website, is unstable, semi-broken, and basically unmaintained/unsupported. Might be a good idea to grab it all before it comes crashing down. I'll throw the main page and the forums into ArchiveBot and extract the comic subdomains from those jobs' logs. |
11:07
|
|
ta9le has joined #archiveteam-bs |
11:44
|
|
m007a83_ has joined #archiveteam-bs |
11:46
|
|
m007a83 has quit IRC (Read error: Operation timed out) |
11:47
|
|
m007a83 has joined #archiveteam-bs |
11:52
|
|
m007a83_ has quit IRC (Read error: Operation timed out) |
12:12
|
|
Sanky is now known as Sanqui |
12:56
|
|
Mateon1 has quit IRC (Ping timeout: 260 seconds) |
12:56
|
|
Mateon1 has joined #archiveteam-bs |
14:20
|
|
Dimtree has quit IRC (Read error: Operation timed out) |
14:29
|
|
Dimtree has joined #archiveteam-bs |
14:42
|
archodg_ |
SketchCow, arkiver I'm working on this, https://old.reddit.com/r/DataHoarder/comments/906884/youtube_metadata_archive_because_working_with/ something that would be of value to us/ia? Worth putting the data on ia? |
14:46
|
arkiver |
archodg_: yeah! |
14:47
|
arkiver |
would be great to have a metadata archive of youtube |
14:47
|
archodg_ |
sound, I'm bringing a few new machines online get to work on the 133,000,000 ids, building a new list which currently stands at 93,000,000... fucking big task this one!! |
14:48
|
arkiver |
any idea how many videos youtube has in total? |
14:48
|
archodg_ |
no idea |
14:50
|
archodg_ |
too many |
14:52
|
arkiver |
well I think it's definitely something IA wants to have |
14:52
|
arkiver |
I'll bring it up |
14:53
|
archodg_ |
some sources say 7 billion videos as of late last year |
14:54
|
arkiver |
so we be looking at 145 TB |
14:54
|
archodg_ |
this is something we should probably consider farming out and sticking on the tracker, the way I'm doing it there are no limitations other than sheer cpu power |
14:54
|
JAA |
Rough estimate based on "number of hours uploaded each day" gives ~3 million new videos daily, so yeah, definitely in the billions. |
14:55
|
archodg_ |
we'd just have to feed it channelIDs or videoIDs depending on how you wanted to code it |
14:56
|
arkiver |
archodg_: yeah, we could make a warrior project, do some tests how much larger it is if we create WARCs too and decide on WARCs or not |
14:56
|
arkiver |
no WARCs would be fine, just metadata in that case with youtube-dl |
14:57
|
arkiver |
I guess this is not real time? https://the-eye.eu/public/Random/yt-metadata-archive/ncdu_archive.txt.mp4 |
14:57
|
|
ta9le has quit IRC (Quit: Connection closed for inactivity) |
14:57
|
archodg_ |
I'm working toward using something other than ytdl as it's got a lot of overhead which is slowing things down considerably |
14:57
|
archodg_ |
and no that's just a directory listing of the output after that data took 18 hours to grab |
14:58
|
archodg_ |
I packed up that data and put it here; https://the-eye.eu/public/Random/yt-metadata-archive/yt as an example |
16:11
|
|
RichardG has quit IRC (Ping timeout: 268 seconds) |
16:14
|
|
djsundog has joined #archiveteam-bs |
16:15
|
|
RichardG has joined #archiveteam-bs |
17:07
|
jrwr |
eientei95: how is the data stored |
17:15
|
|
djsundog has quit IRC (The Lounge - https://thelounge.chat) |
17:18
|
|
schbirid has joined #archiveteam-bs |
17:19
|
|
djsundog has joined #archiveteam-bs |
17:34
|
|
ta9le has joined #archiveteam-bs |
17:39
|
|
Mateon1 has quit IRC (west.us.hub irc.Prison.NET) |
17:39
|
|
achip has quit IRC (west.us.hub irc.Prison.NET) |
18:09
|
|
achip has joined #archiveteam-bs |
18:09
|
|
Mateon1 has joined #archiveteam-bs |
18:40
|
|
jschwart has joined #archiveteam-bs |
18:46
|
|
achip has quit IRC (west.us.hub irc.Prison.NET) |
18:46
|
|
Mateon1 has quit IRC (west.us.hub irc.Prison.NET) |
19:16
|
|
achip has joined #archiveteam-bs |
19:16
|
|
Mateon1 has joined #archiveteam-bs |
20:22
|
|
schbirid has quit IRC (Quit: Leaving) |
20:24
|
|
chirlu`` has quit IRC (Ping timeout: 268 seconds) |
20:56
|
|
RichardG has quit IRC (Read error: Connection reset by peer) |
20:57
|
|
RichardG has joined #archiveteam-bs |
21:33
|
|
Dimtree has quit IRC (Peace) |
21:48
|
|
Stilett0 has quit IRC () |
21:53
|
|
Dimtree has joined #archiveteam-bs |
21:56
|
|
Stilett0 has joined #archiveteam-bs |
21:58
|
|
Silas has joined #archiveteam-bs |
22:00
|
Silas |
Is it ok to actually scan a VHS tape with a scanner, i.e. would it somehow mess with it? I'm going to try to digitize and upload a tape and I want to avoid as many mistakes as possible |
22:00
|
astrid |
i can think of no way that doing so would cause any problems |
22:00
|
astrid |
yes it is ok; no it would not somehow mess with it |
22:02
|
Silas |
Ok, thank you very much! |
22:02
|
|
jschwart has quit IRC (Quit: Konversation terminated!) |
22:09
|
|
Silas has quit IRC (Quit: Page closed) |
22:23
|
|
Stilett0 has quit IRC (Ping timeout: 264 seconds) |
22:28
|
|
SketchCow has quit IRC (Read error: Connection reset by peer) |
22:28
|
|
SketchCow has joined #archiveteam-bs |
22:28
|
|
swebb sets mode: +o SketchCow |
22:53
|
|
DopefishJ is now known as DFJustin |
22:53
|
|
DFJustin has quit IRC (Remote host closed the connection) |
22:54
|
|
DFJustin has joined #archiveteam-bs |
22:54
|
|
swebb sets mode: +o DFJustin |
23:00
|
|
RichardG has quit IRC (Read error: Connection reset by peer) |
23:01
|
|
RichardG has joined #archiveteam-bs |
23:12
|
|
Stilett0 has joined #archiveteam-bs |
23:29
|
|
ta9le has quit IRC (Quit: Connection closed for inactivity) |
23:39
|
|
Stilett0 has quit IRC (Ping timeout: 252 seconds) |
23:42
|
|
Stilett0 has joined #archiveteam-bs |
23:47
|
|
Stilett0 has quit IRC (Ping timeout: 255 seconds) |
23:51
|
|
Stilett0 has joined #archiveteam-bs |
23:54
|
|
Stiletto has joined #archiveteam-bs |
23:58
|
|
jrwr has quit IRC (Read error: Operation timed out) |
23:59
|
|
jrwr has joined #archiveteam-bs |
23:59
|
|
Stiletto has quit IRC (Ping timeout: 260 seconds) |
23:59
|
|
Stiletto has joined #archiveteam-bs |