#archiveteam-bs 2018-07-19,Thu

↑back Search

Time Nickname Message
00:04 πŸ”— RichardG has quit IRC (Read error: Connection reset by peer)
00:05 πŸ”— squires has joined #archiveteam-bs
00:06 πŸ”— RichardG has joined #archiveteam-bs
01:03 πŸ”— vectr0n` has quit IRC (Remote host closed the connection)
01:23 πŸ”— pikhq has quit IRC (Ping timeout: 268 seconds)
02:14 πŸ”— ta9le has quit IRC (Quit: Connection closed for inactivity)
02:34 πŸ”— pikhq has joined #archiveteam-bs
03:31 πŸ”— Frogging has joined #archiveteam-bs
03:44 πŸ”— odemg has quit IRC (Ping timeout: 268 seconds)
03:47 πŸ”— archodg_ has quit IRC (Read error: Operation timed out)
03:54 πŸ”— archodg_ has joined #archiveteam-bs
03:56 πŸ”— odemg has joined #archiveteam-bs
05:26 πŸ”— godane has quit IRC (Read error: Connection reset by peer)
05:32 πŸ”— Pixi has quit IRC (Quit: Pixi)
05:32 πŸ”— Pixi has joined #archiveteam-bs
05:41 πŸ”— godane has joined #archiveteam-bs
05:57 πŸ”— BlueMax has quit IRC (Leaving)
06:06 πŸ”— BlueMax has joined #archiveteam-bs
06:23 πŸ”— wp494 has quit IRC (Ping timeout: 633 seconds)
06:23 πŸ”— wp494 has joined #archiveteam-bs
06:34 πŸ”— achip has quit IRC (west.us.hub irc.Prison.NET)
06:40 πŸ”— achip has joined #archiveteam-bs
06:41 πŸ”— schbirid has joined #archiveteam-bs
06:44 πŸ”— achip has quit IRC (west.us.hub irc.Prison.NET)
06:50 πŸ”— achip has joined #archiveteam-bs
07:59 πŸ”— schbirid has quit IRC (Remote host closed the connection)
08:22 πŸ”— flashfure is now known as Flashfire
09:02 πŸ”— omglolbah anything special to do with the warrior to get it working on wikispaces? Just seems to hang on 'begining work on a project'
09:19 πŸ”— redlizard omglolbah: I think wikispaces is being worked on by more people than the site can sustain, so you may need to wait until the tracker gives you a job, subject to rate limits.
09:19 πŸ”— omglolbah aha
09:19 πŸ”— eientei95 SketchCow: [20:52:56] <schbird> yo, i am in contact with someone on twitter who would love to offload about 1tb of political geospatial data into IA. they say they wrote to info@ but have not heard back
09:19 πŸ”— eientei95 [20:53:27] <schbird> is there someone at IA interested in that sort of thing? if so, i'd like to provide the link between both :)
09:20 πŸ”— eientei95 (#internetarchive)
10:02 πŸ”— BlueMax has quit IRC (Leaving)
10:20 πŸ”— JAA As mentioned in #archivebot by BiggerJ: Comic Genesis, an old comic hosting website, is unstable, semi-broken, and basically unmaintained/unsupported. Might be a good idea to grab it all before it comes crashing down. I'll throw the main page and the forums into ArchiveBot and extract the comic subdomains from those jobs' logs.
11:07 πŸ”— ta9le has joined #archiveteam-bs
11:44 πŸ”— m007a83_ has joined #archiveteam-bs
11:46 πŸ”— m007a83 has quit IRC (Read error: Operation timed out)
11:47 πŸ”— m007a83 has joined #archiveteam-bs
11:52 πŸ”— m007a83_ has quit IRC (Read error: Operation timed out)
12:12 πŸ”— Sanky is now known as Sanqui
12:56 πŸ”— Mateon1 has quit IRC (Ping timeout: 260 seconds)
12:56 πŸ”— Mateon1 has joined #archiveteam-bs
14:20 πŸ”— Dimtree has quit IRC (Read error: Operation timed out)
14:29 πŸ”— Dimtree has joined #archiveteam-bs
14:42 πŸ”— archodg_ SketchCow, arkiver I'm working on this, https://old.reddit.com/r/DataHoarder/comments/906884/youtube_metadata_archive_because_working_with/ something that would be of value to us/ia? Worth putting the data on ia?
14:46 πŸ”— arkiver archodg_: yeah!
14:47 πŸ”— arkiver would be great to have a metadata archive of youtube
14:47 πŸ”— archodg_ sound, I'm bringing a few new machines online get to work on the 133,000,000 ids, building a new list which currently stands at 93,000,000... fucking big task this one!!
14:48 πŸ”— arkiver any idea how many videos youtube has in total?
14:48 πŸ”— archodg_ no idea
14:50 πŸ”— archodg_ too many
14:52 πŸ”— arkiver well I think itΒ΄s definitely something IA wants to have
14:52 πŸ”— arkiver IΒ΄ll bring it up
14:53 πŸ”— archodg_ some sources say 7 billion videos as of late last year
14:54 πŸ”— arkiver so we be looking at 145 TB
14:54 πŸ”— archodg_ this is something we should probably considering farming out and sticking on the tracker, the way I'm doing it there are no limitations other than sheer cpu power
14:54 πŸ”— JAA Rough estimate based on "number of hours uploaded each day" gives ~3 million new videos daily, so yeah, definitely in the billions.
14:55 πŸ”— archodg_ we'd just have to feed it channelIDs or videoIDs depending on how you wanted to code it
14:56 πŸ”— arkiver archodg_: yeah, we could make a warrior project, do some tests how much larger it is if we create WARCs too and decided on WARCs or not
14:56 πŸ”— arkiver no WARCs would be fine, just metadata in that case with youtube-dl
14:57 πŸ”— arkiver I guess this is not real time? https://the-eye.eu/public/Random/yt-metadata-archive/ncdu_archive.txt.mp4
14:57 πŸ”— ta9le has quit IRC (Quit: Connection closed for inactivity)
14:57 πŸ”— archodg_ I'm working toward using something other that ytdl as it's got a lot of overhead which is slowing things down considerably
14:57 πŸ”— archodg_ and no that's just a directory listing of the output after that data took 18 hours to grab
14:58 πŸ”— archodg_ I packed up that data and put it here; https://the-eye.eu/public/Random/yt-metadata-archive/yt as an example
16:11 πŸ”— RichardG has quit IRC (Ping timeout: 268 seconds)
16:14 πŸ”— djsundog has joined #archiveteam-bs
16:15 πŸ”— RichardG has joined #archiveteam-bs
17:07 πŸ”— jrwr eientei95: how is the data stored
17:15 πŸ”— djsundog has quit IRC (The Lounge - https://thelounge.chat)
17:18 πŸ”— schbirid has joined #archiveteam-bs
17:19 πŸ”— djsundog has joined #archiveteam-bs
17:34 πŸ”— ta9le has joined #archiveteam-bs
17:39 πŸ”— Mateon1 has quit IRC (west.us.hub irc.Prison.NET)
17:39 πŸ”— achip has quit IRC (west.us.hub irc.Prison.NET)
18:09 πŸ”— achip has joined #archiveteam-bs
18:09 πŸ”— Mateon1 has joined #archiveteam-bs
18:40 πŸ”— jschwart has joined #archiveteam-bs
18:46 πŸ”— achip has quit IRC (west.us.hub irc.Prison.NET)
18:46 πŸ”— Mateon1 has quit IRC (west.us.hub irc.Prison.NET)
19:16 πŸ”— achip has joined #archiveteam-bs
19:16 πŸ”— Mateon1 has joined #archiveteam-bs
20:22 πŸ”— schbirid has quit IRC (Quit: Leaving)
20:24 πŸ”— chirlu`` has quit IRC (Ping timeout: 268 seconds)
20:56 πŸ”— RichardG has quit IRC (Read error: Connection reset by peer)
20:57 πŸ”— RichardG has joined #archiveteam-bs
21:33 πŸ”— Dimtree has quit IRC (Peace)
21:48 πŸ”— Stilett0 has quit IRC ()
21:53 πŸ”— Dimtree has joined #archiveteam-bs
21:56 πŸ”— Stilett0 has joined #archiveteam-bs
21:58 πŸ”— Silas has joined #archiveteam-bs
22:00 πŸ”— Silas Is it ok to actually scan a VHS tape with a scanner, i.e. would it somehow mess with it? I'm going to try to digitize and upload a tape and I want to avoid as many mistakes as possible
22:00 πŸ”— astrid i can think of no way that doing so would cause any problems
22:00 πŸ”— astrid yes it is ok; no it would not somehow mess with it
22:02 πŸ”— Silas Ok, thank you very much!
22:02 πŸ”— jschwart has quit IRC (Quit: Konversation terminated!)
22:09 πŸ”— Silas has quit IRC (Quit: Page closed)
22:23 πŸ”— Stilett0 has quit IRC (Ping timeout: 264 seconds)
22:28 πŸ”— SketchCow has quit IRC (Read error: Connection reset by peer)
22:28 πŸ”— SketchCow has joined #archiveteam-bs
22:28 πŸ”— swebb sets mode: +o SketchCow
22:53 πŸ”— DopefishJ is now known as DFJustin
22:53 πŸ”— DFJustin has quit IRC (Remote host closed the connection)
22:54 πŸ”— DFJustin has joined #archiveteam-bs
22:54 πŸ”— swebb sets mode: +o DFJustin
23:00 πŸ”— RichardG has quit IRC (Read error: Connection reset by peer)
23:01 πŸ”— RichardG has joined #archiveteam-bs
23:12 πŸ”— Stilett0 has joined #archiveteam-bs
23:29 πŸ”— ta9le has quit IRC (Quit: Connection closed for inactivity)
23:39 πŸ”— Stilett0 has quit IRC (Ping timeout: 252 seconds)
23:42 πŸ”— Stilett0 has joined #archiveteam-bs
23:47 πŸ”— Stilett0 has quit IRC (Ping timeout: 255 seconds)
23:51 πŸ”— Stilett0 has joined #archiveteam-bs
23:54 πŸ”— Stiletto has joined #archiveteam-bs
23:58 πŸ”— jrwr has quit IRC (Read error: Operation timed out)
23:59 πŸ”— jrwr has joined #archiveteam-bs
23:59 πŸ”— Stiletto has quit IRC (Ping timeout: 260 seconds)
23:59 πŸ”— Stiletto has joined #archiveteam-bs

irclogger-viewer