[00:04] *** RichardG has quit IRC (Read error: Connection reset by peer) [00:05] *** squires has joined #archiveteam-bs [00:06] *** RichardG has joined #archiveteam-bs [01:03] *** vectr0n` has quit IRC (Remote host closed the connection) [01:23] *** pikhq has quit IRC (Ping timeout: 268 seconds) [02:14] *** ta9le has quit IRC (Quit: Connection closed for inactivity) [02:34] *** pikhq has joined #archiveteam-bs [03:31] *** Frogging has joined #archiveteam-bs [03:44] *** odemg has quit IRC (Ping timeout: 268 seconds) [03:47] *** archodg_ has quit IRC (Read error: Operation timed out) [03:54] *** archodg_ has joined #archiveteam-bs [03:56] *** odemg has joined #archiveteam-bs [05:26] *** godane has quit IRC (Read error: Connection reset by peer) [05:32] *** Pixi has quit IRC (Quit: Pixi) [05:32] *** Pixi has joined #archiveteam-bs [05:41] *** godane has joined #archiveteam-bs [05:57] *** BlueMax has quit IRC (Leaving) [06:06] *** BlueMax has joined #archiveteam-bs [06:23] *** wp494 has quit IRC (Ping timeout: 633 seconds) [06:23] *** wp494 has joined #archiveteam-bs [06:34] *** achip has quit IRC (west.us.hub irc.Prison.NET) [06:40] *** achip has joined #archiveteam-bs [06:41] *** schbirid has joined #archiveteam-bs [06:44] *** achip has quit IRC (west.us.hub irc.Prison.NET) [06:50] *** achip has joined #archiveteam-bs [07:59] *** schbirid has quit IRC (Remote host closed the connection) [08:22] *** flashfure is now known as Flashfire [09:02] anything special to do with the warrior to get it working on wikispaces? Just seems to hang on 'begining work on a project' [09:19] omglolbah: I think wikispaces is being worked on by more people than the site can sustain, so you may need to wait until the tracker gives you a job, subject to rate limits. [09:19] aha [09:19] SketchCow: [20:52:56] yo, i am in contact with someone on twitter who would love to offload about 1tb of political geospatial data into IA. they say they wrote to info@ but have not heard back [09:19] [20:53:27] is there someone at IA interested in that sort of thing? if so, i'd like to provide the link between both :) [09:20] (#internetarchive) [10:02] *** BlueMax has quit IRC (Leaving) [10:20] As mentioned in #archivebot by BiggerJ: Comic Genesis, an old comic hosting website, is unstable, semi-broken, and basically unmaintained/unsupported. Might be a good idea to grab it all before it comes crashing down. I'll throw the main page and the forums into ArchiveBot and extract the comic subdomains from those jobs' logs. [11:07] *** ta9le has joined #archiveteam-bs [11:44] *** m007a83_ has joined #archiveteam-bs [11:46] *** m007a83 has quit IRC (Read error: Operation timed out) [11:47] *** m007a83 has joined #archiveteam-bs [11:52] *** m007a83_ has quit IRC (Read error: Operation timed out) [12:12] *** Sanky is now known as Sanqui [12:56] *** Mateon1 has quit IRC (Ping timeout: 260 seconds) [12:56] *** Mateon1 has joined #archiveteam-bs [14:20] *** Dimtree has quit IRC (Read error: Operation timed out) [14:29] *** Dimtree has joined #archiveteam-bs [14:42] SketchCow, arkiver I'm working on this, https://old.reddit.com/r/DataHoarder/comments/906884/youtube_metadata_archive_because_working_with/ something that would be of value to us/ia? Worth putting the data on ia? [14:46] archodg_: yeah! [14:47] would be great to have a metadata archive of youtube [14:47] sound, I'm bringing a few new machines online get to work on the 133,000,000 ids, building a new list which currently stands at 93,000,000... fucking big task this one!! [14:48] any idea how many videos youtube has in total? [14:48] no idea [14:50] too many [14:52] well I think it´s definitely something IA wants to have [14:52] I´ll bring it up [14:53] some sources say 7 billion videos as of late last year [14:54] so we be looking at 145 TB [14:54] this is something we should probably considering farming out and sticking on the tracker, the way I'm doing it there are no limitations other than sheer cpu power [14:54] Rough estimate based on "number of hours uploaded each day" gives ~3 million new videos daily, so yeah, definitely in the billions. [14:55] we'd just have to feed it channelIDs or videoIDs depending on how you wanted to code it [14:56] archodg_: yeah, we could make a warrior project, do some tests how much larger it is if we create WARCs too and decided on WARCs or not [14:56] no WARCs would be fine, just metadata in that case with youtube-dl [14:57] I guess this is not real time? https://the-eye.eu/public/Random/yt-metadata-archive/ncdu_archive.txt.mp4 [14:57] *** ta9le has quit IRC (Quit: Connection closed for inactivity) [14:57] I'm working toward using something other that ytdl as it's got a lot of overhead which is slowing things down considerably [14:57] and no that's just a directory listing of the output after that data took 18 hours to grab [14:58] I packed up that data and put it here; https://the-eye.eu/public/Random/yt-metadata-archive/yt as an example [16:11] *** RichardG has quit IRC (Ping timeout: 268 seconds) [16:14] *** djsundog has joined #archiveteam-bs [16:15] *** RichardG has joined #archiveteam-bs [17:07] eientei95: how is the data stored [17:15] *** djsundog has quit IRC (The Lounge - https://thelounge.chat) [17:18] *** schbirid has joined #archiveteam-bs [17:19] *** djsundog has joined #archiveteam-bs [17:34] *** ta9le has joined #archiveteam-bs [17:39] *** Mateon1 has quit IRC (west.us.hub irc.Prison.NET) [17:39] *** achip has quit IRC (west.us.hub irc.Prison.NET) [18:09] *** achip has joined #archiveteam-bs [18:09] *** Mateon1 has joined #archiveteam-bs [18:40] *** jschwart has joined #archiveteam-bs [18:46] *** achip has quit IRC (west.us.hub irc.Prison.NET) [18:46] *** Mateon1 has quit IRC (west.us.hub irc.Prison.NET) [19:16] *** achip has joined #archiveteam-bs [19:16] *** Mateon1 has joined #archiveteam-bs [20:22] *** schbirid has quit IRC (Quit: Leaving) [20:24] *** chirlu`` has quit IRC (Ping timeout: 268 seconds) [20:56] *** RichardG has quit IRC (Read error: Connection reset by peer) [20:57] *** RichardG has joined #archiveteam-bs [21:33] *** Dimtree has quit IRC (Peace) [21:48] *** Stilett0 has quit IRC () [21:53] *** Dimtree has joined #archiveteam-bs [21:56] *** Stilett0 has joined #archiveteam-bs [21:58] *** Silas has joined #archiveteam-bs [22:00] Is it ok to actually scan a VHS tape with a scanner, i.e. would it somehow mess with it? I'm going to try to digitize and upload a tape and I want to avoid as many mistakes as possible [22:00] i can think of no way that doing so would cause any problems [22:00] yes it is ok; no it would not somehow mess with it [22:02] Ok, thank you very much! [22:02] *** jschwart has quit IRC (Quit: Konversation terminated!) [22:09] *** Silas has quit IRC (Quit: Page closed) [22:23] *** Stilett0 has quit IRC (Ping timeout: 264 seconds) [22:28] *** SketchCow has quit IRC (Read error: Connection reset by peer) [22:28] *** SketchCow has joined #archiveteam-bs [22:28] *** swebb sets mode: +o SketchCow [22:53] *** DopefishJ is now known as DFJustin [22:53] *** DFJustin has quit IRC (Remote host closed the connection) [22:54] *** DFJustin has joined #archiveteam-bs [22:54] *** swebb sets mode: +o DFJustin [23:00] *** RichardG has quit IRC (Read error: Connection reset by peer) [23:01] *** RichardG has joined #archiveteam-bs [23:12] *** Stilett0 has joined #archiveteam-bs [23:29] *** ta9le has quit IRC (Quit: Connection closed for inactivity) [23:39] *** Stilett0 has quit IRC (Ping timeout: 252 seconds) [23:42] *** Stilett0 has joined #archiveteam-bs [23:47] *** Stilett0 has quit IRC (Ping timeout: 255 seconds) [23:51] *** Stilett0 has joined #archiveteam-bs [23:54] *** Stiletto has joined #archiveteam-bs [23:58] *** jrwr has quit IRC (Read error: Operation timed out) [23:59] *** jrwr has joined #archiveteam-bs [23:59] *** Stiletto has quit IRC (Ping timeout: 260 seconds) [23:59] *** Stiletto has joined #archiveteam-bs