#archiveteam-bs 2016-04-08,Fri

Time Nickname Message
00:02 dashcloud I got to see Dweezil perform a set before- it was pretty good.
00:24 BlueMaxim has joined #archiveteam-bs
00:44 will has quit IRC (Ping timeout: 244 seconds)
00:59 will has joined #archiveteam-bs
01:00 lytv has joined #archiveteam-bs
01:05 vtyl has quit IRC (Read error: Operation timed out)
01:13 dashcloud has quit IRC (Read error: Operation timed out)
01:16 JesseW has joined #archiveteam-bs
01:28 dashcloud has joined #archiveteam-bs
02:00 tomwsmf-a has joined #archiveteam-bs
03:01 bwn has quit IRC (Read error: Operation timed out)
04:00 tomwsmf-a has quit IRC (Read error: Operation timed out)
04:08 Sk1d has quit IRC (Ping timeout: 194 seconds)
04:10 dashcloud has quit IRC (Read error: Operation timed out)
04:13 Sk1d has joined #archiveteam-bs
04:17 dashcloud has joined #archiveteam-bs
04:48 bsmith093 JesseW: one last thing regarding that sql db you made: how far past 10 million did the stories go? for the sake of round numbers, that's where my new grab starts, and i can't think of a quick way to check.
04:49 bsmith093 JesseW: the story id is not a number, and i can't think of a way to break it up that way, plus the numbers aren't zero-padded, so that's annoying.
05:03 marvinw is now known as ivan`
05:40 JesseW bsmith093: hm
05:44 JesseW Well, ordering the "Story URL" column gives me:
05:48 JesseW it goes up to at least https://www.fanfiction.net/s/9999903/1/
05:50 JesseW now running select "Story URL" from metadata order by "Story URL" desc limit 10;
05:50 JesseW it is taking a while
05:51 bsmith093 JesseW: is null data literally nothing, or is it a single space, in this table?
05:52 JesseW nothing -- it's not a single space
05:53 bsmith093 select path from metadata where "story url" is null; that should return something, because you told me there were entries with a lot of missing data
05:54 JesseW yes, 59, IIRC
05:54 bsmith093 can i pull data where any column is null?
05:54 JesseW query finished
05:54 JesseW The largest story URL is https://www.fanfiction.net/s/9999999/1/
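If "Story URL" is stored as text, `order by ... desc` compares the URLs character by character, so with ids that aren't zero-padded a 7-digit id starting with 9 sorts above any 8-digit id starting with 1. A minimal sketch of comparing the text maximum with the numeric maximum, assuming an SQLite database; the rows below are made up for illustration (the 10000005 id in particular is hypothetical), and only the `metadata` table and "Story URL" column names come from the log:

```python
import re
import sqlite3

# In-memory stand-in for the real database (assumed to be SQLite).
conn = sqlite3.connect(":memory:")
conn.execute('CREATE TABLE metadata (path TEXT, "Story URL" TEXT)')

# Illustrative rows only; the 8-digit id is hypothetical.
sample_urls = [
    "https://www.fanfiction.net/s/9999903/1/",
    "https://www.fanfiction.net/s/9999999/1/",
    "https://www.fanfiction.net/s/10000005/1/",
    "https://www.fanfiction.net/s/2279489/1/",
]
conn.executemany(
    "INSERT INTO metadata VALUES (?, ?)",
    [(f"story{i}.txt", url) for i, url in enumerate(sample_urls)],
)

# Text ordering: the query from the log, limited to the top row.
text_max = conn.execute(
    'SELECT "Story URL" FROM metadata ORDER BY "Story URL" DESC LIMIT 1'
).fetchone()[0]


def story_id(url):
    """Extract the numeric story id from a /s/<id>/ URL, or None if absent."""
    match = re.search(r"/s/(\d+)", url)
    return int(match.group(1)) if match else None


# Numeric ordering: parse the ids in Python and take the largest.
ids = [story_id(row[0]) for row in conn.execute('SELECT "Story URL" FROM metadata')]
numeric_max = max(i for i in ids if i is not None)

print(text_max)
print(numeric_max)
```

With these sample rows the two answers differ, so a text-sorted maximum of 9999999 does not by itself rule out ids above 10 million.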
05:54 bsmith093 select path from metadata where * is null; that's not working either
05:55 JesseW * means "all the columns" -- you can't ask "are all the columns null" that way
05:55 bsmith093 awesome. no overlap!
05:55 JesseW also, even if you did, you'd just get null back :-)
05:55 bsmith093 right, because path would never be null.
05:56 bsmith093 return path where at least one other column is null?
05:58 JesseW well, afaik, all the ones with one null column have all the rest null as well (except path)
06:00 bsmith093 JesseW: but i keep checking, and it's not returning anything
06:02 JesseW apparently they are empty strings, not null
06:02 JesseW select * from metadata where language = '';
06:02 JesseW will give you the paths
06:03 JesseW and there should be 79 of them
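The difference between NULL and an empty string is why `is null` returned nothing here. A small sketch, assuming SQLite and a made-up three-column table (only `metadata`, `path` and `language` appear in the log; the `words` column and all rows are hypothetical), of how the two tests behave and one way to ask for "path where at least one other column is missing":

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE metadata (path TEXT, language TEXT, words TEXT)")
conn.executemany(
    "INSERT INTO metadata VALUES (?, ?, ?)",
    [
        ("a.txt", "English", "1200"),  # fully populated
        ("b.txt", "", ""),             # empty strings, as JesseW found
        ("c.txt", None, "300"),        # a real NULL, for contrast
    ],
)


def paths(sql):
    """Run a query and return the first column as a sorted list."""
    return sorted(row[0] for row in conn.execute(sql))


# IS NULL matches only real NULLs, never empty strings.
is_null = paths("SELECT path FROM metadata WHERE language IS NULL")

# Comparing against '' matches only empty strings, never NULLs.
is_empty = paths("SELECT path FROM metadata WHERE language = ''")

# There is no "* IS NULL"; each column has to be named and joined with OR.
# NULLIF(col, '') turns an empty string into NULL so one test covers both.
any_missing = paths(
    "SELECT path FROM metadata "
    "WHERE NULLIF(language, '') IS NULL OR NULLIF(words, '') IS NULL"
)

print(is_null, is_empty, any_missing)
```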
06:04 bsmith093 i've also found a weird quirk. apparently some stories literally have a word count of 0; that's what the website returns as the metadata. nothing we can do about it, of course, just interesting
06:05 bsmith093 here's an example http://www.fanfiction.net/s/2279489/1/
06:05 JesseW hm, that is odd
06:08 bsmith093 Twilight - Drowning in Chaos - Mesmerizing angel.txt id is fanfiction.net/s/6958332, according to google. these must have been an edge case for the scraper.
06:10 bsmith093 how does count work? i can't find decent docs.
06:16 bsmith093 i figured it out, but damn, that's not intuitive syntax at *all*.
06:16 JesseW lol. No, no it is not.
06:16 bsmith093 select count (*) from metadata where language = '';
06:17 bsmith093 what else could i possibly want but all of that group? why give the option?!
06:22 JesseW because it is actually counting the number of *non-null* values *in the given column*.
06:22 JesseW it's just that most often, you want a count of any non-null columns.
06:23 JesseW er, a count of all rows with *any* non-null column
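To make the `count` distinction concrete: in standard SQL, `count(*)` counts rows, while `count(column)` counts only the rows where that column is not NULL. A minimal sketch, assuming SQLite and made-up rows (only the `metadata` table and `language` column come from the log):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE metadata (path TEXT, language TEXT)")
conn.executemany(
    "INSERT INTO metadata VALUES (?, ?)",
    [
        ("a.txt", "English"),
        ("b.txt", ""),    # empty string: still a non-NULL value
        ("c.txt", None),  # NULL: skipped by count(language)
    ],
)


def scalar(sql):
    """Run a query that returns a single value and return that value."""
    return conn.execute(sql).fetchone()[0]


all_rows = scalar("SELECT count(*) FROM metadata")             # every row
non_null_language = scalar("SELECT count(language) FROM metadata")  # NULLs skipped
empty_language = scalar("SELECT count(*) FROM metadata WHERE language = ''")

print(all_rows, non_null_language, empty_language)
```

The last query is the one bsmith093 ran; the filtering happens in the `where` clause, and `count(*)` just counts whatever rows survive it.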
06:24 Honno has joined #archiveteam-bs
06:28 bsmith093 thanks.
06:33 JesseW glad to help
06:36 metalcamp has joined #archiveteam-bs
06:48 JesseW has quit IRC (Ping timeout: 370 seconds)
07:09 bwn has joined #archiveteam-bs
09:38 mr-b has quit IRC (Read error: Operation timed out)
09:38 Jonimus has quit IRC (Read error: Operation timed out)
09:39 Jonimus has joined #archiveteam-bs
09:39 swebb sets mode: +o Jonimus
09:55 mr-b has joined #archiveteam-bs
10:52 Stiletto has quit IRC (Ping timeout: 250 seconds)
10:53 Sk1d has quit IRC (Ping timeout: 250 seconds)
10:53 logchfoo4 has quit IRC (Ping timeout: 250 seconds)
10:57 logchfoo1 starts logging #archiveteam-bs at Fri Apr 08 10:57:28 2016
10:57 logchfoo1 has joined #archiveteam-bs
11:02 koon has quit IRC (Ping timeout: 250 seconds)
11:38 koon has joined #archiveteam-bs
11:44 Fletcher_ has joined #archiveteam-bs
12:18 BlueMaxim has quit IRC (Quit: Leaving)
12:53 metalcamp has quit IRC (Quit: Bye)
13:05 metalcamp has joined #archiveteam-bs
13:38 SketchCow http://archiveguide.witness.org/
13:51 atrocity all the videos i download aren't for public consumption, lol
13:51 atrocity actually, i don't even know who would own the copyright on the shit i download
14:16 jut has joined #archiveteam-bs
14:24 Start has quit IRC (Quit: Disconnected.)
14:38 vitzli has joined #archiveteam-bs
15:26 Start has joined #archiveteam-bs
15:40 JesseW has joined #archiveteam-bs
16:09 Start has quit IRC (Quit: Disconnected.)
16:13 Start has joined #archiveteam-bs
16:15 JesseW has quit IRC (Ping timeout: 370 seconds)
16:20 Start has quit IRC (Quit: Disconnected.)
16:26 Frogging atrocity: Needs more LZMA
16:27 Frogging oh wait, 7z is LZMA
16:29 xmc 7zma
16:29 vitzli has quit IRC (Quit: Leaving)
16:38 VADemon has joined #archiveteam-bs
16:48 Honno has quit IRC (Read error: Operation timed out)
16:49 Honno has joined #archiveteam-bs
17:09 jspiros has joined #archiveteam-bs
17:09 yakfish has joined #archiveteam-bs
17:09 matthusby has joined #archiveteam-bs
17:13 SadDM has joined #archiveteam-bs
17:13 swebb sets mode: +o SadDM
17:47 Stilett0 is now known as Stiletto
17:50 Atluxity https://www.youtube.com/watch?v=D2fSXp6N-vs
17:59 PurpleSym I wonder if there's a program to make this kind of video. As in: feed it timestamped transcripts of audio/video and a text. Should be fairly easy actually, except for the timestamped transcripts.
18:01 SketchCow Yes
18:23 Start has joined #archiveteam-bs
18:49 Honno has quit IRC (Read error: Operation timed out)
19:01 Honno has joined #archiveteam-bs
19:04 Boppen has quit IRC (Ping timeout: 194 seconds)
19:04 Sk2d has joined #archiveteam-bs
19:09 jut has quit IRC (Read error: Connection reset by peer)
19:10 Sk1d has quit IRC (hub.se irc.du.se)
19:23 Start has quit IRC (Ping timeout: 260 seconds)
19:25 Sk2d is now known as Sk1d
19:30 bwn has quit IRC (Read error: Operation timed out)
19:31 Start has joined #archiveteam-bs
19:44 Start has quit IRC (Quit: Disconnected.)
19:49 Start has joined #archiveteam-bs
19:49 bwn has joined #archiveteam-bs
19:51 Sanqui has quit IRC (Read error: Operation timed out)
19:51 Sanqui has joined #archiveteam-bs
19:53 Start has quit IRC (Client Quit)
19:56 Infreq_ has quit IRC (Read error: Operation timed out)
19:56 Sanqui has quit IRC (Read error: Operation timed out)
20:04 Infreq has joined #archiveteam-bs
20:13 Sanqui has joined #archiveteam-bs
20:47 metalcamp has quit IRC (Ping timeout: 244 seconds)
20:59 godane SketchCow: i'm up to 2015-04-30 with kpfa
21:00 godane we are less than a year behind now
21:02 Atluxity I have just acquired two dv-tape recorders/players and will start a project digitizing ~150 dv tapes my organization got archived
21:04 Start has joined #archiveteam-bs
21:05 Honno has quit IRC (Read error: Operation timed out)
21:10 BlueMaxim has joined #archiveteam-bs
21:14 Start has quit IRC (Quit: Disconnected.)
21:35 atrocity nice!
21:42 Atluxity tens of years of free software talks
21:51 Honno has joined #archiveteam-bs
22:05 Start has joined #archiveteam-bs
22:18 BlueMaxim oh god joepie91 that brings back so many memories
22:22 joepie91 :
22:22 joepie91 :D*
22:27 slpeeds has joined #archiveteam-bs
22:34 fdo54ss has quit IRC (Ping timeout: 633 seconds)
22:43 Stiletto is now known as Stilett0
23:08 Honno has quit IRC (Read error: Operation timed out)
23:53 dashcloud has quit IRC (Read error: Operation timed out)