[00:21] *** primus104 has quit IRC (Leaving.) [00:23] *** Start has quit IRC (Quit: Disconnected.) [00:23] *** Start has joined #archiveteam-bs [00:27] *** ripvanwin has joined #archiveteam-bs [00:27] *** ripvanwin has quit IRC (Client Quit) [00:35] *** wyatt8740 has joined #archiveteam-bs [01:36] *** vitzli has joined #archiveteam-bs [01:48] SketchCow: i just found out the Beijing Review keeps pdfs on there site [01:48] oh my god [01:48] i can get them going back to 2001 [01:49] *** BlueMaxim has joined #archiveteam-bs [01:51] Neat [01:52] godane: Who are they? A news source? [01:53] https://en.wikipedia.org/wiki/Beijing_Review [01:53] How old are you, anomie? [01:54] SketchCow: 19 [01:54] Usually I'm not so lazy, I'm just a little retarded today. [01:55] i see pdfs of it going back to 1988 [01:56] Yeah, that's worth archiving. [01:57] But now wonder I wonder… are there journals that are distributed more discreetly due to China's pro-active censorship? [01:58] The older the better, bet they have interesting things to say about Taiwan. [02:03] i don't think i will get full archives [02:03] but will get full archives for the past few years [02:50] so i found this website: http://www.massline.org/ [02:50] it has tons of Peking Review/ Beijing Review [02:52] i throw it to the archivebot [02:57] You remember when photobucket broke the internet? [03:15] anomie: ... you're not the same anomie i know from the snes scene... [03:15] There's another one? [03:15] I guess it's not surprising. It's a word in the dictionary. [03:15] or are you [03:15] hmm [03:16] I don't think so. [03:16] the anomie mentioned here: http://board.zsnes.com/phpBB3/viewtopic.php?f=6&t=4946 [03:17] Nope. [03:18] and https://en.wikipedia.org/wiki/User:Anomie is the same anomie as well [03:19] ok fair enough [03:20] oh, I thought that name seemed familiar [03:22] Should I be all right if I feed libcom to the bot? [03:23] I don't think it's unreasonably big. [03:50] *** Start has quit IRC (Quit: Disconnected.) [03:52] *** Start has joined #archiveteam-bs [04:08] *** SN4T14_ has joined #archiveteam-bs [04:12] *** SN4T14 has quit IRC (Ping timeout: 483 seconds) [04:30] *** aaaaaaaaa has quit IRC (Leaving) [05:13] *** zenguy_pc has quit IRC (Read error: Connection reset by peer) [05:14] *** zenguy_pc has joined #archiveteam-bs [05:51] Oh, i assumed you were the Wikipedia user, lol [06:04] i'm uploading may 12 2015 to may 18 2015 of medium.com [06:04] i'm also going to bed now [06:04] bbl [06:48] *** RedType_ has quit IRC (Remote host closed the connection) [06:51] *** arkiver2 has joined #archiveteam-bs [06:56] *** RedType has joined #archiveteam-bs [06:57] *** arkiver2 has quit IRC (Ping timeout: 252 seconds) [07:03] *** PurpleSym has joined #archiveteam-bs [07:46] *** qwebirc56 has joined #archiveteam-bs [07:48] Hmm… formatting those fanfiction stories automatically might be difficult. [07:48] so yeah, i dont think they all have a table of contents at the beginning like that , i just used the basic config from the web service to make that, they do have Chapter 2 "chapter name" though [07:48] its just markdown formattion [07:49] It doesn't look like markdown to me. [07:50] anomie: the ironic thing is , the script does actually have an epub setting, that i chose not to use because i assumed text files would be smaller. [07:50] anomie: ok, by markdown i mean *this* is italic and _this_ is bold [07:50] is that not what that means? [07:52] they're all utf-8 the block of metadata at the beginning is always there, just some of them dont have the times, just the dates. [07:52] tsp_: here the magnet for the gzip torrent "magnet:?xt=urn:btih:3E2HBHI4P4N7E3MCM4MIATPF66STOV64&dn=Fanfiction.tar.gz&tr=udp://tracker.openbittorrent.com:80" [07:53] whoops andweay theres the torrent link anomie [07:53] qwebirc56: txts are smaller, yes, but also lose a lot of metadat [07:53] metadata, like formatting [07:54] trust me check the story link in that pastebin, theres not much formatting lost [07:55] ffnet has a customizable interface, the only thing the author can control is the centering of the text, with tabs [07:55] i can even change fonts if i want to [07:55] the text is the same [07:57] *** schbirid has joined #archiveteam-bs [07:58] *** phuzion has quit IRC (Remote host closed the connection) [07:58] *** phuzion has joined #archiveteam-bs [08:09] Would RTF be a reasonable choice for text with simple formatting? [08:10] I'd prefer markdown. Why? [08:11] Ah true. jw. [08:13] I mean, for what purpose? Depending on the situation, markdown might not be suitable. [08:13] But if you're archiving, I say stick with the original format. [08:15] Well you're talking about fanfiction.net so I was thinking about that. Yeah for sure, keep it lossless [08:17] Yeah. [08:18] It seems to me he could have gotten epubs instead of text though. [08:18] The text is undoubtedly smaller, but I wonder how much overhead epubs have over the textual content. [08:19] *** primus104 has joined #archiveteam-bs [08:19] anomie: again tweak the font to serif, put back the bold and italic, maybe, if you want to gen fancy, bold the chapter headings, but seriously, theres not much formatting lost [08:20] It's basically zipped html files, so it's probably not bad for medium-sized stories, but I can imaagine the small ones adding up. [08:20] qwebirc56: yeah, I know. [08:20] *** zhongfu has quit IRC (Ping timeout: 240 seconds) [08:20] I wonder if I could create pdfs from these and share them on libgen… [08:20] *** dashcloud has quit IRC (Ping timeout: 240 seconds) [08:21] *** zhongfu has joined #archiveteam-bs [08:22] that said, i actually found an android app that apparently is the vlc of text readers, its called "alreader" epub ftp txt mobi fb2 whatever the hell that is, this thing will read them all, and its so customizable, it has customizing options for it's own menus?! [08:22] i converted my kindle library to epubs, and dumoped the kindle app [08:23] Does it respect muh freedoms? [08:23] It was good, I used an app with the same name on wm5.0 pda [08:23] Anyways, I use FBReader myself. It never had a problem with anything I used. [08:24] anomie: ummm, yes? i dont understand the question [08:25] qwebirc56: Is it free software? [08:25] s/free software/open source/ [08:26] errr, no, but its awesome! and free as in beer, though i highly suggest throwing the author a dollar or 2 [08:26] *** jk[SVP] has quit IRC (Ping timeout: 240 seconds) [08:27] *** jk[SVP] has joined #archiveteam-bs [08:27] anomie: also for the fanfiction, theres codex reader, and readup, readuo can even bakcupits own database file, witch is just sql. [08:28] *** RichardG has quit IRC (Remote host closed the connection) [08:28] fbreader is open source? https://fbreader.org/files/sources/fbreader-sources-0.12.10.tgz [08:28] or am i not understanding the conversation [08:28] *** zhongfu has quit IRC (Ping timeout: 240 seconds) [08:28] kyan: Yup. [08:28] *** RichardG has joined #archiveteam-bs [08:28] Oh, alreader is not libre [08:28] i see [08:29] I love fbreader [08:29] even though the mac version is missing a lot of features, it's one of the best options [08:29] *** dashcloud has joined #archiveteam-bs [08:29] The linux version has way more features, though [08:31] Good… [08:31] *** MrRadar has quit IRC (Read error: Operation timed out) [08:31] *** useretail has quit IRC (Read error: Operation timed out) [08:31] *** marvinw has quit IRC (Read error: Operation timed out) [08:31] *** MrRadar has joined #archiveteam-bs [08:31] *** Lord_Nigh has quit IRC (Read error: Operation timed out) [08:31] Though, I usually use the Calibre ebook reader when on the computer, even if it is a little overkill. [08:31] I hate calibre [08:31] *** robink has quit IRC (Ping timeout: 492 seconds) [08:31] it makes duplicate copies of all my ebooks [08:32] I want them where I put them, and not copied into some disorganized "library" thing [08:32] *** dxrt has quit IRC (Read error: Operation timed out) [08:32] *** botpie91 has quit IRC (Read error: Operation timed out) [08:32] *** achip has quit IRC (Read error: Operation timed out) [08:32] *** dxrt has joined #archiveteam-bs [08:32] *** phiren has quit IRC (Read error: Operation timed out) [08:33] *** phiren has joined #archiveteam-bs [08:33] I read my books by topic, not by author (eg I have no idea who wrote "CJKV Information Processing", but I know it's in the Reference/Humanities/Writing systems/ folder) [08:33] *** Lord_Nigh has joined #archiveteam-bs [08:33] *** SmileyG has joined #archiveteam-bs [08:33] and where would ya look for the Bible in a calibre-sorted-by-author library, anyway? [08:33] So yeah, Fbreader and Djvulibre ftw [08:34] kyan: What do you read? [08:34] not that much, nowadays [08:34] *** zhongfu has joined #archiveteam-bs [08:35] Mostly fiction, some nonfiction [08:35] My favorite book is "Les Miserables" by Victor Hugo, though I haven't read it in years [08:35] *** Gfy has quit IRC (Ping timeout: 364 seconds) [08:35] I used to be much better at reading than I am now [08:36] I've been contemplating reading that. [08:36] How big is it? [08:37] 4,1mb epub [08:37] Umm… [08:37] How is the "thickness" of ebooks measured? [08:37] Can you see how many words? [08:37] 655,478 words in the project gutenberg version [08:37] but that would presumably be in french [08:37] Oh, wow… [08:37] *** kvieta has quit IRC (Read error: Operation timed out) [08:37] this is a translation [08:38] *** robink has joined #archiveteam-bs [08:38] unabridged, though [08:38] I think I've actually read bigger fanfics than that… :/ [08:38] I read it in paper, it was... pretty fricken big [08:38] took me about a year to read [08:38] maybe 1200–1500 pages, fine print? [08:38] It was really good, very inspirational [08:39] which is funny, since mostly I read easier-to-read things [08:39] *** kevin has quit IRC (Ping timeout: 600 seconds) [08:39] *** yakfish has quit IRC (Ping timeout: 600 seconds) [08:40] mind if i pm you? [08:40] *** SimpBrain has quit IRC (Ping timeout: 600 seconds) [08:40] Nope. [08:41] *** SimpBrain has joined #archiveteam-bs [08:41] *** phuzion has quit IRC (Read error: Connection reset by peer) [08:41] *** yakfish has joined #archiveteam-bs [08:41] *** achip has joined #archiveteam-bs [08:41] *** kvieta has joined #archiveteam-bs [08:41] *** wacky has quit IRC (Read error: Connection reset by peer) [08:42] *** useretai- has joined #archiveteam-bs [08:42] *** phuzion has joined #archiveteam-bs [08:43] *** botpie91 has joined #archiveteam-bs [08:43] *** Smiley has quit IRC (Read error: Operation timed out) [08:44] *** wacky has joined #archiveteam-bs [08:44] *** marvinw has joined #archiveteam-bs [08:45] *** Gfy has joined #archiveteam-bs [08:48] back when my fanfic collection was still manageably small, under a few gigs, i did a rough page count [08:48] i had 2500 REAMS of text. [08:50] What's a REAM? [08:52] ream == 500 pages, 24 reams to a case [08:52] All right. [08:52] i think 30 cases to a pallet [08:53] *** primus104 has quit IRC (Leaving.) [08:53] and once you're dealing with dozens of pallets, i believe the technical term is "assload" [09:05] *** arkiver2 has joined #archiveteam-bs [09:11] *** robink has quit IRC (Ping timeout: 492 seconds) [09:12] *** robink has joined #archiveteam-bs [09:15] *** vitzli has quit IRC (Quit: Leaving) [09:18] *** arkiver2 has quit IRC (Ping timeout: 252 seconds) [09:20] dhgtldcotzfxhmxsz7slu581e seems to scraping up places I didn't mean it to [09:21] *** primus104 has joined #archiveteam-bs [09:24] *** arkiver2 has joined #archiveteam-bs [09:30] *** Rotab has quit IRC (Ping timeout: 306 seconds) [09:32] *** Rotab has joined #archiveteam-bs [09:59] *** robink has quit IRC (Read error: Connection reset by peer) [10:04] *** robink has joined #archiveteam-bs [10:21] *** Rotab has quit IRC (Ping timeout: 306 seconds) [10:22] *** Rotab has joined #archiveteam-bs [10:28] *** arkiver2 has quit IRC (Ping timeout: 252 seconds) [10:33] *** arkiver2 has joined #archiveteam-bs [10:54] *** arkiver2 has quit IRC (Ping timeout: 252 seconds) [10:57] *** sigkell has quit IRC (Ping timeout: 252 seconds) [10:57] *** sigkell has joined #archiveteam-bs [10:58] *** Kazzy has quit IRC (Ping timeout: 252 seconds) [10:59] *** Kazzy has joined #archiveteam-bs [11:05] *** robink has quit IRC (Ping timeout: 492 seconds) [11:14] *** robink has joined #archiveteam-bs [11:21] *** primus104 has quit IRC (Leaving.) [11:48] *** robink has quit IRC (Ping timeout: 492 seconds) [12:13] *** robink has joined #archiveteam-bs [12:31] *** vitzli has joined #archiveteam-bs [13:05] *** Smiley has joined #archiveteam-bs [13:08] *** RedType_ has joined #archiveteam-bs [13:08] *** SmileyG has quit IRC (hub.se irc.efnet.pl) [13:08] *** PurpleSym has quit IRC (hub.se irc.efnet.pl) [13:08] *** RedType has quit IRC (hub.se irc.efnet.pl) [13:08] *** primus has quit IRC (hub.se irc.efnet.pl) [13:08] *** edsu_ has quit IRC (hub.se irc.efnet.pl) [13:08] *** tsp_ has quit IRC (hub.se irc.efnet.pl) [13:08] *** szalwia has quit IRC (hub.se irc.efnet.pl) [13:16] *** edsu has joined #archiveteam-bs [13:16] *** swebb sets mode: +o edsu [13:24] *** PurpleSym has joined #archiveteam-bs [13:25] *** tsp_ has joined #archiveteam-bs [13:25] *** szalwia has joined #archiveteam-bs [13:25] *** primus has joined #archiveteam-bs [13:46] *** robink has quit IRC (Ping timeout: 492 seconds) [13:49] *** primus104 has joined #archiveteam-bs [13:54] *** jspiros has quit IRC (Ping timeout: 186 seconds) [13:59] *** primus104 has quit IRC (Leaving.) [14:00] *** robink has joined #archiveteam-bs [14:01] *** PurpleSym has quit IRC (Remote host closed the connection) [14:09] *** BlueMaxim has quit IRC (Quit: Leaving) [14:19] *** arkiver2 has joined #archiveteam-bs [14:28] *** PurpleSym has joined #archiveteam-bs [14:31] *** robink has quit IRC (Read error: Connection reset by peer) [14:31] *** robink has joined #archiveteam-bs [14:36] *** robink has quit IRC (Read error: Connection reset by peer) [14:37] *** arkiver2 has quit IRC (Ping timeout: 252 seconds) [14:38] *** robink has joined #archiveteam-bs [15:01] *** robink has quit IRC (Read error: Connection reset by peer) [15:06] *** primus104 has joined #archiveteam-bs [15:16] *** robink has joined #archiveteam-bs [15:30] *** robink has quit IRC (Read error: Operation timed out) [15:42] *** robink has joined #archiveteam-bs [16:01] *** robink has quit IRC (Ping timeout: 492 seconds) [16:04] *** vitzli has quit IRC (Quit: Leaving) [16:15] *** Kazzy has quit IRC (Quit: ZNC - http://znc.in) [16:17] *** Kazzy has joined #archiveteam-bs [16:20] *** Kazzy has quit IRC (Client Quit) [16:21] *** Kazzy has joined #archiveteam-bs [16:32] *** lytv has quit IRC (Ping timeout: 483 seconds) [16:34] *** Kazzy has quit IRC (Quit: ZNC - http://znc.in) [16:35] *** Kazzy has joined #archiveteam-bs [16:36] *** lytv has joined #archiveteam-bs [16:40] *** Kazzy has quit IRC (Quit: ZNC - http://znc.in) [16:48] *** primus104 has quit IRC (Leaving.) [16:52] *** Zandro|2 has joined #archiveteam-bs [16:56] *** altlabel has quit IRC (Read error: Operation timed out) [16:56] *** ersi has quit IRC (Ping timeout: 258 seconds) [16:57] *** Infreq_ has joined #archiveteam-bs [16:57] *** chfoo0 has joined #archiveteam-bs [16:57] *** kyan has quit IRC (hub.efnet.us irc.Prison.NET) [16:57] *** lbft has quit IRC (hub.efnet.us irc.Prison.NET) [16:57] *** Infreq has quit IRC (hub.efnet.us irc.Prison.NET) [16:57] *** chfoo has quit IRC (hub.efnet.us irc.Prison.NET) [16:57] *** Zandro has quit IRC (hub.efnet.us irc.Prison.NET) [17:01] *** ersi has joined #archiveteam-bs [17:01] *** swebb sets mode: +o ersi [17:02] *** kyan has joined #archiveteam-bs [17:02] *** lbft has joined #archiveteam-bs [17:07] *** Kazzy has joined #archiveteam-bs [17:07] *** altlabel has joined #archiveteam-bs [17:08] *** Start has quit IRC (Remote host closed the connection) [17:08] *** Kazzy has quit IRC (Read error: Connection reset by peer) [17:13] *** chfoo0 is now known as chfoo [17:16] *** primus104 has joined #archiveteam-bs [17:20] *** kevin__ has joined #archiveteam-bs [17:37] *** Kazzy has joined #archiveteam-bs [18:33] *** Stiletto is now known as Stilett0 [18:54] *** primus104 has quit IRC (Read error: Connection reset by peer) [19:00] *** primus104 has joined #archiveteam-bs [19:28] *** robink has joined #archiveteam-bs [20:03] *** Start has joined #archiveteam-bs [20:11] *** PurpleSym has quit IRC (Remote host closed the connection) [20:15] *** schbirid has quit IRC (Quit: Leaving) [21:42] *** Gfy has quit IRC (Read error: Operation timed out) [21:43] *** phuzion has quit IRC (Read error: Operation timed out) [21:43] *** wacky has quit IRC (Read error: Operation timed out) [21:43] *** sep332 has quit IRC (Read error: Operation timed out) [21:43] *** kvieta has quit IRC (Read error: Operation timed out) [21:43] *** S[h]O[r]T has quit IRC (Read error: Operation timed out) [21:43] *** ersi has quit IRC (Read error: Operation timed out) [21:44] *** marvinw has quit IRC (Read error: Operation timed out) [21:44] *** ersi has joined #archiveteam-bs [21:44] *** swebb sets mode: +o ersi [21:44] *** will has quit IRC (Read error: Operation timed out) [21:44] *** useretai- has quit IRC (Read error: Operation timed out) [21:44] *** achip has quit IRC (Read error: Operation timed out) [21:45] *** botpie91 has quit IRC (Read error: Operation timed out) [21:45] *** toad2 has quit IRC (Read error: Operation timed out) [21:46] *** SimpBrain has quit IRC (Read error: Operation timed out) [21:48] *** will has joined #archiveteam-bs [21:48] *** wacky has joined #archiveteam-bs [21:51] *** yakfish has quit IRC (Ping timeout: 600 seconds) [21:53] *** Stiletto has joined #archiveteam-bs [21:55] *** lysobit has quit IRC (Read error: Operation timed out) [21:55] *** SimpBrain has joined #archiveteam-bs [21:55] *** lytv has quit IRC (Read error: Operation timed out) [21:56] *** lytv has joined #archiveteam-bs [21:56] *** nico_32 has quit IRC (Read error: Operation timed out) [21:56] *** lysobit has joined #archiveteam-bs [21:56] *** nico_32 has joined #archiveteam-bs [21:57] *** Stilett0 has quit IRC (Read error: Operation timed out) [21:57] *** Apathy has quit IRC (Read error: Operation timed out) [21:59] *** brayden_ has quit IRC (Read error: Operation timed out) [22:01] *** Zandro|2 has quit IRC (Read error: Operation timed out) [22:01] *** kevin__ is now known as kevin [22:02] *** yipdw has quit IRC (Ping timeout: 606 seconds) [22:02] *** Sk1d has quit IRC (Ping timeout: 606 seconds) [22:02] *** yipdw has joined #archiveteam-bs [22:05] *** SilSte has quit IRC (Ping timeout: 606 seconds) [22:07] *** Apathy has joined #archiveteam-bs [22:09] *** SilSte has joined #archiveteam-bs [22:09] *** Sk1d has joined #archiveteam-bs [22:10] *** marvinw has joined #archiveteam-bs [22:19] *** phuzion has joined #archiveteam-bs [22:19] *** yakfish has joined #archiveteam-bs [22:19] *** kvieta has joined #archiveteam-bs [22:19] *** achip has joined #archiveteam-bs [22:19] *** sep332 has joined #archiveteam-bs [22:19] *** S[h]O[r]T has joined #archiveteam-bs [22:21] *** Gfy has joined #archiveteam-bs [22:22] *** useretail has joined #archiveteam-bs [22:25] *** toad1 has joined #archiveteam-bs [22:28] *** Ravenloft has quit IRC (Ping timeout: 483 seconds) [22:37] looks like someone uploaded R.E.M bootlegs: https://archive.org/details/opensource_audio?and[]=subject%3A%22R.E.M.%22 [22:37] SketchCow: you may need to check that so they can be move a collection at some point [22:37] i don't think are R.E.M [22:38] *i don't think all are R.E.M [23:18] so i found that there is guy that uploads bootlegs to filefactory [23:18] called T.U.B.E [23:19] i may grab all of them from filefactory that are R.E.M band [23:19] there are over 5000 i think [23:19] T.U.B.E files [23:19] R.E.M is at 67 [23:37] *** Start has quit IRC (Quit: Disconnected.)