[00:29] *** vitzli has joined #archiveteam-bs [00:37] *** yipdw has quit IRC (Ping timeout: 506 seconds) [00:47] *** zhongfu has joined #archiveteam-bs [00:56] *** yipdw has joined #archiveteam-bs [01:18] *** vitzli has quit IRC (Leaving) [01:30] *** BlueMaxim has quit IRC (Read error: Connection reset by peer) [01:37] *** JesseW has joined #archiveteam-bs [02:24] *** schbirid2 has joined #archiveteam-bs [02:25] *** schbirid has quit IRC (Read error: Operation timed out) [02:36] *** JesseW has quit IRC (Leaving.) [03:03] *** JesseW has joined #archiveteam-bs [03:10] *** fie has joined #archiveteam-bs [04:35] *** BlueMaxim has joined #archiveteam-bs [05:04] *** JetBalsa has quit IRC (Read error: Connection reset by peer) [05:16] *** Muad-Dib has joined #archiveteam-bs [05:53] *** dcmorton has joined #archiveteam-bs [05:53] *** dcmorton has quit IRC (Excess Flood) [05:53] *** dcmorton has joined #archiveteam-bs [06:43] *** vitzli has joined #archiveteam-bs [07:39] *** vitzli has quit IRC (Leaving) [07:47] i'm uploading star wars gamer [07:51] *** JesseW has quit IRC (Leaving.) [11:02] *** SilSte has quit IRC (Quit: No Ping reply in 180 seconds.) [11:02] *** SilSte has joined #archiveteam-bs [11:02] *** Kazzy has quit IRC (Ping timeout: 260 seconds) [11:04] *** Kazzy has joined #archiveteam-bs [11:11] *** VADemon has joined #archiveteam-bs [12:12] *** arkiver3 has joined #archiveteam-bs [12:33] *** arkiver3 has quit IRC (Ping timeout: 252 seconds) [13:06] *** arkiver3 has joined #archiveteam-bs [13:10] *** arkiver3 has quit IRC (Ping timeout: 252 seconds) [13:59] *** BlueMaxim has quit IRC (Quit: Leaving) [14:57] *** VADemon has quit IRC (Quit: left4dead) [16:11] *** VADemon has joined #archiveteam-bs [17:01] *** JesseW has joined #archiveteam-bs [17:17] *** lbft has quit IRC (Read error: Operation timed out) [17:17] *** lbft has joined #archiveteam-bs [17:24] *** JesseW has quit IRC (Leaving.) [17:25] *** lbft has quit IRC (Read error: Operation timed out) [17:27] *** lbft has joined #archiveteam-bs [17:52] *** VADemon_ has joined #archiveteam-bs [17:53] *** VADemon_ has quit IRC (Read error: Connection reset by peer) [17:54] *** VADemon_ has joined #archiveteam-bs [17:55] *** VADemon has quit IRC (Read error: Operation timed out) [17:59] *** VADemon_ has quit IRC (Read error: Connection reset by peer) [18:00] *** VADemon has joined #archiveteam-bs [18:04] *** VADemon has quit IRC (Read error: Connection reset by peer) [18:04] *** VADemon has joined #archiveteam-bs [19:14] *** JetBalsa has joined #archiveteam-bs [20:02] godane: I found some newspapers www.liberte-algerie.com/pdf/download?id=3264 [20:02] Change ID for earlier PDFs [20:05] misread as lingerie and got excited :( [20:05] godane: more newspapers! http://www.elmoudjahid.com/fr/archive/pdf [20:09] i discovered "site:magazin.spiegel.de inurl:EpubDelivery" earlier. nothing special if you get a spiegel dump elsewhere but a nice list of free article samples [20:14] i'm grabbing liberte algerie as a web archive [20:15] no archive.org items? [20:15] also, http://www.el-massa.com/dz/%D8%A7%D9%84%D9%86%D8%B3%D8%AE%D8%A9-%D8%A7%D9%84%D9%88%D8%B1%D9%82%D9%8A%D8%A9/%D8%A7%D9%84%D8%B9%D8%AF%D8%AF-5783.html [20:15] change the ID 5783 to earlier if you want earlier newpapers [20:17] arkiver: mostly cause i have not date metadata [20:19] godane: earlier papers, like http://www.liberte-algerie.com/pdf/download?id=2264 , are zipped. The PDFs inside the ZIP file have the date [20:20] i figure that for the earlier ones [20:21] but i'm just grabbing a web archive so at least that gets uploaded [20:23] ok [20:24] some more here http://www.ech-chaab.com/ar/%D8%A7%D9%84%D9%86%D8%B3%D8%AE%D8%A9-%D8%A7%D9%84%D9%88%D8%B1%D9%82%D9%8A%D8%A9/item/37826-%D8%A7%D9%84%D8%B9%D8%AF%D8%AF-16934.html [20:27] 834 newspapers here http://www.ennaharonline.com/ar/archives_pdf/index.1.html [20:30] 1880 newspapers: http://www.al-fadjr.com/ar/pdf [20:33] newspapers going back to 2011: http://www.akhersaa-dz.com/themes/rtl/pdf/ [20:37] newspapers here, which can be found through a calendar http://www.lexpressiondz.com/autres/archives_html/index.1.html [20:38] Newspaper can be found on the bottom of a page for a day, for example http://www.lexpressiondz.com/index.php?news=233583 [20:40] and pdfs here, but is currently not working http://www.elkhabarerriadhi.com/pdf [20:44] i'm just going to work on liberte-algerie.com cause i have too much back log [20:45] yes [20:45] I'm not trying to overload you with work, just pasting here what I find so it won't forgotten [20:46] looks like with lexpressiondz.com i have to grab the pages to get the pdfs [20:46] yes [20:46] They always have some random characters in the name http://www.lexpressiondz.com/files.php?force&file=pdf/P20160118lmhkfjfh.pdf [20:47] I found these newspapers while sorting out the 16 new algerian newssites for newsbuddy [20:47] I'll just paste here what I find in the future [21:03] I live in a world where arkiver successfully filled godane's buffer [21:07] *** schbirid2 has quit IRC (Quit: Leaving) [21:11] its mostly cause my buffer is full already [21:11] i have stuff that needs to get uploaded [21:14] also i'm uploading stuff like Water Mark Church Videos [21:14] 2013 videos are all uploaded now [21:37] *** VADemon has quit IRC (Read error: No route to host) [21:40] *** VADemon has joined #archiveteam-bs [21:52] *** wickedpla is now known as wp494 [22:11] *** slyphic is now known as slyphic|a [22:21] *** VADemon has quit IRC (Read error: Connection reset by peer) [22:25] *** xmc is now known as chronomex [22:25] *** chronomex is now known as xmc [22:57] just got a shout that https://www.reddit.com/r/DIY is having some 'issues' and might shut down [22:57] dont know how fast we can grab large reddits like this [23:08] https://www.reddit.com/r/Cinema4D/comments/41zzw6/freebie_worldmachine_files/ [23:09] Smiley: I threw it into Archivebot [23:09] Though it looks like it's waiting for a pipeline to free up [23:10] I have a feeling we could do with a pipeline for "longer grabs"