Time |
Nickname |
Message |
00:29
🔗
|
|
vitzli has joined #archiveteam-bs |
00:37
🔗
|
|
yipdw has quit IRC (Ping timeout: 506 seconds) |
00:47
🔗
|
|
zhongfu has joined #archiveteam-bs |
00:56
🔗
|
|
yipdw has joined #archiveteam-bs |
01:18
🔗
|
|
vitzli has quit IRC (Leaving) |
01:30
🔗
|
|
BlueMaxim has quit IRC (Read error: Connection reset by peer) |
01:37
🔗
|
|
JesseW has joined #archiveteam-bs |
02:24
🔗
|
|
schbirid2 has joined #archiveteam-bs |
02:25
🔗
|
|
schbirid has quit IRC (Read error: Operation timed out) |
02:36
🔗
|
|
JesseW has quit IRC (Leaving.) |
03:03
🔗
|
|
JesseW has joined #archiveteam-bs |
03:10
🔗
|
|
fie has joined #archiveteam-bs |
04:35
🔗
|
|
BlueMaxim has joined #archiveteam-bs |
05:04
🔗
|
|
JetBalsa has quit IRC (Read error: Connection reset by peer) |
05:16
🔗
|
|
Muad-Dib has joined #archiveteam-bs |
05:53
🔗
|
|
dcmorton has joined #archiveteam-bs |
05:53
🔗
|
|
dcmorton has quit IRC (Excess Flood) |
05:53
🔗
|
|
dcmorton has joined #archiveteam-bs |
06:43
🔗
|
|
vitzli has joined #archiveteam-bs |
07:39
🔗
|
|
vitzli has quit IRC (Leaving) |
07:47
🔗
|
godane |
i'm uploading star wars gamer |
07:51
🔗
|
|
JesseW has quit IRC (Leaving.) |
11:02
🔗
|
|
SilSte has quit IRC (Quit: No Ping reply in 180 seconds.) |
11:02
🔗
|
|
SilSte has joined #archiveteam-bs |
11:02
🔗
|
|
Kazzy has quit IRC (Ping timeout: 260 seconds) |
11:04
🔗
|
|
Kazzy has joined #archiveteam-bs |
11:11
🔗
|
|
VADemon has joined #archiveteam-bs |
12:12
🔗
|
|
arkiver3 has joined #archiveteam-bs |
12:33
🔗
|
|
arkiver3 has quit IRC (Ping timeout: 252 seconds) |
13:06
🔗
|
|
arkiver3 has joined #archiveteam-bs |
13:10
🔗
|
|
arkiver3 has quit IRC (Ping timeout: 252 seconds) |
13:59
🔗
|
|
BlueMaxim has quit IRC (Quit: Leaving) |
14:57
🔗
|
|
VADemon has quit IRC (Quit: left4dead) |
16:11
🔗
|
|
VADemon has joined #archiveteam-bs |
17:01
🔗
|
|
JesseW has joined #archiveteam-bs |
17:17
🔗
|
|
lbft has quit IRC (Read error: Operation timed out) |
17:17
🔗
|
|
lbft has joined #archiveteam-bs |
17:24
🔗
|
|
JesseW has quit IRC (Leaving.) |
17:25
🔗
|
|
lbft has quit IRC (Read error: Operation timed out) |
17:27
🔗
|
|
lbft has joined #archiveteam-bs |
17:52
🔗
|
|
VADemon_ has joined #archiveteam-bs |
17:53
🔗
|
|
VADemon_ has quit IRC (Read error: Connection reset by peer) |
17:54
🔗
|
|
VADemon_ has joined #archiveteam-bs |
17:55
🔗
|
|
VADemon has quit IRC (Read error: Operation timed out) |
17:59
🔗
|
|
VADemon_ has quit IRC (Read error: Connection reset by peer) |
18:00
🔗
|
|
VADemon has joined #archiveteam-bs |
18:04
🔗
|
|
VADemon has quit IRC (Read error: Connection reset by peer) |
18:04
🔗
|
|
VADemon has joined #archiveteam-bs |
19:14
🔗
|
|
JetBalsa has joined #archiveteam-bs |
20:02
🔗
|
arkiver |
godane: I found some newspapers www.liberte-algerie.com/pdf/download?id=3264 |
20:02
🔗
|
arkiver |
Change ID for earlier PDFs |
20:05
🔗
|
schbirid2 |
misread as lingerie and got excited :( |
20:05
🔗
|
arkiver |
godane: more newspapers! http://www.elmoudjahid.com/fr/archive/pdf |
20:09
🔗
|
schbirid2 |
i discovered "site:magazin.spiegel.de inurl:EpubDelivery" earlier. nothing special if you get a spiegel dump elsewhere but a nice list of free article samples |
20:14
🔗
|
godane |
i'm grabbing liberte algerie as a web archive |
20:15
🔗
|
arkiver |
no archive.org items? |
20:15
🔗
|
arkiver |
also, http://www.el-massa.com/dz/%D8%A7%D9%84%D9%86%D8%B3%D8%AE%D8%A9-%D8%A7%D9%84%D9%88%D8%B1%D9%82%D9%8A%D8%A9/%D8%A7%D9%84%D8%B9%D8%AF%D8%AF-5783.html |
20:15
🔗
|
arkiver |
change the ID 5783 to earlier if you want earlier newpapers |
20:17
🔗
|
godane |
arkiver: mostly cause i have not date metadata |
20:19
🔗
|
arkiver |
godane: earlier papers, like http://www.liberte-algerie.com/pdf/download?id=2264 , are zipped. The PDFs inside the ZIP file have the date |
20:20
🔗
|
godane |
i figure that for the earlier ones |
20:21
🔗
|
godane |
but i'm just grabbing a web archive so at least that gets uploaded |
20:23
🔗
|
arkiver |
ok |
20:24
🔗
|
arkiver |
some more here http://www.ech-chaab.com/ar/%D8%A7%D9%84%D9%86%D8%B3%D8%AE%D8%A9-%D8%A7%D9%84%D9%88%D8%B1%D9%82%D9%8A%D8%A9/item/37826-%D8%A7%D9%84%D8%B9%D8%AF%D8%AF-16934.html |
20:27
🔗
|
arkiver |
834 newspapers here http://www.ennaharonline.com/ar/archives_pdf/index.1.html |
20:30
🔗
|
arkiver |
1880 newspapers: http://www.al-fadjr.com/ar/pdf |
20:33
🔗
|
arkiver |
newspapers going back to 2011: http://www.akhersaa-dz.com/themes/rtl/pdf/ |
20:37
🔗
|
arkiver |
newspapers here, which can be found through a calendar http://www.lexpressiondz.com/autres/archives_html/index.1.html |
20:38
🔗
|
arkiver |
Newspaper can be found on the bottom of a page for a day, for example http://www.lexpressiondz.com/index.php?news=233583 |
20:40
🔗
|
arkiver |
and pdfs here, but is currently not working http://www.elkhabarerriadhi.com/pdf |
20:44
🔗
|
godane |
i'm just going to work on liberte-algerie.com cause i have too much back log |
20:45
🔗
|
arkiver |
yes |
20:45
🔗
|
arkiver |
I'm not trying to overload you with work, just pasting here what I find so it won't forgotten |
20:46
🔗
|
godane |
looks like with lexpressiondz.com i have to grab the pages to get the pdfs |
20:46
🔗
|
arkiver |
yes |
20:46
🔗
|
arkiver |
They always have some random characters in the name http://www.lexpressiondz.com/files.php?force&file=pdf/P20160118lmhkfjfh.pdf |
20:47
🔗
|
arkiver |
I found these newspapers while sorting out the 16 new algerian newssites for newsbuddy |
20:47
🔗
|
arkiver |
I'll just paste here what I find in the future |
21:03
🔗
|
SketchCow |
I live in a world where arkiver successfully filled godane's buffer |
21:07
🔗
|
|
schbirid2 has quit IRC (Quit: Leaving) |
21:11
🔗
|
godane |
its mostly cause my buffer is full already |
21:11
🔗
|
godane |
i have stuff that needs to get uploaded |
21:14
🔗
|
godane |
also i'm uploading stuff like Water Mark Church Videos |
21:14
🔗
|
godane |
2013 videos are all uploaded now |
21:37
🔗
|
|
VADemon has quit IRC (Read error: No route to host) |
21:40
🔗
|
|
VADemon has joined #archiveteam-bs |
21:52
🔗
|
|
wickedpla is now known as wp494 |
22:11
🔗
|
|
slyphic is now known as slyphic|a |
22:21
🔗
|
|
VADemon has quit IRC (Read error: Connection reset by peer) |
22:25
🔗
|
|
xmc is now known as chronomex |
22:25
🔗
|
|
chronomex is now known as xmc |
22:57
🔗
|
Smiley |
just got a shout that https://www.reddit.com/r/DIY is having some 'issues' and might shut down |
22:57
🔗
|
Smiley |
dont know how fast we can grab large reddits like this |
23:08
🔗
|
godane |
https://www.reddit.com/r/Cinema4D/comments/41zzw6/freebie_worldmachine_files/ |
23:09
🔗
|
MrRadar |
Smiley: I threw it into Archivebot |
23:09
🔗
|
MrRadar |
Though it looks like it's waiting for a pipeline to free up |
23:10
🔗
|
HCross |
I have a feeling we could do with a pipeline for "longer grabs" |