#archiveteam 2016-12-27,Tue

↑back Search

Time Nickname Message
00:30 🔗 maelstrom has joined #archiveteam
00:34 🔗 ravetcofx has quit IRC (Read error: Operation timed out)
00:40 🔗 ravetcofx has joined #archiveteam
00:42 🔗 Rondom has quit IRC (Remote host closed the connection)
00:42 🔗 Rondom has joined #archiveteam
00:43 🔗 VADemon has joined #archiveteam
01:33 🔗 maelstrom has quit IRC (Remote host closed the connection)
01:37 🔗 aschmitz has joined #archiveteam
01:40 🔗 ColdIce has joined #archiveteam
01:46 🔗 maelstrom has joined #archiveteam
02:18 🔗 SketchCow We're already archiving usenet.
02:18 🔗 SketchCow Unless you mean Usenet Binaries groups, then no
02:21 🔗 Whopper_ has joined #archiveteam
02:27 🔗 ploop_ has joined #archiveteam
02:29 🔗 wolfpld has quit IRC (hub.se irc.efnet.fr)
02:29 🔗 Whopper has quit IRC (hub.se irc.efnet.fr)
02:29 🔗 ploop has quit IRC (hub.se irc.efnet.fr)
02:29 🔗 gibigiana has quit IRC (hub.se irc.efnet.fr)
02:29 🔗 tsr has quit IRC (hub.se irc.efnet.fr)
02:29 🔗 cadbury_ has quit IRC (hub.se irc.efnet.fr)
02:34 🔗 gibigian1 has joined #archiveteam
03:13 🔗 wolfpld has joined #archiveteam
03:13 🔗 cadbury_ has joined #archiveteam
03:14 🔗 tsr has joined #archiveteam
03:32 🔗 dashcloud wolfpld: if you've got access to a source of historical Usenet or some magical Google Groups scraper you've made, we're interested and listening
03:46 🔗 phuzion has quit IRC (Read error: Operation timed out)
03:51 🔗 phuzion has joined #archiveteam
04:01 🔗 DiscantX has joined #archiveteam
04:22 🔗 pizzaiolo has left
04:45 🔗 ravetcofx has quit IRC (Read error: Operation timed out)
05:18 🔗 Sk1d has quit IRC (Ping timeout: 194 seconds)
05:22 🔗 DiscantY has joined #archiveteam
05:25 🔗 Sk1d has joined #archiveteam
05:29 🔗 DiscantX has quit IRC (Read error: Operation timed out)
05:55 🔗 maelstrom has quit IRC (Quit: Leaving)
06:37 🔗 DiscantZ has joined #archiveteam
06:42 🔗 DiscantY has quit IRC (Read error: Operation timed out)
07:07 🔗 Start has quit IRC (Quit: Disconnected.)
07:08 🔗 Start has joined #archiveteam
08:33 🔗 pizzaiolo has joined #archiveteam
09:22 🔗 DiscantZ has quit IRC (Ping timeout: 633 seconds)
09:46 🔗 Simpbrain has joined #archiveteam
09:48 🔗 atomotic has joined #archiveteam
10:06 🔗 schbirid has joined #archiveteam
10:43 🔗 dashcloud has quit IRC (Ping timeout: 260 seconds)
10:43 🔗 dashcloud has joined #archiveteam
11:13 🔗 BlueMaxim has quit IRC (Quit: Leaving)
11:22 🔗 dashcloud has quit IRC (Ping timeout: 244 seconds)
11:27 🔗 dashcloud has joined #archiveteam
11:32 🔗 pizzaiolo has quit IRC (Ping timeout: 264 seconds)
11:45 🔗 Yoshimura has joined #archiveteam
11:51 🔗 atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com)
11:56 🔗 Whopper_ has quit IRC (Read error: Operation timed out)
11:59 🔗 Whopper has joined #archiveteam
12:08 🔗 noobboob has joined #archiveteam
12:08 🔗 noobboob WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD
12:09 🔗 zino yahoosucks
12:09 🔗 noobboob yes it does. Gracias
12:10 🔗 zino You are welcome
12:10 🔗 pizzaiolo has joined #archiveteam
12:11 🔗 pizzaiolo has quit IRC (Remote host closed the connection)
12:11 🔗 pizzaiolo has joined #archiveteam
12:25 🔗 atomotic has joined #archiveteam
12:42 🔗 wolfpld dashcloud: news-archive.icm.edu.pl has almost complete archives of pl.* hierarchy since 1996 and alt.pl.* since 2000
12:42 🔗 wolfpld I am not aware of other archival nntp sources
12:42 🔗 wolfpld and I do have gg crawler
12:43 🔗 wolfpld nothing fancy tbh, just a sensible rewrite of what's already available
12:43 🔗 wolfpld https://bitbucket.org/wolfpld/usenetarchive
12:43 🔗 wolfpld these are my tools
12:45 🔗 wolfpld https://archive.org/details/usenet-uat-pl
12:45 🔗 wolfpld and there are my archives
12:45 🔗 HCross has quit IRC (Remote host closed the connection)
12:45 🔗 wolfpld I do have more up-to-date version, but I haven't uploaded it yet
12:46 🔗 HCross has joined #archiveteam
12:49 🔗 wolfpld dashcloud: the biggest problem with scraping google groups is that their algorithms get progressively slower, the further in history you go
12:49 🔗 wolfpld eventually you start getting server timeouts
12:50 🔗 wolfpld my tool retries download attempts until they succeed
12:51 🔗 wolfpld I have seen some groups get unstuck after 1000 attempts or so
12:51 🔗 wolfpld but eventually, you'll get to the point where you just can't get anything more
12:52 🔗 wolfpld in case of group that had many threads
12:56 🔗 wolfpld I still do think it is a viable way to get data, though
12:57 🔗 wolfpld here are data sizes of the same group from various sources:
12:57 🔗 wolfpld 333M archive.org-giganews/pl.pregierz
12:57 🔗 wolfpld 448M archive.org-google/pl.pregierz
12:57 🔗 wolfpld 2,6G googlegroups/pl.pregierz
12:57 🔗 wolfpld 3,0G news-archive.icm.edu.pl/pl.pregierz
13:07 🔗 weseeyou has joined #archiveteam
13:07 🔗 weseeyou hello, backup domo animate!
13:07 🔗 weseeyou has quit IRC (Client Quit)
13:07 🔗 HCross has quit IRC (Read error: Connection reset by peer)
13:07 🔗 HCross has joined #archiveteam
13:46 🔗 schbirid https://virtuallyfun.superglobalmegacorp.com/2016/12/08/adding-gnu-1989-source-tapes-sourceforge/
13:46 🔗 schbirid https://virtuallyfun.superglobalmegacorp.com/2016/12/04/found-ancient-gnu-software/
14:09 🔗 atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com)
14:37 🔗 Simpbrain has quit IRC (Remote host closed the connection)
15:19 🔗 i336_ has quit IRC (Ping timeout: 260 seconds)
15:40 🔗 SketchCow schbirid: Throw 'em in an item
15:42 🔗 jspiros_ has joined #archiveteam
15:43 🔗 jspiros has quit IRC (Read error: Operation timed out)
15:50 🔗 VADemon has quit IRC (Quit: left4dead)
16:05 🔗 schbirid has quit IRC (Ping timeout: 255 seconds)
16:18 🔗 schbirid has joined #archiveteam
16:21 🔗 pizzaiolo https://www.reddit.com/r/Android/comments/5kfm8x/the_cyanogenmod_archives_full_downloads/
16:22 🔗 pizzaiolo which of you was it? ;)
16:23 🔗 * HCross rather quickly runs away
16:26 🔗 HCross pizzaiolo, #-bs
16:48 🔗 noobboob has quit IRC (Ping timeout: 268 seconds)
16:58 🔗 BartoCH has quit IRC (Ping timeout: 260 seconds)
17:07 🔗 BartoCH has joined #archiveteam
17:09 🔗 atomotic has joined #archiveteam
17:18 🔗 Asparagir has joined #archiveteam
17:28 🔗 maelstrom has joined #archiveteam
17:33 🔗 maelstrom has quit IRC (Ping timeout: 250 seconds)
17:33 🔗 BartoCH has quit IRC (Ping timeout: 260 seconds)
17:37 🔗 BartoCH has joined #archiveteam
17:39 🔗 maelstrom has joined #archiveteam
17:56 🔗 atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com)
17:57 🔗 Start has quit IRC (Read error: Connection reset by peer)
17:58 🔗 Start has joined #archiveteam
18:59 🔗 kris33 has joined #archiveteam
19:40 🔗 kris33 has quit IRC (Textual IRC Client: www.textualapp.com)
19:51 🔗 SketchCow That's Iilya
20:00 🔗 SketchCow Can people help me find out why want.archive.org is failing?
20:03 🔗 HCross its taking me too an "item not avaliable" page
20:31 🔗 Muad-Dib has quit IRC (Quit: ZNC - http://znc.in)
20:32 🔗 pizzaiolo has quit IRC (Read error: Operation timed out)
20:32 🔗 i336_ has joined #archiveteam
21:01 🔗 glass3 has quit IRC ()
21:31 🔗 maelstrom has quit IRC (Ping timeout: 250 seconds)
21:40 🔗 maelstrom has joined #archiveteam
22:17 🔗 krazedkat has joined #archiveteam
22:17 🔗 krazedkat has quit IRC (Client Quit)
22:25 🔗 BlueMaxim has joined #archiveteam
23:42 🔗 schbirid has quit IRC (Quit: Leaving)
23:52 🔗 swebb has quit IRC (Read error: Operation timed out)
23:53 🔗 swebb has joined #archiveteam
23:56 🔗 kniffy has quit IRC (Ping timeout: 240 seconds)
23:56 🔗 VADemon has joined #archiveteam
23:57 🔗 kniffy has joined #archiveteam

irclogger-viewer