[00:30] *** maelstrom has joined #archiveteam [00:34] *** ravetcofx has quit IRC (Read error: Operation timed out) [00:40] *** ravetcofx has joined #archiveteam [00:42] *** Rondom has quit IRC (Remote host closed the connection) [00:42] *** Rondom has joined #archiveteam [00:43] *** VADemon has joined #archiveteam [01:33] *** maelstrom has quit IRC (Remote host closed the connection) [01:37] *** aschmitz has joined #archiveteam [01:40] *** ColdIce has joined #archiveteam [01:46] *** maelstrom has joined #archiveteam [02:18] We're already archiving usenet. [02:18] Unless you mean Usenet Binaries groups, then no [02:21] *** Whopper_ has joined #archiveteam [02:27] *** ploop_ has joined #archiveteam [02:29] *** wolfpld has quit IRC (hub.se irc.efnet.fr) [02:29] *** Whopper has quit IRC (hub.se irc.efnet.fr) [02:29] *** ploop has quit IRC (hub.se irc.efnet.fr) [02:29] *** gibigiana has quit IRC (hub.se irc.efnet.fr) [02:29] *** tsr has quit IRC (hub.se irc.efnet.fr) [02:29] *** cadbury_ has quit IRC (hub.se irc.efnet.fr) [02:34] *** gibigian1 has joined #archiveteam [03:13] *** wolfpld has joined #archiveteam [03:13] *** cadbury_ has joined #archiveteam [03:14] *** tsr has joined #archiveteam [03:32] wolfpld: if you've got access to a source of historical Usenet or some magical Google Groups scraper you've made, we're interested and listening [03:46] *** phuzion has quit IRC (Read error: Operation timed out) [03:51] *** phuzion has joined #archiveteam [04:01] *** DiscantX has joined #archiveteam [04:22] *** pizzaiolo has left [04:45] *** ravetcofx has quit IRC (Read error: Operation timed out) [05:18] *** Sk1d has quit IRC (Ping timeout: 194 seconds) [05:22] *** DiscantY has joined #archiveteam [05:25] *** Sk1d has joined #archiveteam [05:29] *** DiscantX has quit IRC (Read error: Operation timed out) [05:55] *** maelstrom has quit IRC (Quit: Leaving) [06:37] *** DiscantZ has joined #archiveteam [06:42] *** DiscantY has quit IRC (Read error: Operation timed out) [07:07] *** Start has quit IRC (Quit: Disconnected.) [07:08] *** Start has joined #archiveteam [08:33] *** pizzaiolo has joined #archiveteam [09:22] *** DiscantZ has quit IRC (Ping timeout: 633 seconds) [09:46] *** Simpbrain has joined #archiveteam [09:48] *** atomotic has joined #archiveteam [10:06] *** schbirid has joined #archiveteam [10:43] *** dashcloud has quit IRC (Ping timeout: 260 seconds) [10:43] *** dashcloud has joined #archiveteam [11:13] *** BlueMaxim has quit IRC (Quit: Leaving) [11:22] *** dashcloud has quit IRC (Ping timeout: 244 seconds) [11:27] *** dashcloud has joined #archiveteam [11:32] *** pizzaiolo has quit IRC (Ping timeout: 264 seconds) [11:45] *** Yoshimura has joined #archiveteam [11:51] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) [11:56] *** Whopper_ has quit IRC (Read error: Operation timed out) [11:59] *** Whopper has joined #archiveteam [12:08] *** noobboob has joined #archiveteam [12:08] WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD [12:09] yahoosucks [12:09] yes it does. Gracias [12:10] You are welcome [12:10] *** pizzaiolo has joined #archiveteam [12:11] *** pizzaiolo has quit IRC (Remote host closed the connection) [12:11] *** pizzaiolo has joined #archiveteam [12:25] *** atomotic has joined #archiveteam [12:42] dashcloud: news-archive.icm.edu.pl has almost complete archives of pl.* hierarchy since 1996 and alt.pl.* since 2000 [12:42] I am not aware of other archival nntp sources [12:42] and I do have gg crawler [12:43] nothing fancy tbh, just a sensible rewrite of what's already available [12:43] https://bitbucket.org/wolfpld/usenetarchive [12:43] these are my tools [12:45] https://archive.org/details/usenet-uat-pl [12:45] and there are my archives [12:45] *** HCross has quit IRC (Remote host closed the connection) [12:45] I do have more up-to-date version, but I haven't uploaded it yet [12:46] *** HCross has joined #archiveteam [12:49] dashcloud: the biggest problem with scraping google groups is that their algorithms get progressively slower, the further in history you go [12:49] eventually you start getting server timeouts [12:50] my tool retries download attempts until they succeed [12:51] I have seen some groups get unstuck after 1000 attempts or so [12:51] but eventually, you'll get to the point where you just can't get anything more [12:52] in case of group that had many threads [12:56] I still do think it is a viable way to get data, though [12:57] here are data sizes of the same group from various sources: [12:57] 333M archive.org-giganews/pl.pregierz [12:57] 448M archive.org-google/pl.pregierz [12:57] 2,6G googlegroups/pl.pregierz [12:57] 3,0G news-archive.icm.edu.pl/pl.pregierz [13:07] *** weseeyou has joined #archiveteam [13:07] hello, backup domo animate! [13:07] *** weseeyou has quit IRC (Client Quit) [13:07] *** HCross has quit IRC (Read error: Connection reset by peer) [13:07] *** HCross has joined #archiveteam [13:46] https://virtuallyfun.superglobalmegacorp.com/2016/12/08/adding-gnu-1989-source-tapes-sourceforge/ [13:46] https://virtuallyfun.superglobalmegacorp.com/2016/12/04/found-ancient-gnu-software/ [14:09] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) [14:37] *** Simpbrain has quit IRC (Remote host closed the connection) [15:19] *** i336_ has quit IRC (Ping timeout: 260 seconds) [15:40] schbirid: Throw 'em in an item [15:42] *** jspiros_ has joined #archiveteam [15:43] *** jspiros has quit IRC (Read error: Operation timed out) [15:50] *** VADemon has quit IRC (Quit: left4dead) [16:05] *** schbirid has quit IRC (Ping timeout: 255 seconds) [16:18] *** schbirid has joined #archiveteam [16:21] https://www.reddit.com/r/Android/comments/5kfm8x/the_cyanogenmod_archives_full_downloads/ [16:22] which of you was it? ;) [16:23] * HCross rather quickly runs away [16:26] pizzaiolo, #-bs [16:48] *** noobboob has quit IRC (Ping timeout: 268 seconds) [16:58] *** BartoCH has quit IRC (Ping timeout: 260 seconds) [17:07] *** BartoCH has joined #archiveteam [17:09] *** atomotic has joined #archiveteam [17:18] *** Asparagir has joined #archiveteam [17:28] *** maelstrom has joined #archiveteam [17:33] *** maelstrom has quit IRC (Ping timeout: 250 seconds) [17:33] *** BartoCH has quit IRC (Ping timeout: 260 seconds) [17:37] *** BartoCH has joined #archiveteam [17:39] *** maelstrom has joined #archiveteam [17:56] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) [17:57] *** Start has quit IRC (Read error: Connection reset by peer) [17:58] *** Start has joined #archiveteam [18:59] *** kris33 has joined #archiveteam [19:40] *** kris33 has quit IRC (Textual IRC Client: www.textualapp.com) [19:51] That's Iilya [20:00] Can people help me find out why want.archive.org is failing? [20:03] its taking me too an "item not avaliable" page [20:31] *** Muad-Dib has quit IRC (Quit: ZNC - http://znc.in) [20:32] *** pizzaiolo has quit IRC (Read error: Operation timed out) [20:32] *** i336_ has joined #archiveteam [21:01] *** glass3 has quit IRC () [21:31] *** maelstrom has quit IRC (Ping timeout: 250 seconds) [21:40] *** maelstrom has joined #archiveteam [22:17] *** krazedkat has joined #archiveteam [22:17] *** krazedkat has quit IRC (Client Quit) [22:25] *** BlueMaxim has joined #archiveteam [23:42] *** schbirid has quit IRC (Quit: Leaving) [23:52] *** swebb has quit IRC (Read error: Operation timed out) [23:53] *** swebb has joined #archiveteam [23:56] *** kniffy has quit IRC (Ping timeout: 240 seconds) [23:56] *** VADemon has joined #archiveteam [23:57] *** kniffy has joined #archiveteam