#archiveteam 2016-10-05,Wed

↑back Search

Time Nickname Message
00:02 πŸ”— MrRadar arkiver: Could you requeue Yahoo Answers?
00:09 πŸ”— kristian_ has quit IRC (Remote host closed the connection)
00:10 πŸ”— arkiver done!
00:10 πŸ”— MrRadar Thanks
00:14 πŸ”— cadbury_ has joined #archiveteam
00:16 πŸ”— Petri152 has quit IRC (Read error: Operation timed out)
00:19 πŸ”— nwf_ has quit IRC (Read error: Operation timed out)
00:20 πŸ”— Petri152 has joined #archiveteam
00:22 πŸ”— JesseW has joined #archiveteam
00:23 πŸ”— BlueMaxim has joined #archiveteam
00:30 πŸ”— phuzion has quit IRC (Ping timeout: 611 seconds)
00:32 πŸ”— Petri152 has quit IRC (Ping timeout: 613 seconds)
00:33 πŸ”— maelstrom has joined #archiveteam
00:42 πŸ”— Petri152 has joined #archiveteam
00:44 πŸ”— nwf_ has joined #archiveteam
00:45 πŸ”— phuzion has joined #archiveteam
01:00 πŸ”— octarine has quit IRC (Ping timeout: 260 seconds)
01:03 πŸ”— hook54321 has quit IRC (Ping timeout: 260 seconds)
01:04 πŸ”— _desu___ has quit IRC (Ping timeout: 260 seconds)
01:12 πŸ”— SmileyG has quit IRC (Ping timeout: 255 seconds)
01:15 πŸ”— Smiley has joined #archiveteam
01:47 πŸ”— balrog has quit IRC (Read error: Operation timed out)
01:47 πŸ”— Atros has joined #archiveteam
01:48 πŸ”— ranma has quit IRC (Read error: Operation timed out)
01:48 πŸ”— Mayonaise has quit IRC (Read error: Operation timed out)
01:48 πŸ”— FluffyFox has joined #archiveteam
01:48 πŸ”— aMunster has joined #archiveteam
01:48 πŸ”— SadDM has quit IRC (Read error: Operation timed out)
01:48 πŸ”— Frogging has quit IRC (Read error: Operation timed out)
01:48 πŸ”— atrocity has quit IRC (Read error: Operation timed out)
01:48 πŸ”— FluffyFox is now known as Frogging
01:49 πŸ”— marvinw has quit IRC (Ping timeout: 246 seconds)
01:49 πŸ”— remsen has quit IRC (Read error: Operation timed out)
01:49 πŸ”— computerf has quit IRC (Read error: Operation timed out)
01:49 πŸ”— acridAxid has quit IRC (Ping timeout: 246 seconds)
01:49 πŸ”— oli has quit IRC (Ping timeout: 246 seconds)
01:49 πŸ”— MMovie has joined #archiveteam
01:49 πŸ”— oli has joined #archiveteam
01:50 πŸ”— balrog has joined #archiveteam
01:50 πŸ”— swebb sets mode: +o balrog
01:50 πŸ”— kremlin has joined #archiveteam
01:50 πŸ”— TC01 has quit IRC (Ping timeout: 246 seconds)
01:50 πŸ”— yakfish has quit IRC (Ping timeout: 246 seconds)
01:50 πŸ”— robink has quit IRC (Ping timeout: 246 seconds)
01:50 πŸ”— robink has joined #archiveteam
01:52 πŸ”— jspiros has quit IRC (Read error: Operation timed out)
01:53 πŸ”— Atom has quit IRC (Read error: Operation timed out)
01:53 πŸ”— marvinw has joined #archiveteam
01:54 πŸ”— remsen has joined #archiveteam
01:54 πŸ”— kyounko has quit IRC (Ping timeout: 492 seconds)
01:55 πŸ”— TC01 has joined #archiveteam
01:56 πŸ”— cf has joined #archiveteam
01:58 πŸ”— ranma has joined #archiveteam
02:00 πŸ”— acridAxid has joined #archiveteam
02:01 πŸ”— Swizzle has quit IRC (Quit: Leaving)
02:05 πŸ”— Mayonaise has joined #archiveteam
02:06 πŸ”— JesseW has quit IRC (Ping timeout: 370 seconds)
02:33 πŸ”— Atros has quit IRC (Read error: Operation timed out)
02:33 πŸ”— atrocity has joined #archiveteam
02:56 πŸ”— JesseW has joined #archiveteam
03:31 πŸ”— xhdr has quit IRC (Ping timeout: 194 seconds)
03:31 πŸ”— Deewiant has quit IRC (Ping timeout: 194 seconds)
03:31 πŸ”— Sk1d has quit IRC (Ping timeout: 194 seconds)
03:31 πŸ”— SilSte has quit IRC (Ping timeout: 194 seconds)
03:31 πŸ”— xhdr has joined #archiveteam
03:31 πŸ”— Deewiant has joined #archiveteam
03:31 πŸ”— Sk1d has joined #archiveteam
03:37 πŸ”— SilSte has joined #archiveteam
04:03 πŸ”— ndiddy has quit IRC (Ping timeout: 244 seconds)
04:26 πŸ”— maelstrom has quit IRC (Quit: Leaving)
04:36 πŸ”— Sk1d has quit IRC (Ping timeout: 250 seconds)
04:37 πŸ”— VonGuard_ is now known as VonGuard
04:43 πŸ”— Sk1d has joined #archiveteam
05:28 πŸ”— hive-mind has quit IRC (Ping timeout: 260 seconds)
05:29 πŸ”— hive-mind has joined #archiveteam
05:59 πŸ”— JesseW has quit IRC (Ping timeout: 370 seconds)
07:03 πŸ”— AlexLehm has joined #archiveteam
07:06 πŸ”— midas SketchCow: grabbed
07:09 πŸ”— AlexLehm has quit IRC (Ping timeout: 260 seconds)
07:47 πŸ”— toddf so I've increased my speed archiving genweb by using the wayback system to retrieve unique pages, cache them, then snarf them from wayback .. if I do it direct after a while I get tossed to /dev/null like I'm banned .. seems like I've got a bit to go .. Oct 1: 40172775 urls enumerated, 5861160 downloaded and scraped for more urls. Oct 5: 41830195 urls enumerated, 6142773 downloaded and scraped for more
07:47 πŸ”— toddf urls. from 85.410% ...
07:47 πŸ”— toddf ... unscraped to 85.315% unscraped. at some point enumerated urls will stop changing; I have one process priming the wayback cache, and another downloading, storing, scraping urls .. moved from sqlite3 to postgresql .. not sure how to make this go faster other than perhaps another pair of processes (that I'd have to do some form of overlap avoidance with) and I fear I may overload the genweb site if I
07:47 πŸ”— toddf do too much more. I'm ...
07:47 πŸ”— toddf ... hoping it stays online until I'm done ;-)
07:48 πŸ”— WinterFox has joined #archiveteam
07:50 πŸ”— Igloo^_^ has quit IRC (Ping timeout: 260 seconds)
07:54 πŸ”— atomotic has joined #archiveteam
07:57 πŸ”— schbirid has joined #archiveteam
08:14 πŸ”— dashcloud has quit IRC (Read error: Operation timed out)
08:21 πŸ”— MMovie1 has joined #archiveteam
08:22 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
08:34 πŸ”— dashcloud has joined #archiveteam
09:00 πŸ”— ndiddy has joined #archiveteam
09:02 πŸ”— ravetcofx has quit IRC (Read error: Operation timed out)
09:03 πŸ”— Igloo^_^ has joined #archiveteam
10:25 πŸ”— godane has quit IRC (Read error: Operation timed out)
10:34 πŸ”— godane has joined #archiveteam
10:44 πŸ”— atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com)
11:09 πŸ”— W1nterFox has joined #archiveteam
11:12 πŸ”— WinterFox has quit IRC (Ping timeout: 370 seconds)
11:20 πŸ”— BlueMaxim has quit IRC (Quit: Leaving)
11:22 πŸ”— Morbus has joined #archiveteam
11:58 πŸ”— atomotic has joined #archiveteam
13:41 πŸ”— samp has joined #archiveteam
13:45 πŸ”— samp Hello _o/ This is probably the wrong place to ask, but is there some ongoing project I can help with? I've been backuping my own pinboard links on my own using httrack and a friend of mine told me about archiveteam
14:01 πŸ”— JesseW has joined #archiveteam
14:17 πŸ”— Start has quit IRC (Quit: Disconnected.)
14:24 πŸ”— atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com)
14:43 πŸ”— W1nterFox has quit IRC (Ping timeout: 370 seconds)
14:57 πŸ”— godane has quit IRC (Quit: Leaving.)
14:58 πŸ”— godane has joined #archiveteam
15:15 πŸ”— dashcloud has quit IRC (Quit: No Ping reply in 180 seconds.)
15:20 πŸ”— Medowar0 samp: the easierst way to help is running a warrior. Current main Project: yahoo answers. Please run with low concurrent(2), since yahoo is banning ips on more concurrent.
15:21 πŸ”— dashcloud has joined #archiveteam
15:45 πŸ”— JesseW has quit IRC (Quit: Leaving.)
15:45 πŸ”— JesseW has joined #archiveteam
15:51 πŸ”— samp Medowar0: been doing that but I chose "Archiveteam's choice", will change for Yahoo answers. I can help with code too but looks like there's no current project in need of one, right?
15:56 πŸ”— JesseW we really should change "Archiveteam's choice" to point at Yahoo Answers for now
15:59 πŸ”— atomotic has joined #archiveteam
16:00 πŸ”— samp this is the main comm channel, right? I'll keep it on radar to follow the group. For now I think I'll try to run the warrior on my RPi, it was kind of useless up until now
16:03 πŸ”— Medowar0 samp JesseW: AT Choice currently is URLTeam due to the IP Ban issue of yahoo.
16:04 πŸ”— Medowar0 samp: If you want to help with coding, start writing a detection system for bans into yahoo grab and make it pause then. Because right now, they retry every minute and fail every task and then proceed to request a new one.
16:05 πŸ”— Medowar0 also there are many ways to help. Check the wiki for that.
16:05 πŸ”— samp looks good, I'll look into it
16:07 πŸ”— JesseW has quit IRC (Ping timeout: 370 seconds)
16:10 πŸ”— ravetcofx has joined #archiveteam
16:11 πŸ”— lsmag has joined #archiveteam
16:12 πŸ”— lsmag has quit IRC (Client Quit)
16:23 πŸ”— atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com)
16:41 πŸ”— Aoede has joined #archiveteam
17:04 πŸ”— hook54321 has joined #archiveteam
17:16 πŸ”— cadbury_ has quit IRC (Quit: leaving)
17:21 πŸ”— luckcolor samp: wich languages do you know to code in? If you want you can help in #chaingang
17:24 πŸ”— samp luckcolor: Python+JS in my main job, I know some other languages but I can also learn sth new, if needed
17:25 πŸ”— samp luckcolor: what's chaingang?
17:25 πŸ”— luckcolor wait
17:25 πŸ”— * luckcolor is getting the link from the wiki
17:26 πŸ”— samp sorry, I was doing the same thing
17:26 πŸ”— luckcolor http://archiveteam.org/index.php?title=ArchiveTeam_Chain_Gang
17:27 πŸ”— luckcolor to not flood this channel if you're still interested let's move the discussion to #chaingang
17:27 πŸ”— samp sure
17:33 πŸ”— Aoede has quit IRC (Quit: WeeChat 1.5)
17:37 πŸ”— cadbury_ has joined #archiveteam
18:43 πŸ”— AlexLehm has joined #archiveteam
19:01 πŸ”— BartoCH has joined #archiveteam
19:17 πŸ”— SilSte Do you have 4chan on your radar?
19:18 πŸ”— SilSte And i have a twitter user who I would like to save... its a private profile, but I can view it...
19:40 πŸ”— godane has quit IRC (Read error: Operation timed out)
19:41 πŸ”— godane has joined #archiveteam
19:58 πŸ”— bRick5772 has joined #archiveteam
20:10 πŸ”— maelstrom has joined #archiveteam
20:11 πŸ”— ete has joined #archiveteam
20:13 πŸ”— RichardG_ has joined #archiveteam
20:13 πŸ”— RichardG has quit IRC (Read error: Connection reset by peer)
20:13 πŸ”— ete sorry if this is the wrong place, but
20:13 πŸ”— ete http://news.berkeley.edu/2016/09/13/a-statement-on-online-course-content-and-accessibility/ "Federal government tells Berkeley they may not offer free online video courses, because they are discriminatory against deaf people who cannot hear the audio. Willing to reconsider if they translate them into sign language as well or add closed captioning, but the college says it can’t afford that and will probably just take the courses down. This is a metaphor for h
20:14 πŸ”— ete seems like something in danger / worth saving
20:17 πŸ”— ete has quit IRC (Remote host closed the connection)
20:33 πŸ”— bRick5772 has quit IRC (Ping timeout: 250 seconds)
20:34 πŸ”— bRick5772 has joined #archiveteam
21:15 πŸ”— BartoCH has quit IRC (Ping timeout: 260 seconds)
21:15 πŸ”— BartoCH has joined #archiveteam
21:18 πŸ”— schbirid has quit IRC (Quit: Leaving)
21:22 πŸ”— godane has quit IRC (Quit: Leaving.)
21:44 πŸ”— dashcloud has quit IRC (Read error: Operation timed out)
21:45 πŸ”— godane has joined #archiveteam
21:46 πŸ”— bRick5772 has quit IRC (Quit: Leaving.)
21:56 πŸ”— dashcloud has joined #archiveteam
22:06 πŸ”— dashcloud has quit IRC (Read error: Operation timed out)
22:09 πŸ”— dashcloud has joined #archiveteam
22:55 πŸ”— BartoCH has quit IRC (Quit: WeeChat 1.5)
22:58 πŸ”— AlexLehm has quit IRC (Ping timeout: 260 seconds)
23:01 πŸ”— tfgbd_znc has joined #archiveteam
23:16 πŸ”— RichardG_ has quit IRC (Ping timeout: 370 seconds)
23:20 πŸ”— JW_work ete (if you read the logs) β€” we know about it, but I'm not sure if there's an active effort to grab it. Help welcomed.
23:20 πŸ”— JW_work samp: URLTeam can also use help investigating shorteners; ask in #urlteam or check the wiki.
23:51 πŸ”— JesseW has joined #archiveteam

irclogger-viewer