[05:03] *** jodizzle_ has joined #webroasting [05:13] *** jodizzle has quit IRC (Ping timeout: 864 seconds) [08:13] *** wessel151 has quit IRC (Ping timeout: 260 seconds) [08:29] *** wessel151 has joined #webroasting [08:34] i am working on http://members.home.nl/ and http://members.chello.nl/ and scrape google for urls [08:34] only i need help white post progressing https://drive.google.com/open?id=1w3OrJt4hfdpWQWDeqUFHpLSMv-wXYEdb [13:09] *** t3 has quit IRC (Read error: Connection timed out) [13:11] *** t3 has joined #webroasting [14:01] *** yano has quit IRC (Ping timeout: 258 seconds) [14:38] *** Ryz has quit IRC (Remote host closed the connection) [14:38] *** kiska18 has quit IRC (Remote host closed the connection) [14:58] JAA can you help me with http://members.home.nl/ and http://members.chello.nl/ [15:15] *** yano has joined #webroasting [15:18] wessel151: Don't worry about post-processing, just extract stuff for now. [15:20] Also, please upload your results to https://transfer.notkiska.pw/. Much easier to download from there than Google Drive. [15:21] i have already found over unique 200000 items [15:23] do you want the excel ore just the urls [15:23] in a txt doc [15:23] Text files are preferred. [15:24] But CSV works as well. [16:17] *** Ryz has joined #webroasting [16:24] *** jodizzle_ is now known as jodizzle [18:06] *** wessel152 has joined #webroasting [18:08] the fist list https://transfer.notkiska.pw/HEJhh/ziggodump1.txt [18:09] not processed [18:26] 232770 items [18:33] JAA do you think that this is maybe a warrior project [18:39] I don't know. DPoS projects don't work very well for recursive crawls though. [23:16] *** wessel152 has quit IRC (Ping timeout: 262 seconds) [23:32] *** VADemon_ has joined #webroasting [23:36] *** VADemon has quit IRC (Ping timeout: 260 seconds)