[00:03] *** DoomTay has joined #archiveteam-bs [00:11] oh neat! [00:30] *** Honno has quit IRC (Read error: Operation timed out) [00:36] *** DoomTay has quit IRC (Quit: Page closed) [00:36] *** DoomTay has joined #archiveteam-bs [01:09] *** DoomTay has quit IRC (Quit: Page closed) [01:19] *** schbirid has quit IRC (Ping timeout: 260 seconds) [01:31] *** schbirid has joined #archiveteam-bs [01:41] *** DoomTay has joined #archiveteam-bs [03:19] joepie91: I just tried to point someone to pdf.yt, and they tell me uploads are disabled? [03:59] *** kristian_ has quit IRC (Leaving) [04:13] *** DoomTay has quit IRC (Quit: Page closed) [04:29] *** BlueMaxim has joined #archiveteam-bs [04:30] *** Sk1d has quit IRC (Ping timeout: 194 seconds) [04:38] *** Sk1d has joined #archiveteam-bs [04:39] *** Fusl has quit IRC (Read error: Operation timed out) [04:53] *** Meroje has quit IRC (Quit: bye!) [05:09] *** Fusl has joined #archiveteam-bs [06:38] *** DoomTay has joined #archiveteam-bs [06:38] *** Fusl has quit IRC (Read error: Operation timed out) [06:40] *** DoomTay has quit IRC (Client Quit) [06:43] *** Meroje has joined #archiveteam-bs [06:44] *** Meroje has quit IRC (Client Quit) [06:44] *** Meroje has joined #archiveteam-bs [06:45] *** Fusl has joined #archiveteam-bs [06:54] lol https://web.archive.org/web/20160804202710/https://archive.org/details/nintendopower [06:56] recursion [06:56] *** dashcloud has quit IRC (Read error: Operation timed out) [07:01] *** dashcloud has joined #archiveteam-bs [07:38] dashcloud: yes they are currently [07:47] *** dashcloud has quit IRC (Read error: Operation timed out) [07:50] *** dashcloud has joined #archiveteam-bs [08:12] *** dashcloud has quit IRC (Read error: Operation timed out) [08:15] *** Honno has joined #archiveteam-bs [08:16] *** tomwsmf has quit IRC (Read error: Operation timed out) [08:16] *** dashcloud has joined #archiveteam-bs [08:25] *** Fletcher_ has quit IRC (Ping timeout: 250 seconds) [09:31] *** PurpleSym has quit IRC (Ping timeout: 506 seconds) [09:32] *** PurpleSym has joined #archiveteam-bs [09:32] *** PotcFdk has quit IRC (Ping timeout: 506 seconds) [09:32] *** i0npulse has quit IRC (Ping timeout: 506 seconds) [11:35] *** vitzli has joined #archiveteam-bs [11:36] *** dashcloud has quit IRC (Read error: Operation timed out) [11:37] *** schbirid has quit IRC (Quit: Leaving) [11:42] *** dashcloud has joined #archiveteam-bs [12:00] *** VADemon has joined #archiveteam-bs [12:09] *** dashcloud has quit IRC (Read error: Operation timed out) [12:13] *** dashcloud has joined #archiveteam-bs [13:06] *** BlueMaxim has quit IRC (Quit: Leaving) [13:10] *** kristian_ has joined #archiveteam-bs [13:29] *** dashcloud has quit IRC (Read error: Operation timed out) [13:34] *** dashcloud has joined #archiveteam-bs [13:37] *** kristian_ has quit IRC (Leaving) [13:47] *** schbirid has joined #archiveteam-bs [13:49] joepie91: reason? :) [14:21] *** Silvan has quit IRC (Remote host closed the connection) [14:21] *** SilSte has joined #archiveteam-bs [15:14] Uploads are going for Orkut [15:14] Portalgraphics was done pretty easily. [15:16] *** i0npulse has joined #archiveteam-bs [15:24] midas: need to scale, scaling costs time, and I am short on time on account of trying to pay my bills [15:24] :P [15:24] p [15:24] :p [15:33] heh. https://www.reddit.com/r/geek/comments/4vyu4i/archiveorg_now_has_every_issue_of_nintendo_power/d62ujd2 [15:33] "You don't have to worry about Archive.org taking the magazins down- they're hosted there to prevent that very problem!" [15:33] >.> [15:36] clearly someone should have backed them up to the cloud [15:37] no matter how awesome and well-intended a host could ever be, things outside YOUR control are outside your control and could vanish any second [15:37] yerp [15:38] even IA has to contend with DMCA silliness [15:38] also, YAY SATA ERRORS [15:38] or maybe trump decides to stop serving US internet to foreigners [15:41] *** DoomTay has joined #archiveteam-bs [15:57] *** VADemon has quit IRC (Quit: left4dead) [16:26] *** brayden_ has joined #archiveteam-bs [16:26] *** swebb sets mode: +o brayden_ [16:32] *** brayden has quit IRC (Read error: Operation timed out) [16:34] *** fie_ has joined #archiveteam-bs [16:34] *** fie__ has quit IRC (Read error: Connection reset by peer) [16:43] *** fie__ has joined #archiveteam-bs [16:43] *** fie_ has quit IRC (Read error: Connection reset by peer) [16:44] whoever just crashed newsbuddy with a syn flood, I hope you are proud [16:45] so looks like i will have to download the odd number news node from sbs.com.au [16:45] *** fie__ has quit IRC (Read error: Connection reset by peer) [16:45] *** fie__ has joined #archiveteam-bs [16:46] btw without my grab you would have about 6 pages: https://web.archive.org/web/*/www.sbs.com.au/news/node/100* [16:47] at least with the urls starting with 100 [16:52] HCross: huge assumption it was someone from here... [16:53] Igloo: i'm not sure you understand HCross's tone [16:55] its just I get home after a long day, to a stream of alerts about high CPU usage and high RAM usage, and then I jump into KVM and see SYN floods and OOMs [16:55] *** fie__ has quit IRC (Read error: Connection reset by peer) [16:55] *** fie__ has joined #archiveteam-bs [16:59] Yeah, HCross is sharing his frustration. You know, WITH HIS COLLEAGUES [17:00] * SketchCow peeks out from the Amiga Emulation limo [17:00] *** purplebot has quit IRC (Quit: ZNC - http://znc.in) [17:00] *** PurpleSym has quit IRC (Quit: *) [17:01] *** purplebot has joined #archiveteam-bs [17:03] *** PurpleSym has joined #archiveteam-bs [17:03] xmc it's hard to tell over text based messaging :) [17:03] ¯\_(ツ)_/¯ works for me [17:03] I was confused too [17:03] :p [17:13] *** fie_ has joined #archiveteam-bs [17:13] *** fie__ has quit IRC (Read error: Connection reset by peer) [17:20] *** fie__ has joined #archiveteam-bs [17:20] *** fie_ has quit IRC (Read error: Connection reset by peer) [17:33] things are getting increasingly more shit: https://stdlib.com/ [17:34] what the crap is this [17:34] https://static.kurtmclester.com/s/31122388.png [17:34] this looks like cancer [17:35] Frogging: https://www.reddit.com/r/node/comments/4wqqd9/introducing_stdlibcom_building_a_standard_library [17:36] uh-huh [17:38] lol [17:38] looks like it was previously called Polybit [17:38] http://polybit.com/index [17:42] https://xkcd.com/927/ [17:47] *** RedType has left [17:47] hey HCross you think you're having issues [17:47] Some shitforbrains is trying to bruteforce my wordpress install [17:48] using the xmlrpc exploit [17:50] *** vitzli has quit IRC (Quit: Leaving) [18:19] *** dashcloud has quit IRC (Read error: Operation timed out) [18:22] *** dashcloud has joined #archiveteam-bs [18:28] k. I'm having noob issues with grab-site: [18:28] "First, start the dashboard with: ~/.local/bin/gs-server" - https://github.com/ludios/grab-site [18:28] bash: /home/ubuntu/.local/bin/gs-server: No such file or directory [18:28] What am i doing wrong? [18:29] you first have to create the virtual env [18:30] i mean i suggest you to use an virtual env [18:30] pyvenv-3.4 ~/gs-venv [18:31] to activate the virtual env you have to run: . ~/gs-venv/bin/activate [18:31] pip3 install git+https://github.com/ludios/grab-site [18:31] it's better to do it this way because grab-site use a different version of wpull that is not the latest one [18:31] *uses [18:32] hook54321: [18:32] bash: /home/ubuntu/gs-venv/bin/activate: Permission denied [18:32] Frogging: https://www.reddit.com/r/node/comments/4wqqd9/introducing_stdlibcom_building_a_standard_library/d6apqz7?context=10000 [18:32] lmao [18:32] it's a good question :p [18:34] it's been so long since i've used ubuntu. I'm just running it off of a USB right now. :/ [18:35] still runs better than windows though. [18:38] *** DoomTay has quit IRC (Quit: Page closed) [18:39] hook54321: bin/activate is a script meant to be sourced, not executed [18:47] i think i've lost my linux terminal touch, i used to be able to do most of this, now i don't even know to do at this point [18:50] *** fie_ has joined #archiveteam-bs [18:50] *** fie__ has quit IRC (Read error: Connection reset by peer) [18:51] *** godane has left [18:51] *** godane has joined #archiveteam-bs [19:01] Is anyone with FOS access around to do something for me? Working with OVH on ironing out some speed issues and they want some MTR's from FOS's end if possible [19:02] FOS? [19:02] HCross: yeah sure [19:03] what command is most useful [19:03] mtr [host]? [19:03] HCross: whoa. actual OVH support? [19:03] :P [19:04] yep. one of the staffers on their IRC really likes the IA and everything and is really helping [19:04] yipdw can you mtr to blog.leech0r.co.uk (me) please (also if possible can we run iperf?) [19:05] running [19:05] iperf -c -P5 -r using what Igloo said [19:06] Let me run iperf server 1 mo [19:07] HCross: heh, neat [19:08] iperf running now [19:08] Thanks very much [19:09] nice [19:11] *** DoomTay has joined #archiveteam-bs [19:11] how do i give a cookie to grab-site? [19:12] *** fie_ has quit IRC (Read error: Operation timed out) [19:13] use --wpull-args=ARGS to pass extra args to wpull [19:14] and --load-cookies FILE i think [19:14] https://wpull.readthedocs.io/en/master/index.html [19:15] *** fie_ has joined #archiveteam-bs [19:18] yup [19:19] will the terminal show the same stuff that the dashboard shows during the job? [19:19] *** Honno has quit IRC (Read error: Operation timed out) [19:22] thanks yipdw - OVH are making very clear progress on it right now [19:37] now whenever I use a cookie with wget it goes bad almost immediately :/ [19:41] well then. looks like somebody wants a piece of the pie: http://storage4.static.itmages.com/i/16/0809/h_1470771603_1638256_4574ffb85b.png [19:43] the joe pie? ;p [19:43] is this legit? [19:44] Frogging: that depends on your definition of 'legit' [19:44] I consider the entirety of the marketing industry to be scum [19:44] me too [19:44] but yes, this really is an offer from a real advertising company [19:44] neta [19:44] neat* [19:44] not sure I'd describe it that way :P [19:44] I'm going to think very, /very/ carefully about exactly how to respond to this [19:45] if you don't want it i'm sure you can safely ignore them [19:45] *** Coderjoe has quit IRC (Read error: Operation timed out) [19:46] Frogging: oh, I can, but I'm not convinced that that's the most productive solution [19:47] Frogging: this seems like a potentially nice entry point to write a blog post about why advertising is evil, but I need to work out a way to do that without being a complete ass :) [19:47] hehe [19:54] why would you not want to be a complete ass when talking about complete asses? [19:56] schbirid: because if not done carefully, it tends to get you ignored [19:56] i am rubber, you are glue [19:56] I have IRC for unfiltered ranting, blogposts require a bit more strategy to have the maximum effect :) [19:57] anyway, schbirid, goal of ranty blog posts is to rant enough that it pisses people off and makes them pass it around, but not so much that it makes people go "lol, what a paranoid idiot" and close the tab [19:58] :) [19:58] I'm perfectly fine with people posting the link everywhere and going "lol, would you look at this idiot", though [19:58] still has an effect :P [20:03] yeah you want to make a good argument and not give off the lunatic vibe [20:03] Looking at you, Dear Hollywood [20:09] *** Coderjoe has joined #archiveteam-bs [20:09] *** DoomTay has quit IRC (Quit: Page closed) [20:32] Frogging: preeeetty much [20:45] *** dashcloud has quit IRC (Read error: Operation timed out) [20:49] *** dashcloud has joined #archiveteam-bs [20:53] *** Honno has joined #archiveteam-bs [20:56] have any of you seen this website http://www.watchcartoononline.com/ [20:58] without adblocker its pretty bad, but It's a really amazing site that has aggregated many many cartoons and anime's [20:58] *** tomwsmf has joined #archiveteam-bs [20:58] I've been scraping some of my favorite cartoon network shows off of it with https://github.com/yasoob/watchcartoononline-dl [21:00] interesting [21:00] they actually host it themselves [21:00] well, watchanimesub.net does [21:00] probably the same site [21:02] I'm suppirsed they're still up, they've been around for years [21:04] It's amazing that they have the network resources to host that amount of streaming, those ads must pay a lot [21:25] *** DoomTay has joined #archiveteam-bs [21:30] Is there a reason why the cookies are expiring whenever i try to archive the site? :/ [21:33] *** SketchCow has quit IRC (Remote host closed the connection) [21:33] hook54321: what are you using to archive it? [21:43] *** SketchCow has joined #archiveteam-bs [21:43] *** midas sets mode: +o SketchCow [21:43] *** swebb sets mode: +o SketchCow [21:49] ravetcofx: grab-site [22:00] Could grab-site be unintentionally logging itself out of the site? [22:01] only if the logout button is a get request i think [22:02] you could Ignore Set the logout buttons anyway [22:02] *** kristian_ has joined #archiveteam-bs [22:02] so if even that was the case it wouldn't happen [22:21] *** Honno has quit IRC (Read error: Operation timed out) [22:23] luckcolor: does --ig work on grab-site? [22:24] Try it [22:25] "Error: no such option: --ig (Possible options: --ignore-sets, --igoff, --igon, --igsets)" [22:25] Hmm [22:25] Maybe try making an igset and go from there? [22:26] is the igsets file supposed to be empty? [22:26] * DoomTay shrugs [22:26] well, practically empty. [22:27] oh [22:30] where are the igsets sets stored? [22:31] maybe grab-site has none by default. you can write some, and the ArchiveBot ones are at https://github.com/ArchiveTeam/ArchiveBot/tree/master/db/ignore_patterns [22:33] it does, but i don't know where they are stored. [22:33] https://github.com/ludios/grab-site/tree/master/libgrabsite/ignore_sets [22:34] You mean, like, on your machine? [22:35] yeah [22:35] well, on my flashdrive. :P [22:36] I'm on a library computer. [22:36] well, kinda [22:42] Rental? [22:42] Rental laptop? [22:42] does that exist? [22:43] My university's library allows laptop check-outs [22:43] oh [22:43] hook54321: try grep [22:43] idk [22:43] :p [23:17] DoomTay: Python looks to be an option so I'd use that since I'm most familiar with it [23:18] I'm pretty sure most, if not all of AT's tools are in Python [23:18] So...win-win? [23:18] Lua's often used as well [23:21] I kinda feel like writing articles on "too late" site death situations even though that would only be useful to time travellers [23:48] I have walked 10 miles [23:48] good [23:48] bad news... 10 miles from my car. gotta walk back! [23:48] 20 miles in a day! [23:48] burn them calories [23:52] *** BlueMaxim has joined #archiveteam-bs [23:52] Maybe should have taken a circular path