[03:08] *** bwn has quit IRC (Ping timeout: 492 seconds) [04:11] *** bwn has joined #urlteam [04:41] *** JesseW has quit IRC (Quit: Leaving.) [05:32] *** JesseW has joined #urlteam [05:32] *** svchfoo3 sets mode: +o JesseW [06:04] *** JesseW has quit IRC (Quit: Leaving.) [06:13] *** WinterFox has joined #urlteam [06:32] *** bwn has quit IRC (Ping timeout: 492 seconds) [07:31] *** bwn has joined #urlteam [10:26] *** WinterFox has quit IRC (Remote host closed the connection) [14:19] *** Start has quit IRC (Quit: Disconnected.) [15:34] *** Start has joined #urlteam [16:07] *** Start has quit IRC (Quit: Disconnected.) [16:28] *** JesseW has joined #urlteam [16:29] *** svchfoo1 sets mode: +o JesseW [16:49] *** JesseW has quit IRC (Quit: Leaving.) [17:43] *** Start has joined #urlteam [18:02] *** Start has quit IRC (Quit: Disconnected.) [18:18] *** Start has joined #urlteam [19:40] *** bwn has quit IRC (Ping timeout: 246 seconds) [19:44] *** Start has quit IRC (Quit: Disconnected.) [20:14] *** bwn has joined #urlteam [20:44] *** newbie_ has joined #urlteam [20:44] newbie_: jessew who is here several times a day is heading up urlteam [20:45] he's not here now though [20:45] hi #urlteam i made my way here from the ArchiveBot page on the archiveteam wiki [20:45] however he will see what you say, because he looks at logs [20:46] i'd like to crawl http://bernie.to to discover all the redirects it currently has running. it's being used like a URL shortening service for the bernie sanders campaign. @major in #archivebot told me to talk it up here [20:46] *** JW_work has joined #urlteam [20:46] (major is a bot) [20:47] @xmc are you a bot? [20:47] i am not a bot [20:47] hello human! [20:47] sure, grabbing bernie.to seems like a good idea [20:47] (I'm jessew, but at work) [20:48] thank you @JW_work is it something i could help with? [20:49] certainly — look at http://www.archiveteam.org/index.php?title=URLTeam#Researching_URL_Shorteners and answer the questions listed there for bernie.to [20:49] ok thank you! [20:50] once we have that info, I can add it as a warrior job (assuming it works in a typical way) or else someone will have to write some custom code, which can then get added to to the Warrior job. [20:51] Thanks for noticing it exists, and asking us about it! [20:51] same to you! very interesting project the archiveteam, glad i found you all! have a good day! [20:52] *** newbie_ has quit IRC (Quit: Page closed) [21:08] Looking on reddit, it looks like all the shortcodes are custom words, not automatically generated shortcodes. [21:08] That won't work particularly well with the Warrior job. [21:09] You'd probably be better to just archive the contents of https://www.reddit.com/domain/bernie.to [21:12] but if you do generate a list of codes, it should be easy enough to convert them into long urls — as the long url is listed (twice) in the returned page [22:42] dictionary attack :p [22:55] sure, but we don't (yet) have code for that. [23:38] *** Start has joined #urlteam