#urlteam 2015-11-02,Mon

↑back Search

Time Nickname Message
00:01 πŸ”— arkiver Atluxity: JesseW: I can add you as admin tomorrow if you'd like
00:01 πŸ”— JesseW arkiver: sounds good. no hurry
00:01 πŸ”— arkiver ok!
00:02 πŸ”— arkiver JesseW: will also have a look tomorrow at adding more shorteners
00:02 πŸ”— JesseW excellent! it'd be good to write up some more documentation on how to identify the structure of a shortner
00:02 πŸ”— JesseW i.e. is it incremental or not, what's the alphabet, etc.
00:09 πŸ”— arkiver JesseW: I agree, will see about doing that
00:11 πŸ”— ersi has quit IRC (Read error: Operation timed out)
00:11 πŸ”— ersi_ has joined #urlteam
01:38 πŸ”— Coderjoe has quit IRC (Ping timeout: 255 seconds)
01:46 πŸ”— Coderjoe has joined #urlteam
02:29 πŸ”— FriarGius has quit IRC (Leaving)
02:29 πŸ”— FriarGius has joined #urlteam
02:37 πŸ”— JesseW has quit IRC (Leaving.)
03:00 πŸ”— Ctrl-S__ has joined #urlteam
03:08 πŸ”— JesseW has joined #urlteam
03:16 πŸ”— FriarGius has quit IRC (Leaving)
03:26 πŸ”— Start has quit IRC (Quit: Disconnected.)
03:29 πŸ”— Start has joined #urlteam
04:09 πŸ”— Atluxity arkiver: Ok
04:23 πŸ”— aaaaaaaaa has quit IRC (Leaving)
05:15 πŸ”— svchfoo1 has quit IRC (Ping timeout: 369 seconds)
05:15 πŸ”— chazchaz has quit IRC (Ping timeout: 369 seconds)
05:16 πŸ”— atlogbot has quit IRC (Ping timeout: 369 seconds)
05:34 πŸ”— FriarGius has joined #urlteam
05:39 πŸ”— svchfoo1 has joined #urlteam
05:39 πŸ”— atlogbot has joined #urlteam
05:40 πŸ”— chazchaz has joined #urlteam
05:40 πŸ”— svchfoo3 sets mode: +o svchfoo1
05:55 πŸ”— atlogbot has quit IRC (Read error: Operation timed out)
06:00 πŸ”— svchfoo1 has quit IRC (Ping timeout: 369 seconds)
06:00 πŸ”— chazchaz has quit IRC (Ping timeout: 369 seconds)
06:05 πŸ”— Deewiant has joined #urlteam
06:06 πŸ”— svchfoo1 has joined #urlteam
06:06 πŸ”— svchfoo3 sets mode: +o svchfoo1
06:06 πŸ”— atlogbot has joined #urlteam
06:06 πŸ”— chazchaz has joined #urlteam
06:41 πŸ”— chazchaz has quit IRC (Read error: Operation timed out)
06:41 πŸ”— atlogbot has quit IRC (Ping timeout: 369 seconds)
06:49 πŸ”— svchfoo1 has quit IRC (Ping timeout: 369 seconds)
06:54 πŸ”— chazchaz has joined #urlteam
07:07 πŸ”— svchfoo1 has joined #urlteam
07:07 πŸ”— atlogbot has joined #urlteam
07:07 πŸ”— svchfoo3 sets mode: +o svchfoo1
07:28 πŸ”— svchfoo1 has quit IRC (Ping timeout: 369 seconds)
07:35 πŸ”— atlogbot has quit IRC (Ping timeout: 369 seconds)
07:41 πŸ”— atlogbot has joined #urlteam
07:41 πŸ”— svchfoo1 has joined #urlteam
07:42 πŸ”— svchfoo3 sets mode: +o svchfoo1
08:11 πŸ”— JesseW has quit IRC (Leaving.)
09:05 πŸ”— ersi_ is now known as ersi
09:05 πŸ”— svchfoo3 sets mode: +o ersi
10:54 πŸ”— Muad-Dib has joined #urlteam
14:00 πŸ”— FriarGius has quit IRC (Leaving)
14:09 πŸ”— jornane has joined #urlteam
14:14 πŸ”— jornane hei, i've stumbled upon this project on the internets
14:14 πŸ”— jornane I would like to help scraping, but I was wondering how alive this project is… The last torrent is from July 20th, 2013 and the next release is planned around January 2014
14:15 πŸ”— phuzion jornane: we're always scraping URL shorteners.
14:16 πŸ”— phuzion If you want to get started with this, follow the instructions on this page to download a Warrior, and select URLTeam as your project: http://archiveteam.org/index.php?title=Warrior
14:17 πŸ”— jornane I've read up on the possibilities, I was planning on running this on an ESXi box on a clean shared gigabit line
14:17 πŸ”— phuzion Alternatively, if you have a *nix system available, you can run the urlteam pipeline directly, the instructions are here on the git repo: https://github.com/ArchiveTeam/terroroftinytown-client-grab
14:17 πŸ”— jornane ah that might be easier
14:18 πŸ”— phuzion Depends on what your definition of easy is.
14:18 πŸ”— phuzion If you've already got the ESX box, it might be easier to just throw the .OVA file at it and boot it.
14:19 πŸ”— jornane but i'm still wondering what happend to the release-cycle ;)
14:19 πŸ”— phuzion I'm not exactly sure when the torrents get released, to be honest.
14:19 πŸ”— phuzion But I can assure you this project is still active
14:22 πŸ”— jornane so is it possible to see the progress somewhere else? I checked http://www.archiveteam.org/index.php?title=URLTeam, but it refers to the torrents, and some url shorteners are published externally on IA
14:30 πŸ”— phuzion jornane: If you want realtime data, check the tracker in the topic (last link)
15:17 πŸ”— Start has quit IRC (Quit: Disconnected.)
15:17 πŸ”— achip URLTeam releases also end up in IA https://archive.org/search.php?query=URLTeam+Release&sort=date
15:42 πŸ”— Start has joined #urlteam
17:01 πŸ”— JesseW has joined #urlteam
17:02 πŸ”— marvinw_ has quit IRC (Read error: Operation timed out)
17:10 πŸ”— Start has quit IRC (Read error: Operation timed out)
17:12 πŸ”— Start has joined #urlteam
17:15 πŸ”— JesseW has quit IRC (Leaving.)
18:36 πŸ”— Start has quit IRC (Quit: Disconnected.)
18:42 πŸ”— joepie91 has quit IRC (Read error: Operation timed out)
18:45 πŸ”— joepie91 has joined #urlteam
18:45 πŸ”— svchfoo1 sets mode: +o joepie91
18:48 πŸ”— aaaaaaaaa has joined #urlteam
18:48 πŸ”— swebb sets mode: +o aaaaaaaaa
18:51 πŸ”— aaaaaaaaa has quit IRC (Client Quit)
19:11 πŸ”— Start has joined #urlteam
19:19 πŸ”— Start has quit IRC (Quit: Disconnected.)
19:52 πŸ”— SimpBrain has quit IRC (Leaving)
19:54 πŸ”— aaaaaaaaa has joined #urlteam
19:54 πŸ”— swebb sets mode: +o aaaaaaaaa
20:02 πŸ”— phuzion More threads thrown at urlteam :)
20:25 πŸ”— slang has joined #urlteam
20:33 πŸ”— aaaaaaaa_ has joined #urlteam
20:33 πŸ”— aaaaaaaaa has quit IRC (Read error: Connection reset by peer)
20:33 πŸ”— swebb sets mode: +o aaaaaaaa_
20:34 πŸ”— aaaaaaaa_ is now known as aaaaaaaaa
20:34 πŸ”— JW_work has joined #urlteam
20:35 πŸ”— JW_work jornane: http://urlte.am is out of date.
20:36 πŸ”— JW_work Since last November, new data is released directly to internet archive items, usually about once a day. There are currently over 300 such items.
20:36 πŸ”— JW_work They can be downloaded via bittorrent, or directly from IA.
20:37 πŸ”— JW_work Each one contains multiple .zip files (one per url shortener scraped during that day). The zip files contain .xz files, which when decompressed, are plain text in BECON format, i.e. shortcode vertical-bar longURL
20:45 πŸ”— aaaaaaaaa has quit IRC (Read error: Connection reset by peer)
20:46 πŸ”— aaaaaaaaa has joined #urlteam
20:46 πŸ”— swebb sets mode: +o aaaaaaaaa
20:47 πŸ”— aaaaaaaaa has quit IRC (Client Quit)
20:55 πŸ”— Start has joined #urlteam
21:17 πŸ”— aaaaaaaaa has joined #urlteam
21:17 πŸ”— swebb sets mode: +o aaaaaaaaa
21:58 πŸ”— SimpBrain has joined #urlteam
22:09 πŸ”— aaaaaaaa_ has joined #urlteam
22:09 πŸ”— aaaaaaaaa has quit IRC (Read error: Connection reset by peer)
22:09 πŸ”— swebb sets mode: +o aaaaaaaa_
22:09 πŸ”— aaaaaaaa_ is now known as aaaaaaaaa
22:16 πŸ”— Start has quit IRC (Quit: Disconnected.)
22:35 πŸ”— marvinw has joined #urlteam
23:21 πŸ”— Start has joined #urlteam

irclogger-viewer