#archiveteam-bs 2017-07-28,Fri

↑back Search

Time Nickname Message
00:01 🔗 w0rp has joined #archiveteam-bs
00:17 🔗 ZexaronS has quit IRC (Quit: Leaving)
00:45 🔗 Ravenloft has quit IRC (Ping timeout: 260 seconds)
00:55 🔗 dashcloud has quit IRC (Read error: Operation timed out)
00:56 🔗 BlueMaxim has joined #archiveteam-bs
00:59 🔗 dashcloud has joined #archiveteam-bs
01:00 🔗 zyphlar has joined #archiveteam-bs
01:05 🔗 ZexaronS has joined #archiveteam-bs
02:07 🔗 j08nY has quit IRC (Quit: Leaving)
02:42 🔗 pizzaiolo has quit IRC (pizzaiolo)
03:46 🔗 ZexaronS has quit IRC (Leaving)
03:58 🔗 godane looks like my script along time ago didn't upload alot of reuters.com videos
03:58 🔗 qw3rty6 has joined #archiveteam-bs
04:00 🔗 godane i downloaded the download pages to grab a list of items to check if they were all upload and turned up there not
04:01 🔗 godane there is about 5gb of video not uploaded for the 2008 alone
04:03 🔗 qw3rty5 has quit IRC (Read error: Operation timed out)
04:24 🔗 dashcloud has quit IRC (Read error: Connection reset by peer)
04:32 🔗 dashcloud has joined #archiveteam-bs
04:45 🔗 Sk1d has quit IRC (Ping timeout: 250 seconds)
04:52 🔗 Sk1d has joined #archiveteam-bs
05:01 🔗 Meroje has quit IRC (Ping timeout: 260 seconds)
05:02 🔗 BnAboyZ66 has quit IRC (Ping timeout: 260 seconds)
05:04 🔗 Meroje has joined #archiveteam-bs
05:05 🔗 mundus201 is now known as mundus
05:24 🔗 ld1 has quit IRC (Ping timeout: 260 seconds)
05:24 🔗 ld1 has joined #archiveteam-bs
05:25 🔗 svchfoo1 has quit IRC (Quit: Closing)
05:40 🔗 godane has left
05:40 🔗 godane has joined #archiveteam-bs
05:41 🔗 Stiletti is now known as Stiletto
06:13 🔗 ld1 has quit IRC (Ping timeout: 260 seconds)
06:14 🔗 ld1 has joined #archiveteam-bs
06:56 🔗 kristian_ has joined #archiveteam-bs
07:15 🔗 pikhq has quit IRC (Ping timeout: 268 seconds)
08:39 🔗 chazchaz_ has quit IRC (Read error: Operation timed out)
08:39 🔗 dxrt- has quit IRC (Read error: Operation timed out)
08:41 🔗 espes__ has quit IRC (Ping timeout: 268 seconds)
08:42 🔗 chazchaz has joined #archiveteam-bs
08:44 🔗 dxrt- has joined #archiveteam-bs
08:47 🔗 espes__ has joined #archiveteam-bs
08:51 🔗 kristian_ has quit IRC (Quit: Leaving)
08:55 🔗 Honno has quit IRC (Read error: Operation timed out)
10:19 🔗 j08nY has joined #archiveteam-bs
10:28 🔗 t2t2 yuku returning 403 for warrior user-agent, firefox ok.
10:29 🔗 GLaDOS that's rude
10:32 🔗 BlueMaxim has quit IRC (Quit: Leaving)
10:33 🔗 joepie91 I guess we're Firefox now? :p
10:42 🔗 GLaDOS should we make some random useragent generator?
10:43 🔗 GLaDOS as in, random per-client, perhaps based on reported nickname
10:53 🔗 t2t2 has quit IRC (Quit: "goodbye uptime")
10:55 🔗 wp494 just hash it
10:55 🔗 tuluu has joined #archiveteam-bs
10:58 🔗 pizzaiolo has joined #archiveteam-bs
11:01 🔗 pikhq has joined #archiveteam-bs
11:06 🔗 BartoCH has joined #archiveteam-bs
11:07 🔗 jspiros has quit IRC (leaving)
11:07 🔗 RichardG has quit IRC (Ping timeout: 260 seconds)
11:10 🔗 jspiros has joined #archiveteam-bs
11:49 🔗 SketchCow GLaDOS: My opinion is an optional random user-agent generator is not a bad feature to have that can be implemented if needed.
11:49 🔗 SketchCow But not as default behavior
11:58 🔗 GLaDOS yeah, definitely optional
11:58 🔗 midas we had the same issue with soundcloud
12:01 🔗 GLaDOS from a site perspective, it might be best to base it on the public IP
12:01 🔗 GLaDOS maybe that hashed together with the username
12:02 🔗 GLaDOS although the best way for the pipeline to get its public IP would have to be figured out
12:03 🔗 GLaDOS perhaps the tracker reports it back when you retreive a job?
12:31 🔗 quantum has joined #archiveteam-bs
12:34 🔗 godane has quit IRC (Read error: Operation timed out)
13:10 🔗 t2t2 has joined #archiveteam-bs
13:40 🔗 sep332 has quit IRC (Quit: konversation out)
13:41 🔗 sep332 has joined #archiveteam-bs
13:55 🔗 quantum has quit IRC (Ping timeout: 268 seconds)
14:05 🔗 godane has joined #archiveteam-bs
15:02 🔗 RichardG has joined #archiveteam-bs
15:18 🔗 pikhq has quit IRC (Read error: Operation timed out)
15:21 🔗 schbirid has joined #archiveteam-bs
15:25 🔗 pikhq has joined #archiveteam-bs
15:42 🔗 pikhq has quit IRC (Ping timeout: 268 seconds)
15:48 🔗 pikhq has joined #archiveteam-bs
16:15 🔗 godane SketchCow: i'm uploading HeroesRebornNBC youtube channel on to FOS
16:15 🔗 godane i will be in Dead-Youtube-Channels
16:17 🔗 godane there are only 2 videos on the channel now
16:17 🔗 godane but i got 72 videos from it in the past
18:06 🔗 dashcloud has quit IRC (Read error: Operation timed out)
18:25 🔗 j08nY has quit IRC (Quit: Leaving)
18:29 🔗 dashcloud has joined #archiveteam-bs
18:30 🔗 Soni has quit IRC (Ping timeout: 272 seconds)
18:32 🔗 schbirid http://libgen.io/robots.txt
18:33 🔗 xmc huh
18:34 🔗 Soni has joined #archiveteam-bs
18:34 🔗 schbirid http://gen.lib.rus.ec/robots.txt too
18:34 🔗 schbirid no idea if new
18:36 🔗 schbirid https://www.reddit.com/r/Scholar/comments/6puywe/meta_libgen_article_repository_is_down/
18:42 🔗 schbirid noone seeding https://thepiratebay.org/torrent/11674459/The+Library+Genesis+SciMag+Repository+2015-01-31+%28torrents+only%29 :(
18:46 🔗 schbirid some http://torrentproject.se/?t=scimag
18:51 🔗 fie has quit IRC (Ping timeout: 268 seconds)
19:10 🔗 mundus Someone wanna update the current running warrior project to yuku?
19:24 🔗 t2t2 mundus: it's not returning 403 for every request anymore?
19:25 🔗 mundus what?
19:25 🔗 mundus It's just the active project
19:25 🔗 ItsYoda has quit IRC (Quit: rippppp to the yoda you used to know!)
19:28 🔗 ItsYoda has joined #archiveteam-bs
19:29 🔗 zino has quit IRC (Quit: Leaving)
19:32 🔗 Whopper has quit IRC (Read error: Operation timed out)
19:32 🔗 tobbez Just tried starting it, got one item. All the fetches for it returned 403. Opening one of those urls manually gives a phpbb sql error: "You have an error in your SQL syntax; check the manual that corresponds to your MariaDB server version for the right syntax to use near 'RELECT * FROM conservativeallies_users WHERE user_id = 0' at line 1 [1064]"
19:34 🔗 hook54321 Fyi, https://www.reddit.com/r/opiaterollcall/ was recently banned, but Google still has some cached pages.
19:38 🔗 Whopper has joined #archiveteam-bs
19:49 🔗 Honno has joined #archiveteam-bs
19:49 🔗 tobbez Re: yuku, I obviously can't be certain (since I don't know what the topic of a given thread is supposed to be and so can't verify it's still the same content), but it seems the urls to access threads was changed from e.g. http://conservativeallies.yuku.com/topic/9280/ to http://conservativeallies.yuku.com/slug-usually-goes-here-t9280.html (the minimal fake slug you can get away with would be
19:49 🔗 tobbez .../-t9280.html) (if you want the canonical url you'll have to extract it from the fetched page)
19:49 🔗 godane so 1154 flv files was missing in reuters.com video 2008 uploads
19:50 🔗 godane those are now uploaded
19:50 🔗 tobbez Also, if the sample I got was representative (i.e. all 403s), the project should be put on hold again (it's unclear why it's running again)
19:53 🔗 tobbez huh, that url format change doesn't seem to be global... e.g. http://monsterkidclassichorrorforum.yuku.com/ still uses the old style
19:53 🔗 godane i'm uploading 3089 videos missing from reuters.com video 2009 uploads
19:53 🔗 tobbez ¯\_(ツ)_/¯
19:53 🔗 godane alot are is missing from 2009-09 to 2009-12
19:54 🔗 godane in a bit of weirdness 2009-02 items are all fine
19:54 🔗 godane no missing files there
20:32 🔗 TheLovina has joined #archiveteam-bs
20:32 🔗 Whopper has quit IRC (Read error: Connection reset by peer)
20:38 🔗 godane has quit IRC (Read error: Operation timed out)
20:45 🔗 godane has joined #archiveteam-bs
21:28 🔗 arkiver tobbez: we have a project for yuku
21:28 🔗 arkiver we can just load more items
21:29 🔗 arkiver huh
21:29 🔗 arkiver I see many projects have been removed from the tracker
21:30 🔗 arkiver who started yuku? I'm not sure if it was ready to be restarted
21:31 🔗 arkiver what projects have been removed from the tracker now??
21:33 🔗 arkiver GLaDOS: see above ^
21:33 🔗 arkiver was the yuku project tested properly before being restarted?
21:33 🔗 arkiver it was not run for quite some time, the website might have undergone some changes
21:34 🔗 arkiver yuku banned our useragent
21:34 🔗 arkiver the project is paused again
21:39 🔗 arkiver I'll check yuku and see if other stuff changed that needs editing of the project
21:40 🔗 arkiver also working on dayviews project, will be here https://github.com/ArchiveTeam/dayviews-grab
21:52 🔗 schbirid2 has joined #archiveteam-bs
21:55 🔗 sep332 is now known as sep332_
21:56 🔗 schbirid has quit IRC (Read error: Operation timed out)
22:40 🔗 username1 has joined #archiveteam-bs
22:43 🔗 schbirid2 has quit IRC (Read error: Operation timed out)
22:45 🔗 username1 has quit IRC (Read error: Operation timed out)
22:49 🔗 schbirid has joined #archiveteam-bs
22:57 🔗 schbirid2 has joined #archiveteam-bs
22:59 🔗 schbirid has quit IRC (Read error: Operation timed out)
23:07 🔗 Odd0002 has joined #archiveteam-bs
23:22 🔗 username1 has joined #archiveteam-bs
23:24 🔗 schbirid2 has quit IRC (Read error: Operation timed out)
23:32 🔗 kristian_ has joined #archiveteam-bs

irclogger-viewer