#archiveteam-bs 2017-07-28,Fri


Who What When
***w0rp has joined #archiveteam-bs [00:01]
.... (idle for 16mn)
ZexaronS has quit IRC (Quit: Leaving) [00:17]
...... (idle for 28mn)
Ravenloft has quit IRC (Ping timeout: 260 seconds) [00:45]
dashcloud has quit IRC (Read error: Operation timed out)
BlueMaxim has joined #archiveteam-bs
dashcloud has joined #archiveteam-bs
zyphlar has joined #archiveteam-bs
[00:55]
ZexaronS has joined #archiveteam-bs [01:05]
............. (idle for 1h2mn)
j08nY has quit IRC (Quit: Leaving) [02:07]
........ (idle for 35mn)
pizzaiolo has quit IRC (pizzaiolo) [02:42]
............. (idle for 1h4mn)
ZexaronS has quit IRC (Leaving) [03:46]
<godane> looks like my script a long time ago didn't upload a lot of reuters.com videos [03:58]
***qw3rty6 has joined #archiveteam-bs [03:58]
<godane> i downloaded the download pages to grab a list of items to check if they were all uploaded, and it turns out they're not
there is about 5gb of video not uploaded for 2008 alone
[04:00]
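The check godane describes (compare the list of items scraped from the download pages against what was actually uploaded) can be sketched as a set difference. This is only an illustration, not godane's script: the function name `find_missing` and the sample filenames are hypothetical.

```python
def find_missing(expected, uploaded):
    """Return the expected item names that do not appear in the uploaded list."""
    uploaded_set = set(uploaded)
    # De-duplicate the expected names and sort so the report is stable.
    return sorted(name for name in set(expected) if name not in uploaded_set)


if __name__ == "__main__":
    # Hypothetical filenames, for illustration only.
    expected = ["clip-001.flv", "clip-002.flv", "clip-003.flv"]
    uploaded = ["clip-002.flv"]
    for name in find_missing(expected, uploaded):
        print(name)
```

The resulting list is what would need to be re-uploaded.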
***qw3rty5 has quit IRC (Read error: Operation timed out) [04:03]
..... (idle for 21mn)
dashcloud has quit IRC (Read error: Connection reset by peer) [04:24]
dashcloud has joined #archiveteam-bs [04:32]
Sk1d has quit IRC (Ping timeout: 250 seconds) [04:45]
Sk1d has joined #archiveteam-bs [04:52]
Meroje has quit IRC (Ping timeout: 260 seconds)
BnAboyZ66 has quit IRC (Ping timeout: 260 seconds)
Meroje has joined #archiveteam-bs
mundus201 is now known as mundus
[05:01]
.... (idle for 19mn)
ld1 has quit IRC (Ping timeout: 260 seconds)
ld1 has joined #archiveteam-bs
svchfoo1 has quit IRC (Quit: Closing)
[05:24]
.... (idle for 15mn)
godane has left
godane has joined #archiveteam-bs
Stiletti is now known as Stiletto
[05:40]
....... (idle for 32mn)
ld1 has quit IRC (Ping timeout: 260 seconds)
ld1 has joined #archiveteam-bs
[06:13]
......... (idle for 42mn)
kristian_ has joined #archiveteam-bs [06:56]
.... (idle for 19mn)
pikhq has quit IRC (Ping timeout: 268 seconds) [07:15]
................. (idle for 1h24mn)
chazchaz_ has quit IRC (Read error: Operation timed out)
dxrt- has quit IRC (Read error: Operation timed out)
espes__ has quit IRC (Ping timeout: 268 seconds)
chazchaz has joined #archiveteam-bs
dxrt- has joined #archiveteam-bs
espes__ has joined #archiveteam-bs
kristian_ has quit IRC (Quit: Leaving)
Honno has quit IRC (Read error: Operation timed out)
[08:39]
................. (idle for 1h24mn)
j08nY has joined #archiveteam-bs [10:19]
<t2t2> yuku returning 403 for warrior user-agent, firefox ok. [10:28]
<GLaDOS> that's rude [10:29]
***BlueMaxim has quit IRC (Quit: Leaving) [10:32]
<joepie91> I guess we're Firefox now? :p [10:33]
<GLaDOS> should we make some random useragent generator?
as in, random per-client, perhaps based on reported nickname
[10:42]
***t2t2 has quit IRC (Quit: "goodbye uptime") [10:53]
<wp494> just hash it [10:55]
***tuluu has joined #archiveteam-bs
pizzaiolo has joined #archiveteam-bs
pikhq has joined #archiveteam-bs
[10:55]
BartoCH has joined #archiveteam-bs
jspiros has quit IRC (leaving)
RichardG has quit IRC (Ping timeout: 260 seconds)
jspiros has joined #archiveteam-bs
[11:06]
........ (idle for 39mn)
<SketchCow> GLaDOS: My opinion is an optional random user-agent generator is not a bad feature to have that can be implemented if needed.
But not as default behavior
[11:49]
<GLaDOS> yeah, definitely optional [11:58]
<midas> we had the same issue with soundcloud [11:58]
<GLaDOS> from a site perspective, it might be best to base it on the public IP
maybe that hashed together with the username
although the best way for the pipeline to get its public IP would have to be figured out
perhaps the tracker reports it back when you retrieve a job?
[12:01]
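A minimal sketch of the idea discussed above: pick a user-agent deterministically per client by hashing the public IP together with the nickname. Everything here is an assumption for illustration: the function name `pick_user_agent`, the pool of user-agent strings, and the way the public IP is obtained (the log leaves that open) are not part of any existing warrior or pipeline code.

```python
import hashlib

# Hypothetical pool of browser-like user-agent strings (illustrative only).
USER_AGENTS = [
    "Mozilla/5.0 (X11; Linux x86_64; rv:54.0) Gecko/20100101 Firefox/54.0",
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:54.0) Gecko/20100101 Firefox/54.0",
    "Mozilla/5.0 (Macintosh; Intel Mac OS X 10.12; rv:54.0) Gecko/20100101 Firefox/54.0",
]


def pick_user_agent(nickname, public_ip="", pool=USER_AGENTS):
    """Map (public_ip, nickname) to one entry of the pool, stably across runs."""
    key = f"{public_ip}|{nickname}".encode("utf-8")
    digest = hashlib.sha256(key).digest()
    # Use the first 8 bytes of the hash as an integer index into the pool.
    index = int.from_bytes(digest[:8], "big") % len(pool)
    return pool[index]


if __name__ == "__main__":
    # The same client always gets the same user-agent.
    print(pick_user_agent("example-nick", "203.0.113.7"))
```

Because the choice depends only on the inputs, a given client keeps presenting the same user-agent between jobs, while different clients spread across the pool. In line with the discussion, this would be opt-in rather than default behavior.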
...... (idle for 28mn)
***quantum has joined #archiveteam-bs
godane has quit IRC (Read error: Operation timed out)
[12:31]
........ (idle for 36mn)
t2t2 has joined #archiveteam-bs [13:10]
....... (idle for 30mn)
sep332 has quit IRC (Quit: konversation out)
sep332 has joined #archiveteam-bs
[13:40]
quantum has quit IRC (Ping timeout: 268 seconds) [13:55]
godane has joined #archiveteam-bs [14:05]
............ (idle for 57mn)
RichardG has joined #archiveteam-bs [15:02]
.... (idle for 16mn)
pikhq has quit IRC (Read error: Operation timed out)
schbirid has joined #archiveteam-bs
pikhq has joined #archiveteam-bs
[15:18]
.... (idle for 17mn)
pikhq has quit IRC (Ping timeout: 268 seconds) [15:42]
pikhq has joined #archiveteam-bs [15:48]
...... (idle for 27mn)
<godane> SketchCow: i'm uploading HeroesRebornNBC youtube channel on to FOS
it will be in Dead-Youtube-Channels
there are only 2 videos on the channel now
but i got 72 videos from it in the past
[16:15]
...................... (idle for 1h49mn)
***dashcloud has quit IRC (Read error: Operation timed out) [18:06]
.... (idle for 19mn)
j08nY has quit IRC (Quit: Leaving)
dashcloud has joined #archiveteam-bs
Soni has quit IRC (Ping timeout: 272 seconds)
[18:25]
<schbirid> http://libgen.io/robots.txt [18:32]
<xmc> huh [18:33]
***Soni has joined #archiveteam-bs [18:34]
<schbirid> http://gen.lib.rus.ec/robots.txt too
no idea if new
https://www.reddit.com/r/Scholar/comments/6puywe/meta_libgen_article_repository_is_down/
[18:34]
no one seeding https://thepiratebay.org/torrent/11674459/The+Library+Genesis+SciMag+Repository+2015-01-31+%28torrents+only%29 :(
some http://torrentproject.se/?t=scimag
[18:42]
***fie has quit IRC (Ping timeout: 268 seconds) [18:51]
.... (idle for 19mn)
<mundus> Someone wanna update the current running warrior project to yuku? [19:10]
<t2t2> mundus: it's not returning 403 for every request anymore? [19:24]
<mundus> what?
It's just the active project
[19:25]
***ItsYoda has quit IRC (Quit: rippppp to the yoda you used to know!)
ItsYoda has joined #archiveteam-bs
zino has quit IRC (Quit: Leaving)
Whopper has quit IRC (Read error: Operation timed out)
[19:25]
<tobbez> Just tried starting it, got one item. All the fetches for it returned 403. Opening one of those urls manually gives a phpbb sql error: "You have an error in your SQL syntax; check the manual that corresponds to your MariaDB server version for the right syntax to use near 'RELECT * FROM conservativeallies_users WHERE user_id = 0' at line 1 [1064]" [19:32]
<hook54321> Fyi, https://www.reddit.com/r/opiaterollcall/ was recently banned, but Google still has some cached pages. [19:34]
***Whopper has joined #archiveteam-bs [19:38]
Honno has joined #archiveteam-bs [19:49]
<tobbez> Re: yuku, I obviously can't be certain (since I don't know what the topic of a given thread is supposed to be and so can't verify it's still the same content), but it seems the urls to access threads were changed from e.g. http://conservativeallies.yuku.com/topic/9280/ to http://conservativeallies.yuku.com/slug-usually-goes-here-t9280.html (the minimal fake slug you can get away with would be
.../-t9280.html) (if you want the canonical url you'll have to extract it from the fetched page)
[19:49]
<godane> so 1154 flv files were missing in reuters.com video 2008 uploads
those are now uploaded
[19:49]
<tobbez> Also, if the sample I got was representative (i.e. all 403s), the project should be put on hold again (it's unclear why it's running again)
huh, that url format change doesn't seem to be global... e.g. http://monsterkidclassichorrorforum.yuku.com/ still uses the old style
[19:50]
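The URL change tobbez describes can be sketched as a rewrite from the old `/topic/<id>/` form to the minimal fake-slug form `/-t<id>.html`. This is only an illustration under the assumptions stated in the log: the helper name `to_new_style` is hypothetical, the change is reported as not global (some boards still use the old style), and the canonical URL would still have to be extracted from the fetched page.

```python
import re

# Matches old-style thread URLs such as http://<board>.yuku.com/topic/9280/
OLD_TOPIC_URL = re.compile(r"^(https?://[^/]+\.yuku\.com)/topic/(\d+)/?$")


def to_new_style(url):
    """Rewrite an old-style topic URL to the minimal fake-slug form.

    Returns None when the URL is not an old-style topic URL.
    """
    match = OLD_TOPIC_URL.match(url)
    if match is None:
        return None
    host, topic_id = match.groups()
    return f"{host}/-t{topic_id}.html"


if __name__ == "__main__":
    print(to_new_style("http://conservativeallies.yuku.com/topic/9280/"))
```

Since boards such as monsterkidclassichorrorforum.yuku.com reportedly still use the old style, a grab would probably need to try both forms rather than rewrite unconditionally.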
<godane> i'm uploading 3089 videos missing from reuters.com video 2009 uploads [19:53]
<tobbez> ¯\_(ツ)_/¯ [19:53]
<godane> a lot are missing from 2009-09 to 2009-12
in a bit of weirdness 2009-02 items are all fine
no missing files there
[19:53]
........ (idle for 38mn)
***TheLovina has joined #archiveteam-bs
Whopper has quit IRC (Read error: Connection reset by peer)
[20:32]
godane has quit IRC (Read error: Operation timed out) [20:38]
godane has joined #archiveteam-bs [20:45]
......... (idle for 43mn)
<arkiver> tobbez: we have a project for yuku
we can just load more items
huh
I see many projects have been removed from the tracker
who started yuku? I'm not sure if it was ready to be restarted
what projects have been removed from the tracker now??
GLaDOS: see above ^
was the yuku project tested properly before being restarted?
it was not run for quite some time, the website might have undergone some changes
yuku banned our useragent
the project is paused again
[21:28]
I'll check yuku and see if other stuff changed that needs editing of the project
also working on dayviews project, will be here https://github.com/ArchiveTeam/dayviews-grab
[21:39]
***schbirid2 has joined #archiveteam-bs
sep332 is now known as sep332_
schbirid has quit IRC (Read error: Operation timed out)
[21:52]
......... (idle for 44mn)
username1 has joined #archiveteam-bs
schbirid2 has quit IRC (Read error: Operation timed out)
username1 has quit IRC (Read error: Operation timed out)
schbirid has joined #archiveteam-bs
[22:40]
schbirid2 has joined #archiveteam-bs
schbirid has quit IRC (Read error: Operation timed out)
[22:57]
Odd0002 has joined #archiveteam-bs [23:07]
.... (idle for 15mn)
username1 has joined #archiveteam-bs
schbirid2 has quit IRC (Read error: Operation timed out)
[23:22]
kristian_ has joined #archiveteam-bs [23:32]
