Commit graph

14 commits

Author SHA1 Message Date
Thibaut Courouble
9e37ff5c8a Add instrumentation for slow response processing 2018-11-25 13:17:32 -05:00
Thibaut Courouble
0095779b3f Instrument FileScraper read errors rather than use puts 2018-03-25 16:05:24 -04:00
Thibaut Courouble
865efea85c Tweak image max size 2017-07-16 17:43:14 -04:00
Thibaut Courouble
62c8cb8be4 Log size difference of index.json and db.json 2017-07-16 17:43:14 -04:00
Thibaut Courouble
a6855329e8 Add image scraping and optimization filter
Rel: #633
2017-07-16 15:41:25 -04:00
Thibaut Courouble
568ecbd533 Log size of index.json and db.json 2017-07-02 19:42:06 -04:00
Thibaut Courouble
ee21c0bf29 Add frozen_string_literal to subscriber files" 2017-07-02 19:26:00 -04:00
Thibaut Courouble
64eb1c86fc Log errors instead of exiting when scraping docs 2016-09-05 15:21:20 -04:00
Thibaut
018628ea7d Add two-pass redirection rewriter
... to avoid having to maintain huge lists of redirects. This works by doing a first pass to detect which internal URL is redirected where, before doing a second (normal) pass that rewrites all these URLs (links) with their final destination. There's a bit of monkey-patching I'm not proud of, but this works(tm).
2015-04-05 17:46:07 -04:00
Thibaut
5c4c1ce2b6 Log entries/types/files diff in docs:generate command 2015-03-22 12:11:40 -04:00
Thibaut
bcd4a5b522 Use String#remove 2014-04-19 12:11:24 -04:00
Thibaut
864188e24c Use String#sub instead of String#gsub when possible 2013-12-13 11:21:38 +00:00
Thibaut
b92db88506 Refactor Docs::Scraper 2013-12-12 11:17:55 +00:00
Thibaut
18986c1814 Going open source 2013-10-24 20:49:33 +02:00