2013-03-17 18:16:25 -04:00
|
|
|
Miniflux - Minimalist News Reader
|
2013-02-17 21:48:21 -05:00
|
|
|
=================================
|
|
|
|
|
|
|
|
Miniflux is a minimalist web-based news reader.
|
|
|
|
|
|
|
|
Features
|
|
|
|
--------
|
|
|
|
|
|
|
|
- Host anywhere (shared hosting, vps or localhost)
|
|
|
|
- Easy setup => copy and paste and you are done!
|
|
|
|
- CSS optimized for readability
|
2013-07-04 13:01:14 +02:00
|
|
|
- Keeps history of read items
|
2013-02-17 21:48:21 -05:00
|
|
|
- Remove Feedburner Ads and analytics trackers
|
2013-07-04 13:01:14 +02:00
|
|
|
- Import/Export of OPML feeds
|
|
|
|
- Feed updates via a cronjob or with the user interface with one click
|
2013-02-17 21:48:21 -05:00
|
|
|
- Protected by a login/password (only one possible user)
|
2013-09-08 18:29:27 -04:00
|
|
|
- Use secure headers (only external images and Youtube/Vimeo/Dailymotion videos are allowed)
|
2013-02-24 14:09:16 -05:00
|
|
|
- Open external links inside a new tab with a `rel="noreferrer"` attribute
|
2013-03-17 18:16:25 -04:00
|
|
|
- Mobile CSS (responsive design)
|
2013-07-18 18:49:40 +01:00
|
|
|
- Keyboard shortcuts (pressing '?' displays a pop-up listing the shortcuts; pressing 'q' closes it)
|
2013-06-14 23:22:22 -04:00
|
|
|
- Basic bookmarks
|
2013-10-01 18:47:33 -04:00
|
|
|
- Translated in English, French, German, Italian, Czech, Spanish, Portuguese and Simplified Chinese
|
2013-09-08 18:29:27 -04:00
|
|
|
- Themes support
|
|
|
|
- Alternative login with a Google Account or Mozilla Persona
|
|
|
|
- **Full article download for feeds that display only a summary** (website scraper based on Xpath rules)
|
2013-02-17 21:48:21 -05:00
|
|
|
|
2013-04-05 21:52:58 -04:00
|
|
|
Todo and known bugs
|
|
|
|
-------------------
|
2013-02-17 21:48:21 -05:00
|
|
|
|
2013-04-05 21:52:58 -04:00
|
|
|
- See Issues: <https://github.com/fguillot/miniflux/issues>
|
2013-02-17 21:48:21 -05:00
|
|
|
|
2013-03-17 18:30:07 -04:00
|
|
|
License
|
|
|
|
-------
|
|
|
|
|
|
|
|
- AGPL: <http://www.gnu.org/licenses/agpl-3.0.txt>
|
|
|
|
|
2013-04-05 21:58:17 -04:00
|
|
|
Authors
|
|
|
|
-------
|
|
|
|
|
2013-07-08 18:31:25 -04:00
|
|
|
Original author: [Frédéric Guillot](http://fredericguillot.com/)
|
|
|
|
|
|
|
|
### Contributors
|
|
|
|
|
2013-09-19 17:54:50 -04:00
|
|
|
People who sent a pull-request, report a bug, make a new theme or share a super cool idea:
|
2013-07-08 18:31:25 -04:00
|
|
|
|
|
|
|
- André Kelpe: https://github.com/fs111
|
|
|
|
- Ayodio: https://github.com/ayodio
|
2013-08-04 14:26:46 -04:00
|
|
|
- Bjauy: https://github.com/bjauy
|
2013-09-18 21:02:46 -04:00
|
|
|
- Bohwaz: https://github.com/bohwaz
|
2013-08-18 12:28:15 -04:00
|
|
|
- Chase Arnold: https://github.com/chase4926
|
2013-07-19 21:01:55 -04:00
|
|
|
- Chris Lemonier: https://github.com/chrislemonier
|
2013-11-22 22:49:52 -05:00
|
|
|
- Delehef: https://github.com/delehef
|
2013-07-08 18:31:25 -04:00
|
|
|
- Derjus: https://github.com/derjus
|
|
|
|
- Eauland: https://github.com/eauland
|
|
|
|
- Félix: https://github.com/dysosmus
|
2013-10-01 18:47:33 -04:00
|
|
|
- Geriel Castro: https://github.com/GerielCastro
|
2013-07-08 18:31:25 -04:00
|
|
|
- Horsely: https://github.com/horsley
|
2013-07-14 12:35:11 -04:00
|
|
|
- Ing. Jan Kaláb: https://github.com/Pitel
|
2013-09-18 21:02:46 -04:00
|
|
|
- Itoine: https://github.com/itoine
|
2013-07-21 11:11:41 -04:00
|
|
|
- James Scott-Brown: https://github.com/jamesscottbrown
|
2013-07-08 18:31:25 -04:00
|
|
|
- Luca Marra: https://github.com/facciocose
|
2013-07-20 09:10:17 -04:00
|
|
|
- Maxime: https://github.com/EpocDotFr
|
2013-07-08 18:31:25 -04:00
|
|
|
- MonsieurPaulLeBoulanger: https://github.com/MonsieurPaulLeBoulanger
|
|
|
|
- Necku: https://github.com/Necku
|
2013-11-11 20:38:54 -05:00
|
|
|
- Nicolas Dewaele: http://adminrezo.fr/
|
2013-09-18 19:55:59 -04:00
|
|
|
- Silvus: https://github.com/Silvus
|
2013-12-17 21:45:03 -05:00
|
|
|
- Skasi7: https://github.com/skasi7
|
2013-07-08 18:31:25 -04:00
|
|
|
- Thiriot Christophe: https://github.com/doubleface
|
2013-09-30 21:30:23 -04:00
|
|
|
- Vincent Ozanam
|
2013-07-08 18:31:25 -04:00
|
|
|
- Ygbillet: https://github.com/ygbillet
|
|
|
|
|
2013-07-08 18:40:19 -04:00
|
|
|
PS: Many people sent a bug report too (see [issues tracker](https://github.com/fguillot/miniflux/issues))
|
2013-07-08 18:39:07 -04:00
|
|
|
|
|
|
|
Roadmap
|
|
|
|
-------
|
|
|
|
|
|
|
|
- http://miniflux.net/roadmap.html
|
|
|
|
|
|
|
|
ChangeLog
|
2013-07-08 18:40:19 -04:00
|
|
|
---------
|
2013-07-08 18:39:07 -04:00
|
|
|
|
|
|
|
- http://miniflux.net/changes.html
|
2013-04-05 21:58:17 -04:00
|
|
|
|
2013-02-17 21:48:21 -05:00
|
|
|
Requirements
|
|
|
|
------------
|
|
|
|
|
2014-01-05 13:28:38 -05:00
|
|
|
- Recent version of libxml2 >= 2.7.x (version 2.6.32 on Debian Lenny is not supported anymore)
|
2013-03-20 20:26:28 -04:00
|
|
|
- PHP >= 5.3.7
|
2013-03-17 18:16:25 -04:00
|
|
|
- PHP XML extensions (SimpleXML, DOM...)
|
2013-04-05 21:52:58 -04:00
|
|
|
- PHP Sqlite extension
|
2014-01-05 13:28:38 -05:00
|
|
|
- cURL extension for PHP or stream context with (`allow_url_fopen=On`)
|
|
|
|
- Short tags enabled for PHP < 5.4
|
2013-02-17 21:48:21 -05:00
|
|
|
|
2013-03-17 18:16:25 -04:00
|
|
|
Libraries used
|
|
|
|
--------------
|
2013-02-17 21:48:21 -05:00
|
|
|
|
|
|
|
- [PicoFeed](https://github.com/fguillot/picoFeed)
|
|
|
|
- [PicoFarad](https://github.com/fguillot/picoFarad)
|
|
|
|
- [PicoTools](https://github.com/fguillot/picoTools)
|
|
|
|
- [PicoDb](https://github.com/fguillot/picoDb)
|
|
|
|
- [SimpleValidator](https://github.com/fguillot/simpleValidator)
|
2013-03-20 20:26:28 -04:00
|
|
|
- [PHP 5.5 password backport](https://github.com/ircmaxell/password_compat)
|
2013-02-17 21:48:21 -05:00
|
|
|
|
|
|
|
Installation
|
|
|
|
------------
|
|
|
|
|
2014-01-05 13:28:38 -05:00
|
|
|
From the archive:
|
|
|
|
|
2013-03-20 20:26:28 -04:00
|
|
|
1. You must have a web server with PHP installed (version 5.3.7 minimum) with the Sqlite and XML extensions
|
2013-07-04 13:01:14 +02:00
|
|
|
2. Download the source code and copy the directory `miniflux` where you want
|
|
|
|
3. Check if the directory `data` is writeable (Miniflux stores everything inside a Sqlite database)
|
2013-03-17 18:16:25 -04:00
|
|
|
4. With your browser go to <http://yourpersonalserver/miniflux>
|
2013-03-21 19:57:47 -04:00
|
|
|
5. The default login and password is **admin/admin**
|
2013-03-17 18:16:25 -04:00
|
|
|
6. Start to use the software
|
2014-01-05 13:28:38 -05:00
|
|
|
7. Don't forget to change your password!
|
|
|
|
|
|
|
|
From the repository:
|
|
|
|
|
|
|
|
1. `git clone https://github.com/fguillot/miniflux.git`
|
2014-01-06 21:58:30 -05:00
|
|
|
2. Go to the third step just above
|
2014-01-05 13:28:38 -05:00
|
|
|
|
|
|
|
Update
|
|
|
|
------
|
|
|
|
|
|
|
|
From the archive:
|
|
|
|
|
|
|
|
1. Close your session (logout)
|
|
|
|
2. Rename your actual miniflux directory (to keep a backup)
|
|
|
|
3. Uncompress the new archive and copy your database file `db.sqlite` in the directory `data`
|
|
|
|
4. Make the directory `data` writeable by the web server user
|
|
|
|
5. Login and check if everything is ok
|
|
|
|
6. Remove the old miniflux directory
|
|
|
|
|
|
|
|
From the repository:
|
|
|
|
|
|
|
|
1. Close your session (logout)
|
|
|
|
2. `git pull`
|
|
|
|
3. Login and check if everything is ok
|
|
|
|
|
|
|
|
Security
|
|
|
|
--------
|
|
|
|
|
|
|
|
- Don't forget to change the default user/password
|
|
|
|
- Don't allow everybody to access to the directory `data` from the URL. There is already a `.htaccess` for Apache but nothing for Nginx.
|
2013-03-17 18:16:25 -04:00
|
|
|
|
|
|
|
FAQ
|
|
|
|
----
|
|
|
|
|
2013-07-04 13:01:14 +02:00
|
|
|
### How do I update my feeds with a cronjob?
|
2013-03-17 18:16:25 -04:00
|
|
|
|
2013-05-21 12:25:13 +02:00
|
|
|
You just need to be inside the directory `miniflux` and run the script `cronjob.php`.
|
|
|
|
|
|
|
|
Parameters | Type | Value
|
|
|
|
--------------------|--------------------------------|-----------------------------
|
2013-05-21 19:18:41 +01:00
|
|
|
--limit | optional | number of feeds
|
2013-05-21 12:25:13 +02:00
|
|
|
--call-interval | optional, excluded by --limit, require --update-interval | time in minutes < update interval time
|
|
|
|
--update-interval | optional, excluded by --limit, require --call-interval | time in minutes >= call interval time
|
2013-03-17 18:16:25 -04:00
|
|
|
|
2013-05-18 20:35:16 +02:00
|
|
|
|
|
|
|
Examples:
|
2013-03-17 18:16:25 -04:00
|
|
|
|
|
|
|
crontab -e
|
|
|
|
|
2013-05-21 12:25:13 +02:00
|
|
|
# Update all feeds
|
2013-03-17 18:16:25 -04:00
|
|
|
0 */4 * * * cd /path/to/miniflux && php cronjob.php >/dev/null 2>&1
|
2013-05-18 20:35:16 +02:00
|
|
|
|
2013-05-21 12:25:13 +02:00
|
|
|
# Update the 10 oldest feeds each time
|
2013-05-18 20:35:16 +02:00
|
|
|
0 */4 * * * cd /path/to/miniflux && php cronjob.php --limit=10 >/dev/null 2>&1
|
|
|
|
|
2013-05-21 12:25:13 +02:00
|
|
|
# Update all feeds in 60 minutes (updates the 8 oldest feeds each time with a total of 120 feeds).
|
|
|
|
* */4 * * * cd /path/to/miniflux && php cronjob.php --call-interval=4 --update-interval=60 >/dev/null 2>&1
|
2013-03-17 18:16:25 -04:00
|
|
|
|
2013-05-22 13:12:58 +02:00
|
|
|
Note: cronjob.php can also be called from the web; in this case specify the options as GET variables.
|
|
|
|
Example: <http://yourpersonalserver/miniflux/cronjob.php?call-interval=4&update-interval=60>
|
2013-05-21 19:38:09 +01:00
|
|
|
|
2013-07-04 13:01:14 +02:00
|
|
|
### How does Miniflux update my feeds from the user interface?
|
2013-03-17 18:16:25 -04:00
|
|
|
|
2013-07-04 13:01:14 +02:00
|
|
|
Miniflux uses an Ajax request to refresh each subscription.
|
2013-03-17 18:16:25 -04:00
|
|
|
By default, there is only 5 feeds updated in parallel.
|
|
|
|
|
2013-07-04 13:01:14 +02:00
|
|
|
### I have 600 subscriptions, can Miniflux handle that?
|
2013-03-17 18:16:25 -04:00
|
|
|
|
|
|
|
Your life is cluttered.
|
2013-03-20 20:26:28 -04:00
|
|
|
|
2013-07-04 13:02:12 +02:00
|
|
|
### Why are there no categories? Why is feature X missing?
|
2013-03-20 20:26:28 -04:00
|
|
|
|
|
|
|
Miniflux is a minimalist software. Less is more.
|
|
|
|
|
|
|
|
### I found a bug, what next?
|
|
|
|
|
2013-03-21 19:57:47 -04:00
|
|
|
Report the bug to the [issues tracker](https://github.com/fguillot/miniflux/issues) and I will fix it.
|
|
|
|
|
|
|
|
You can report feeds that doesn't works properly too.
|
|
|
|
|
|
|
|
### Which browser is compatible with Miniflux?
|
|
|
|
|
2013-07-04 13:01:14 +02:00
|
|
|
Miniflux is tested with the latest versions of Mozilla Firefox, Google Chrome and Safari.
|
2013-03-21 19:57:47 -04:00
|
|
|
|
2013-08-17 17:47:18 -04:00
|
|
|
I don't use Microsoft products, and as such I have no idea if Miniflux works correctly with Internet Explorer.
|
2013-07-06 12:11:29 -04:00
|
|
|
|
2013-08-17 17:47:18 -04:00
|
|
|
### How do I override application variables?
|
2013-07-06 12:11:29 -04:00
|
|
|
|
2013-08-17 17:47:18 -04:00
|
|
|
There are few settings that can't be changed by the user interface.
|
2013-07-06 12:11:29 -04:00
|
|
|
These parameters are defined with PHP constants.
|
|
|
|
|
2013-08-17 17:47:18 -04:00
|
|
|
To override them, create a `config.php` file at the root of the project and change the values.
|
2013-07-06 12:11:29 -04:00
|
|
|
|
|
|
|
By example, to override the default HTTP timeout value:
|
|
|
|
|
|
|
|
<?php
|
|
|
|
|
|
|
|
// My specific HTTP timeout (5 seconds)
|
|
|
|
define('HTTP_TIMEOUT', 5);
|
|
|
|
|
2013-07-28 17:53:17 -04:00
|
|
|
PS: This file must be a PHP file (nothing before the open tag `<?php`).
|
|
|
|
|
2013-07-06 12:11:29 -04:00
|
|
|
Actually, the following constants can be overrided:
|
|
|
|
|
|
|
|
- `HTTP_TIMEOUT` => default value is 10 seconds
|
|
|
|
- `APP_VERSION` => default value is master
|
|
|
|
- `DB_FILENAME` => default value is `data/db.sqlite`
|
2013-08-29 19:34:11 -04:00
|
|
|
- `DEBUG` => default is true (enable logging of PicoFeed)
|
|
|
|
- `DEBUG_FILENAME` => default is `data/debug.log`
|
2013-07-16 21:58:11 -04:00
|
|
|
- `THEME_DIRECTORY` => default is themes
|
2013-07-23 18:26:53 -04:00
|
|
|
- `SESSION_SAVE_PATH` => default is empty (used to store session files in a custom directory)
|
2013-09-23 19:22:13 -04:00
|
|
|
- `PROXY_HOSTNAME` => default is empty (make HTTP requests through a HTTP proxy if set)
|
|
|
|
- `PROXY_PORT` => default is 3128 (default port of Squid)
|
|
|
|
- `PROXY_USERNAME` => default is empty (set the proxy username is needed)
|
|
|
|
- `PROXY_PASSWORD` => default is empty
|
2013-07-23 18:26:53 -04:00
|
|
|
|
2013-07-26 21:00:39 -04:00
|
|
|
### How to change the session save path?
|
2013-07-23 18:26:53 -04:00
|
|
|
|
|
|
|
With several shared hosting providers, sessions are cleaned frequently, to avoid to login too often,
|
|
|
|
you can save sessions in a custom directory.
|
|
|
|
|
|
|
|
- Create a directory, by example `sessions`
|
|
|
|
- This directory must be writeable by the web server user
|
|
|
|
- This directory must NOT be accessible from the outside world (add a `.htaccess` if necessary)
|
|
|
|
- Override the application variable like described above: `define('SESSION_SAVE_PATH', 'sessions');`
|
|
|
|
- Now, your sessions are saved in the directory `sessions`
|
2013-07-16 21:58:11 -04:00
|
|
|
|
2013-07-28 17:53:17 -04:00
|
|
|
### How to override/extends the content filtering blacklist/whitelist?
|
|
|
|
|
|
|
|
Miniflux use [PicoFeed](https://github.com/fguillot/picoFeed) to parse the content of each item.
|
|
|
|
These variables are public static arrays, extends the actual array or replace it.
|
|
|
|
|
|
|
|
**Be careful, you can break everything by doing that!!!**
|
|
|
|
|
|
|
|
Put your modifications in your custom `config.php` like described above.
|
|
|
|
|
|
|
|
By example to add a new iframe whitelist:
|
|
|
|
|
|
|
|
\PicoFeed\Filter::$iframe_whitelist[] = 'http://www.kickstarter.com';
|
|
|
|
|
|
|
|
Or to replace the entire whitelist:
|
|
|
|
|
|
|
|
\PicoFeed\Filter::$iframe_whitelist = array('http://www.kickstarter.com');
|
|
|
|
|
|
|
|
Available variables:
|
|
|
|
|
|
|
|
// Allow only specified tags and attributes
|
|
|
|
\PicoFeed\Filter::$whitelist_tags
|
|
|
|
|
|
|
|
// Strip content of these tags
|
|
|
|
\PicoFeed\Filter::$blacklist_tags
|
|
|
|
|
|
|
|
// Allow only specified URI scheme
|
|
|
|
\PicoFeed\Filter::$whitelist_scheme
|
|
|
|
|
|
|
|
// List of attributes used for external resources: src and href
|
|
|
|
\PicoFeed\Filter::$media_attributes
|
|
|
|
|
|
|
|
// Blacklist of external resources
|
|
|
|
\PicoFeed\Filter::$media_blacklist
|
|
|
|
|
|
|
|
// Required attributes for tags, if the attribute is missing the tag is dropped
|
|
|
|
\PicoFeed\Filter::$required_attributes
|
|
|
|
|
|
|
|
// Add attribute to specified tags
|
|
|
|
\PicoFeed\Filter::$add_attributes
|
|
|
|
|
|
|
|
// Attributes that must be integer
|
|
|
|
\PicoFeed\Filter::$integer_attributes
|
|
|
|
|
|
|
|
// Iframe allowed source
|
|
|
|
\PicoFeed\Filter::$iframe_whitelist
|
|
|
|
|
|
|
|
For more details, have a look to the class `vendor/PicoFeed/Filter.php`.
|
|
|
|
|
2013-07-29 22:33:01 -04:00
|
|
|
### Where is the API documentation?
|
|
|
|
|
|
|
|
<http://miniflux.net/api.html>
|
|
|
|
|
2013-07-16 21:58:11 -04:00
|
|
|
### How to create a theme for Miniflux?
|
|
|
|
|
|
|
|
It's very easy to write a custom theme for Miniflux.
|
|
|
|
|
|
|
|
A theme is just a CSS file, images and fonts.
|
|
|
|
A theme doesn't change the behaviour of the application but only the page design.
|
|
|
|
|
|
|
|
The first step is to create a new directory structure for your theme:
|
|
|
|
|
|
|
|
mkdir -p themes/mysuperskin/{css,img,fonts}
|
|
|
|
|
|
|
|
The name of your theme should be only alphanumeric.
|
|
|
|
There is the following directories inside your theme:
|
|
|
|
|
|
|
|
- `css`: Your stylesheet, the file must be named `app.css` (required)
|
|
|
|
- `img`: Theme images (not required)
|
|
|
|
- `fonts`: Theme fonts (not required)
|
|
|
|
|
|
|
|
For a very basic theme example, have a look to the directory `examples\mytheme`.
|
|
|
|
|
|
|
|
Miniflux use responsive design, so it's better if your theme can handle mobile devices.
|
|
|
|
|
|
|
|
If you write a very cool theme for Miniflux, **send me your code to be available in the default installation!**
|
2013-07-19 21:01:55 -04:00
|
|
|
It would be awesome for everybody :)
|
|
|
|
|
|
|
|
### List of themes:
|
|
|
|
|
2013-07-20 09:10:17 -04:00
|
|
|
- Original theme By Frederic Guillot
|
|
|
|
- Midnight By Luca Marra
|
|
|
|
- Green by Maxime (aka EpocDotFr)
|
2013-09-18 22:48:29 -04:00
|
|
|
- Bootstrap 3 (Light) By Silvus
|
|
|
|
- Bootswatch Cyborg By Silvus
|
2013-07-20 09:10:17 -04:00
|
|
|
|
2013-10-01 18:47:33 -04:00
|
|
|
### How to create or update a translation?
|
|
|
|
|
|
|
|
- Translations are stored inside the directory `locales`
|
|
|
|
- There is sub-directory for each language, by example for french we have `fr_FR`, for italian `it_IT` etc...
|
|
|
|
- A translation is a PHP file that return an Array with a key-value pairs
|
|
|
|
- The key is the original text in english and the value is the translation for the corresponding language
|
|
|
|
|
|
|
|
French translations are always the most recent (because I am french).
|
|
|
|
|
|
|
|
Create a new translation:
|
|
|
|
|
|
|
|
1. Make a new directory: `locales/xx_XX` by example `locales/fr_CA` for French Canadian
|
|
|
|
2. Create a new file for the translation: `locales/xx_XX/translations.php`
|
|
|
|
3. Use the content of the french locales to have the most recent keys and replace the values
|
|
|
|
4. Inside the file `model.php`, add a new entry for your translation in the function `get_languages()`
|
|
|
|
5. Check with your local installation of Miniflux if everything is ok
|
|
|
|
6. Send a pull-request with Github
|
|
|
|
|
2013-07-20 09:10:17 -04:00
|
|
|
### Coding standards for contributors
|
|
|
|
|
|
|
|
- Line indentation: 4 spaces
|
|
|
|
- Line endings: Unix
|
2013-07-21 10:23:05 -04:00
|
|
|
- File encoding: UTF-8
|
2013-08-31 11:27:21 -04:00
|
|
|
|
|
|
|
### How the content grabber works?
|
|
|
|
|
|
|
|
1. Try with rules first (xpath patterns) for the domain name (see `PicoFeed\Rules\`)
|
|
|
|
2. Try to find the text content by using common attributes for class and id
|
|
|
|
3. Fallback to Readability if no content is found
|
|
|
|
4. Finally, if nothing is found, the feed content is displayed
|
|
|
|
|
|
|
|
The content downloader use a fake user agent, actually Google Chrome under Mac Os X.
|
|
|
|
|
|
|
|
However the content grabber doesn't work very well with all websites.
|
|
|
|
**The best results are obtained with Xpath rules file.**
|
|
|
|
|
|
|
|
There is a PHP script inside PicoFeed to import Fivefilters rules, but I dont' use it because almost of these patterns are not up to date.
|
|
|
|
|
|
|
|
### How to write a grabber rules file?
|
|
|
|
|
|
|
|
Add a PHP file to the directory `PicoFeed\Rules`, the filename must be the domain name:
|
|
|
|
|
|
|
|
Example with the BBC website, `www.bbc.co.uk.php`:
|
|
|
|
|
|
|
|
<?php
|
|
|
|
return array(
|
|
|
|
'test_url' => 'http://www.bbc.co.uk/news/world-middle-east-23911833',
|
|
|
|
'body' => array(
|
|
|
|
'//div[@class="story-body"]',
|
|
|
|
),
|
|
|
|
'strip' => array(
|
|
|
|
'//script',
|
|
|
|
'//form',
|
|
|
|
'//style',
|
|
|
|
'//*[@class="story-date"]',
|
|
|
|
'//*[@class="story-header"]',
|
|
|
|
'//*[@class="story-related"]',
|
|
|
|
'//*[contains(@class, "byline")]',
|
|
|
|
'//*[contains(@class, "story-feature")]',
|
|
|
|
'//*[@id="video-carousel-container"]',
|
|
|
|
'//*[@id="also-related-links"]',
|
|
|
|
'//*[contains(@class, "share") or contains(@class, "hidden") or contains(@class, "hyper")]',
|
|
|
|
)
|
|
|
|
);
|
|
|
|
|
|
|
|
Actually, only `body`, `strip` and `test_url` are supported.
|
|
|
|
|
|
|
|
Don't forget to send a pull request or a ticket to share your contribution with everybody,
|
|
|
|
|
|
|
|
### List of content grabber rules
|
|
|
|
|
|
|
|
**If you want to add new rules, just open a ticket and I will do it.**
|
|
|
|
|
|
|
|
- *.blog.lemonde.fr
|
|
|
|
- *.blog.nytimes.com
|
2013-09-18 22:48:29 -04:00
|
|
|
- *.nytimes.com
|
2013-09-30 22:15:18 -04:00
|
|
|
- *.phoronix.com
|
2013-08-31 11:27:21 -04:00
|
|
|
- *.slate.com
|
2013-09-18 22:48:29 -04:00
|
|
|
- *.theguardian.com
|
2013-08-31 19:05:19 -04:00
|
|
|
- *.wikipedia.org
|
2013-09-18 22:48:29 -04:00
|
|
|
- *.wired.com
|
2013-08-31 11:27:21 -04:00
|
|
|
- *.wsj.com
|
2013-09-08 18:29:27 -04:00
|
|
|
- github.com
|
2013-11-26 21:01:13 -05:00
|
|
|
- golem.de
|
2013-11-30 17:05:41 -05:00
|
|
|
- ing.dk
|
|
|
|
- karriere.jobfinder.dk
|
2013-09-08 18:29:27 -04:00
|
|
|
- lifehacker.com
|
2013-09-30 22:15:18 -04:00
|
|
|
- lists.*
|
|
|
|
- medium.com
|
|
|
|
- pastebin.com
|
2013-09-18 22:48:29 -04:00
|
|
|
- plus.google.com
|
2013-08-31 11:27:21 -04:00
|
|
|
- rue89.com
|
2013-08-31 19:05:19 -04:00
|
|
|
- smallhousebliss.com
|
2013-11-26 21:01:13 -05:00
|
|
|
- spiegel.de
|
2013-08-31 19:05:19 -04:00
|
|
|
- techcrunch.com
|
2013-11-30 17:05:41 -05:00
|
|
|
- version2.dk
|
2013-08-31 11:27:21 -04:00
|
|
|
- www.bbc.co.uk
|
2013-09-18 22:48:29 -04:00
|
|
|
- www.businessweek.com
|
2013-08-31 11:27:21 -04:00
|
|
|
- www.cnn.com
|
|
|
|
- www.egscomics.com
|
2013-09-08 18:29:27 -04:00
|
|
|
- www.forbes.com
|
2013-08-31 11:27:21 -04:00
|
|
|
- www.lemonde.fr
|
2013-09-18 22:48:29 -04:00
|
|
|
- www.lepoint.fr
|
|
|
|
- www.npr.org
|
2013-08-31 11:27:21 -04:00
|
|
|
- www.numerama.com
|
|
|
|
- www.slate.fr
|