Enhancement #3423
Collectd 5: avoid rrd write problems on overloaded machines
Status: | CLOSED | Start date: | ||
---|---|---|---|---|
Priority: | Normal | Due date: | ||
Assignee: | - | % Done: | 100% | |
Category: | nethserver-collectd | |||
Target version: | v6.8 | |||
Resolution: | NEEDINFO: | No |
Description
When a machine become overloaded, collectd can't correctly write data to rrd files.
See https://collectd.org/wiki/index.php/Inside_the_RRDtool_plugin
The suggested configuration (from EPEL 7) is /etc/collectd.d/rrdtoolconf
:
LoadPlugin rrdtool <Plugin rrdtool> DataDir "/var/lib/collectd/rrd" CacheTimeout 120 CacheFlush 900 </Plugin>
After the modification, please also backport this related fix: https://github.com/NethServer/dev/issues/5098
Associated revisions
conf: enable cache. Refs #3423
History
#1 Updated by Giacomo Sanchietti almost 5 years ago
- Category set to nethserver-collectd
- Status changed from NEW to TRIAGED
- Target version set to v6.8
- % Done changed from 0 to 20
#2 Updated by Giacomo Sanchietti almost 5 years ago
- Status changed from TRIAGED to ON_DEV
- Assignee set to Giacomo Sanchietti
- % Done changed from 20 to 30
#3 Updated by Giacomo Sanchietti almost 5 years ago
- Status changed from ON_DEV to MODIFIED
- % Done changed from 30 to 60
#4 Updated by Giacomo Sanchietti almost 5 years ago
- Added cache setup in
/etc/collectd.d/00plugins.conf
- Cherry-picked https://github.com/NethServer/nethserver-collectd/commit/b441eda1b4783035b80df46769ddbe1bdabefa8c
#5 Updated by Giacomo Sanchietti almost 5 years ago
- Status changed from MODIFIED to ON_QA
- Assignee deleted (
Giacomo Sanchietti) - % Done changed from 60 to 70
In nethserver-testing:
- nethserver-collectd-2.0.2-1.2.gaea7f3d.ns6.noarch.rpm
- Update an existing installation
- Check collectd is restarted and rrd files are written
- Execute
/etc/cron.daily/collectd_cleanup
- Check the script delete only files not modified in last 24 hours
#6 Updated by Davide Principi almost 5 years ago
- Assignee set to Davide Principi
#7 Updated by Davide Principi almost 5 years ago
- Status changed from ON_QA to VERIFIED
- Assignee deleted (
Davide Principi) - % Done changed from 70 to 90
#8 Updated by Giacomo Sanchietti almost 5 years ago
- Status changed from VERIFIED to CLOSED
- % Done changed from 90 to 100
Released:
- nethserver-collectd-2.0.3-1.ns6.noarch.rpm