Enhancement #3423

Collectd 5: avoid rrd write problems on overloaded machines

Added by Giacomo Sanchietti about 3 years ago. Updated about 3 years ago.

Status: CLOSED
Start date:
Priority: Normal
Due date:
Assignee: -
% Done: 100%
Category: nethserver-collectd
Target version: v6.8
Resolution:
NEEDINFO: No

Description

When a machine becomes overloaded, collectd cannot write data to the rrd files correctly.
See https://collectd.org/wiki/index.php/Inside_the_RRDtool_plugin

The suggested configuration (from EPEL 7), in /etc/collectd.d/rrdtoolconf, is:

LoadPlugin rrdtool
<Plugin rrdtool>
       DataDir "/var/lib/collectd/rrd" 
       CacheTimeout 120
       CacheFlush   900
</Plugin>
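
With caching enabled, values are held in memory before being written, so an rrd file is expected to lag behind by a few minutes. A rough way to spot files that are no longer being written at all is to list those whose modification time is older than some threshold. The sketch below is only an illustration: the function name `stale_rrd_files` is hypothetical, and the 20-minute default is an assumption chosen to sit slightly above the `CacheFlush` value of 900 seconds. It relies on `find -mmin`, as provided by GNU find.

```shell
#!/bin/sh
# Hypothetical helper (not part of nethserver-collectd):
# list rrd files that have not been modified in the last N minutes.
#   $1  data directory (defaults to the DataDir from the configuration above)
#   $2  age threshold in minutes (assumed default: 20, a bit above CacheFlush)
stale_rrd_files() {
    data_dir="${1:-/var/lib/collectd/rrd}"
    max_age_min="${2:-20}"
    # -mmin +N matches files last modified more than N minutes ago
    find "$data_dir" -type f -name '*.rrd' -mmin +"$max_age_min"
}
```

Calling `stale_rrd_files` with no arguments checks the default data directory. An empty result suggests that every file was written recently, while a long list on a busy machine may indicate that writes are still falling behind.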

After the modification, please also backport this related fix: https://github.com/NethServer/dev/issues/5098

Associated revisions

Revision 5df4dbb3
Added by Giacomo Sanchietti about 3 years ago

conf: enable cache. Refs #3423

History

#1 Updated by Giacomo Sanchietti about 3 years ago

  • Category set to nethserver-collectd
  • Status changed from NEW to TRIAGED
  • Target version set to v6.8
  • % Done changed from 0 to 20

#2 Updated by Giacomo Sanchietti about 3 years ago

  • Status changed from TRIAGED to ON_DEV
  • Assignee set to Giacomo Sanchietti
  • % Done changed from 20 to 30

#3 Updated by Giacomo Sanchietti about 3 years ago

  • Status changed from ON_DEV to MODIFIED
  • % Done changed from 30 to 60

#5 Updated by Giacomo Sanchietti about 3 years ago

  • Status changed from MODIFIED to ON_QA
  • Assignee deleted (Giacomo Sanchietti)
  • % Done changed from 60 to 70
In nethserver-testing:
  • nethserver-collectd-2.0.2-1.2.gaea7f3d.ns6.noarch.rpm
Test case 1
  • Update an existing installation
  • Check that collectd is restarted and rrd files are written
Test case 2
  • Execute /etc/cron.daily/collectd_cleanup
  • Check that the script deletes only files not modified in the last 24 hours
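
Test case 2 could be checked along these lines: record which files were modified in the last 24 hours, run the cleanup, then confirm that all of them still exist. This is a minimal sketch, not the procedure actually used for QA. The helper names `recent_files` and `all_survived` are hypothetical, the default directory is assumed to be the rrd data directory, and `find -mmin` is the GNU find option.

```shell
#!/bin/sh
# Hypothetical helper: list files modified within the last 24 hours
# (1440 minutes), sorted. The default directory is an assumption.
recent_files() {
    find "${1:-/var/lib/collectd/rrd}" -type f -mmin -1440 | sort
}

# Hypothetical check: read file paths from stdin and return 0 only if
# every one of them still exists; report the first missing path on stderr.
all_survived() {
    while IFS= read -r path; do
        [ -e "$path" ] || { echo "missing: $path" >&2; return 1; }
    done
    return 0
}
```

A possible run: save the output of `recent_files` to a temporary file, execute /etc/cron.daily/collectd_cleanup, then feed the saved list to `all_survived`. A file whose age is very close to 24 hours may cross the threshold between the two steps, so a failure on such a file is not necessarily a bug.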

#6 Updated by Davide Principi about 3 years ago

  • Assignee set to Davide Principi

#7 Updated by Davide Principi about 3 years ago

  • Status changed from ON_QA to VERIFIED
  • Assignee deleted (Davide Principi)
  • % Done changed from 70 to 90

#8 Updated by Giacomo Sanchietti about 3 years ago

  • Status changed from VERIFIED to CLOSED
  • % Done changed from 90 to 100
Released:
  • nethserver-collectd-2.0.3-1.ns6.noarch.rpm
