Project

General

Profile

Actions

Bug #7189

closed

issues with process management on physical hosting LXC containers

Added by LibrA LinuX over 8 years ago. Updated over 8 years ago.

Status:
Released
Priority:
2
Assignee:
Matthieu CERDA
Category:
Packaging
Target version:
Severity:
UX impact:
User visibility:
Effort required:
Priority:
Name check:
Fix check:
Regression:

Description

Seems i have process issue similar to the Bug #4498. Actually i have several physical servers each one have a rudder-agent installed, and hosting several lxc containers, each one having also rudder-agent installed.
Every 5 minutes, on cron job i received the stdout of the rudder cron job :
WARNING: No disable file detected and no CFEngine process neither. Relaunching CFEngine processes... Done

I think the problem is coming from the physical host that see all cf-execd|cf-agent process (its process and the LXC ones)
When launching the cron job on physical host, ther is too many cfengine instance processes running and the job is killing all of them before restarting only its local process.
It then kills all the cfengine process of the LXC containers...

  1. if [ -e /opt/rudder/bin/check-rudder-agent ]; then /opt/rudder/bin/check-rudder-agent; else if [ ! -e /opt/rudder/etc/disable-agent -a `ps -efww | grep -E "(cf-execd|cf-agent)" | grep -E "/var/rudder/cfengine-community/bin/(cf-execd|cf-agent)" | grep -v grep | wc -l` -eq 0 ]; then /var/rudder/cfengine-community/bin/cf-agent -f failsafe.cf >/dev/null 2>\&1 \&\& /var/rudder/cfengine-community/bin/cf-agent >/dev/null 2>\&1; if [ $? != 0 ]; then if [ -f /opt/rudder/etc/rudder-restart-message.txt ]; then cat /opt/rudder/etc/rudder-restart-message.txt; else echo "Rudder agent was unable to restart on $(hostname)."; fi; fi; fi; fi
    WARNING: Too many instance of CFEngine cf-execd processes running. Killing them... Done
    WARNING: No disable file detected and no CFEngine process neither. Relaunching CFEngine processes...
    Done
    root@libra1 ~#

Related issues 4 (0 open4 closed)

Related to Rudder - Bug #4498: Several issues with process management on Proxmox host (and container)RejectedActions
Related to Rudder - Bug #7335: check-rudder-agent silently fails if namespaces are not supportedReleasedVincent MEMBRÉ2015-10-30Actions
Related to Rudder - Bug #7338: All reports are missing (totally orange) for a node due to multiple cf-execd processesReleasedNicolas CHARLES2015-10-30Actions
Related to Rudder - Bug #7381: Process management issues on nodes hosting LXC containersReleasedAlexis MoussetActions
Actions #1

Updated by Nicolas CHARLES over 8 years ago

  • Assignee set to Benoît PECCATTE
  • Priority changed from N/A to 2
  • Target version changed from 3.1.2 to 2.11.14

Thank you for the bug report, this is something Benoit will be able to fix !

Actions #2

Updated by Nicolas CHARLES over 8 years ago

  • Related to Bug #4498: Several issues with process management on Proxmox host (and container) added
Actions #3

Updated by Benoît PECCATTE over 8 years ago

  • Status changed from New to In progress
Actions #4

Updated by Benoît PECCATTE over 8 years ago

  • Status changed from In progress to Pending technical review
  • Assignee changed from Benoît PECCATTE to Matthieu CERDA
  • Pull Request set to https://github.com/Normation/rudder-packages/pull/747
Actions #5

Updated by Benoît PECCATTE over 8 years ago

  • Status changed from Pending technical review to Pending release
  • % Done changed from 0 to 100
Actions #6

Updated by Matthieu CERDA over 8 years ago

Actions #7

Updated by Vincent MEMBRÉ over 8 years ago

  • Category changed from Agent to Packaging
Actions #8

Updated by Vincent MEMBRÉ over 8 years ago

  • Status changed from Pending release to Released

This bug has been fixed in Rudder 2.10.17, 2.11.14, 3.0.9 and 3.1.2 which were released today.

Actions #9

Updated by Jonathan CLARKE over 8 years ago

  • Related to Bug #7335: check-rudder-agent silently fails if namespaces are not supported added
Actions #10

Updated by Nicolas CHARLES over 8 years ago

  • Related to Bug #7338: All reports are missing (totally orange) for a node due to multiple cf-execd processes added
Actions #11

Updated by Alexis Mousset over 8 years ago

  • Related to Bug #7381: Process management issues on nodes hosting LXC containers added
Actions

Also available in: Atom PDF