csf notification: exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
Posted: 20 Sep 2019, 13:04
Apologies if this is not really a csf related issue, but notifications have been sent via csf with an issue that my webhost hasn't identified.
I believe from the below that I am experiencing the symptoms of an issue - that my site is not the cause of the issue.
Normal occurrence daily at around 8:34pm
server load .27
memory used 39.36
During the day/night, don't see load above .8
At approx. 8:35pm each night, server unreachable for approx. 2 minutes, then get email notifications from csf including High 5 minute load average alert [1 Min Load Avg > 20, as high as > 100] and another for excessive processes, which lists:
User:customer PID:25066 PPID:24914 Run Time:27(secs) Memory:13236(kb) RSS:1572(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25067 PPID:24955 Run Time:27(secs) Memory:13236(kb) RSS:1572(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25068 PPID:25065 Run Time:25(secs) Memory:13236(kb) RSS:1576(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25069 PPID:25046 Run Time:24(secs) Memory:13236(kb) RSS:1576(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25070 PPID:23350 Run Time:24(secs) Memory:13236(kb) RSS:1576(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25071 PPID:24942 Run Time:23(secs) Memory:13236(kb) RSS:1576(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25072 PPID:25039 Run Time:21(secs) Memory:13236(kb) RSS:1572(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25073 PPID:24793 Run Time:20(secs) Memory:13236(kb) RSS:1576(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25074 PPID:25001 Run Time:19(secs) Memory:13236(kb) RSS:1572(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25076 PPID:25033 Run Time:18(secs) Memory:13236(kb) RSS:1576(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25079 PPID:24797 Run Time:17(secs) Memory:13236(kb) RSS:1572(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25084 PPID:25078 Run Time:16(secs) Memory:13236(kb) RSS:1576(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25085 PPID:24924 Run Time:11(secs) Memory:13236(kb) RSS:1576(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25086 PPID:25082 Run Time:10(secs) Memory:13236(kb) RSS:1572(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25087 PPID:24954 Run Time:9(secs) Memory:13236(kb) RSS:1576(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25088 PPID:25004 Run Time:7(secs) Memory:13236(kb) RSS:1576(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25089 PPID:24795 Run Time:5(secs) Memory:13236(kb) RSS:1572(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25090 PPID:24923 Run Time:0(secs) Memory:13236(kb) RSS:1576(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
I say again: At approx. 8:35pm each night, server unreachable for approx. 2 minutes. Server load any other time < .8
Webhost support claims the following, in descending order, could be the cause, even though the issue occurs at a set time and not at a peak useage time for visitors:
- support claim an issue with two databases. "...due to the databases "xxxyyy" and "yyyxxx". Now I have repaired the database from my end."
There were no issues with these databases and didn't require 'fixing'. Issue still exists.
- support now inform me "I have now added the executables in csf pignore list".
Not sure how not being notified of an issue helps. Issue still exists.
- support now state they will monitor the server, findings are "We have experienced the load spike now. We found that the mysql process is consuming more resources on that time. "
- support then say "I have checked the cron log for the mentioned time and found the user "customer" was run the following cron on that time."
I have a cron set up to run every 15 minutes. I reschedule cron so would not run during 8-9pm. Issue still exists.
- support now say "The server load appears to be normal now. There were lots of Apache connections from the IP "185.25.35.x"."
- support also say "You may need to optimize the scripts with the help of your developer. Else, you may need to change the Cron intervals."
Already did the cron change, no difference. Added 185.25.35.x to deny file in csf. Issue still exists.
- support then say "The website script "xxxyyy.php" was using high process usage. "
Top Process | 17.0 - this is the most used script so logically it will show up as the top process.
- support "I have shared the slow query log's of 24 hours below. Please discuss with MYSQL developer and see if the database can be optimized."
Have made many optimizations, Issue still exists.
Given the issue only occurs at one specific time, is it possible the webhost is doing a server backup or using the server as a backup that could cause this issue, or could another customer on the 'node' or physical server be causing the issue?
Any other possible explanations?
I've looked at all crons with
cat /var/spool/cron/*
crons around the same time:
this one likely set up by webhost to look at logs
35 * * * * /usr/bin/test -x /usr/local/cpanel/bin/tail-check && /usr/local/cpanel/bin/tail-check
this one unlikely to be the cause as no load issue at any other time
5,20,35,50 * * * * /usr/local/cpanel/scripts/eximstats_spam_check 2>&1
I believe from the below that I am experiencing the symptoms of an issue - that my site is not the cause of the issue.
Normal occurrence daily at around 8:34pm
server load .27
memory used 39.36
During the day/night, don't see load above .8
At approx. 8:35pm each night, server unreachable for approx. 2 minutes, then get email notifications from csf including High 5 minute load average alert [1 Min Load Avg > 20, as high as > 100] and another for excessive processes, which lists:
User:customer PID:25066 PPID:24914 Run Time:27(secs) Memory:13236(kb) RSS:1572(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25067 PPID:24955 Run Time:27(secs) Memory:13236(kb) RSS:1572(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25068 PPID:25065 Run Time:25(secs) Memory:13236(kb) RSS:1576(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25069 PPID:25046 Run Time:24(secs) Memory:13236(kb) RSS:1576(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25070 PPID:23350 Run Time:24(secs) Memory:13236(kb) RSS:1576(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25071 PPID:24942 Run Time:23(secs) Memory:13236(kb) RSS:1576(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25072 PPID:25039 Run Time:21(secs) Memory:13236(kb) RSS:1572(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25073 PPID:24793 Run Time:20(secs) Memory:13236(kb) RSS:1576(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25074 PPID:25001 Run Time:19(secs) Memory:13236(kb) RSS:1572(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25076 PPID:25033 Run Time:18(secs) Memory:13236(kb) RSS:1576(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25079 PPID:24797 Run Time:17(secs) Memory:13236(kb) RSS:1572(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25084 PPID:25078 Run Time:16(secs) Memory:13236(kb) RSS:1576(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25085 PPID:24924 Run Time:11(secs) Memory:13236(kb) RSS:1576(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25086 PPID:25082 Run Time:10(secs) Memory:13236(kb) RSS:1572(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25087 PPID:24954 Run Time:9(secs) Memory:13236(kb) RSS:1576(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25088 PPID:25004 Run Time:7(secs) Memory:13236(kb) RSS:1576(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25089 PPID:24795 Run Time:5(secs) Memory:13236(kb) RSS:1572(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
User:customer PID:25090 PPID:24923 Run Time:0(secs) Memory:13236(kb) RSS:1576(kb) exe:/opt/suphp/sbin/suphp cmd:/opt/suphp/sbin/suphp
I say again: At approx. 8:35pm each night, server unreachable for approx. 2 minutes. Server load any other time < .8
Webhost support claims the following, in descending order, could be the cause, even though the issue occurs at a set time and not at a peak useage time for visitors:
- support claim an issue with two databases. "...due to the databases "xxxyyy" and "yyyxxx". Now I have repaired the database from my end."
There were no issues with these databases and didn't require 'fixing'. Issue still exists.
- support now inform me "I have now added the executables in csf pignore list".
Not sure how not being notified of an issue helps. Issue still exists.
- support now state they will monitor the server, findings are "We have experienced the load spike now. We found that the mysql process is consuming more resources on that time. "
- support then say "I have checked the cron log for the mentioned time and found the user "customer" was run the following cron on that time."
I have a cron set up to run every 15 minutes. I reschedule cron so would not run during 8-9pm. Issue still exists.
- support now say "The server load appears to be normal now. There were lots of Apache connections from the IP "185.25.35.x"."
- support also say "You may need to optimize the scripts with the help of your developer. Else, you may need to change the Cron intervals."
Already did the cron change, no difference. Added 185.25.35.x to deny file in csf. Issue still exists.
- support then say "The website script "xxxyyy.php" was using high process usage. "
Top Process | 17.0 - this is the most used script so logically it will show up as the top process.
- support "I have shared the slow query log's of 24 hours below. Please discuss with MYSQL developer and see if the database can be optimized."
Have made many optimizations, Issue still exists.
Given the issue only occurs at one specific time, is it possible the webhost is doing a server backup or using the server as a backup that could cause this issue, or could another customer on the 'node' or physical server be causing the issue?
Any other possible explanations?
I've looked at all crons with
cat /var/spool/cron/*
crons around the same time:
this one likely set up by webhost to look at logs
35 * * * * /usr/bin/test -x /usr/local/cpanel/bin/tail-check && /usr/local/cpanel/bin/tail-check
this one unlikely to be the cause as no load issue at any other time
5,20,35,50 * * * * /usr/local/cpanel/scripts/eximstats_spam_check 2>&1