Fuite de swap et process killed

Bonjour,

Depuis 3 semaines, j’ai un comportement étrange de ma smart avec une chute du swap tous les jours entre 5h et 5h15 entrainant des erreurs mémoire insuffisantes et des process killés.

Voici l’état de santé :

Voici l’historique du swap (avec monitor depuis un esclave jeedom) :

J’ai fait un dmsg et j’ai pu trouvé le process kill dont voici un extrait :

[53443.811756] kworker/u8:2 invoked oom-killer: gfp_mask=0x3000d0, order=2, oom_score_adj=0
[53443.818156] kworker/u8:2 cpuset=/ mems_allowed=0
[53443.822706] CPU: 1 PID: 307 Comm: kworker/u8:2 Not tainted 3.14.79-94 #1
[53443.829354] Workqueue: khelper __call_usermodehelper
[53443.838085] Call trace:
[53443.842183] [<ffffffc001088e40>] dump_backtrace+0x0/0x128
[53443.846314] [<ffffffc001088f8c>] show_stack+0x24/0x30
[53443.851050] [<ffffffc001885b2c>] dump_stack+0x88/0xac
[53443.856055] [<ffffffc00117f874>] dump_header.isra.6+0x84/0x1e0
[53443.861831] [<ffffffc00117ffc4>] oom_kill_process+0x2d4/0x408
[53443.867527] [<ffffffc0011805f0>] out_of_memory+0x2e8/0x318
[53443.872961] [<ffffffc001185dc4>] __alloc_pages_nodemask+0xa0c/0xa20
[53443.879172] [<ffffffc0010ab978>] copy_process.part.8+0x138/0x14e0
[53443.885206] [<ffffffc0010acee0>] do_fork+0xc0/0x3e8
[53443.890037] [<ffffffc0010ad24c>] kernel_thread+0x44/0x58
[53443.895299] [<ffffffc0010c867c>] __call_usermodehelper+0xa4/0xc0
[53443.901249] [<ffffffc0010cbd90>] process_one_work+0x138/0x500
[53443.906940] [<ffffffc0010ccdf8>] worker_thread+0x130/0x438
[53443.912378] [<ffffffc0010d4150>] kthread+0xd8/0xf0
[53443.917491] Mem-Info:
[53443.921218] Normal per-cpu:
[53443.924874] CPU    0: hi:  186, btch:  31 usd:  53
[53443.928603] CPU    1: hi:  186, btch:  31 usd:  29
[53443.932459] CPU    2: hi:  186, btch:  31 usd: 174
[53443.937203] CPU    3: hi:  186, btch:  31 usd: 139
[53443.941982] active_anon:300978 inactive_anon:100344 isolated_anon:0
                active_file:585 inactive_file:568 isolated_file:36
                unevictable:0 dirty:0 writeback:0 unstable:0
                free:1489 slab_reclaimable:4149 slab_unreclaimable:7487
                mapped:9882 shmem:25388 pagetables:3687 bounce:0
                free_cma:0
[53443.991013] Normal free:5880kB min:5300kB low:6624kB high:7948kB active_anon:1203912kB inactive_anon:401376kB active_file:2192kB inactive_file:2332kB unevictable:0kB isolated(anon):0kB isolated(file):16kB present:1947648kB managed:1758616kB mlocked:0kB dirty:0kB writeback:0kB mapped:39344kB shmem:101552kB slab_reclaimable:16596kB slab_unreclaimable:29948kB kernel_stack:5984kB pagetables:14748kB unstable:0kB bounce:0kB free_cma:0kB writeback_tmp:0kB pages_scanned:784 all_unreclaimable? no
[53444.041956] lowmem_reserve[]: 0 0
[53444.049313] Normal: 1538*4kB (UEM) 15*8kB (EM) 0*16kB 0*32kB 0*64kB 0*128kB 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 6272kB
[53444.060075] Node 0 hugepages_total=0 hugepages_free=0 hugepages_surp=0 hugepages_size=2048kB
[53444.067492] 26768 total pagecache pages
[53444.072200] 198 pages in swap cache
[53444.076760] Swap cache stats: add 75404, delete 75206, find 6377/9974
[53444.082191] Free swap  = 0kB
[53444.086928] Total swap = 262140kB
[53444.091486] 486912 pages RAM
[53444.095933] 0 pages HighMem/MovableOnly
[53444.100455] 47258 pages reserved
[53444.104884] [ pid ]   uid  tgid total_vm      rss nr_ptes swapents oom_score_adj name
[53444.111592] [  229]     0   229     6943      643      11       68             0 systemd-journal
[53444.120426] [  246]     0   246     3399       51       7      127         -1000 systemd-udevd
[53444.128819] [  412]     0   412     1811       47       6       56             0 bluetoothd
[53444.137078] [  413]     0   413      898       15       5       40             0 atd
[53444.144781] [  414]     0   414     1322       35       5       43             0 cron
[53444.152550] [  415]     0   415     1763       36       7       89             0 systemd-logind
[53444.161334] [  416]     0   416    54454      331      10      112             0 rsyslogd
[53444.169291] [  418]   107   418     1581       91       6       66          -900 dbus-daemon
[53444.177638] [  425]     0   425     2776       78       8      107             0 connmand
[53444.185723] [  501]     0   501     1731       54       7      204             0 dhclient
[53444.193858] [  551]     0   551    37435       36      39     1348             0 php-fpm7.0
[53444.202307] [  565]     0   565     2598       38       7      151         -1000 sshd
[53444.209900] [  567]     0   567      979        0       4       36             0 agetty
[53444.217826] [  568]     0   568     1035        0       5       37             0 agetty
[53444.225756] [  575]     0   575    37638      394      39     1213             0 apache2
[53444.233789] [  584]    33   584    37435       16      34     1368             0 php-fpm7.0
[53444.242264] [  585]    33   585    37435       16      34     1368             0 php-fpm7.0
[53444.250343] [  786]   106   786   189784    50561     235     8705             0 mysqld
[53444.258277] [  789]     0   789    42470      676      17     1748             0 fail2ban-server
[53444.266970] [  847]   104   847    18213       39       8      104             0 ntpd
[53444.274757] [ 1884]     0  1884     1632        0       7       97             0 sudo
[53444.282528] [ 1885]     0  1885   142980     2269      43     4582             0 python3
[53444.290677] [ 1925]     0  1925     1632        0       7       97             0 sudo
[53444.298294] [ 1926]     0  1926    79758      903      22     2489             0 python
[53444.306215] [ 2003]    33  2003    37714      127      39     1790             0 php
[53444.313917] [ 4796]     0  4796     1632        0       7       97             0 sudo
[53444.321676] [ 4803]     0  4803     2256      147       7      114             0 openvpn
[53444.329865] [ 5902]    33  5902    61374     1462      21     2029             0 python
[53444.337631] [ 6287]    33  6287    37714      206      42     1858             0 php
[53444.345286] [ 6317]    33  6317     5197      635      13     1878             0 python
[53444.353248] [ 6660]     0  6660     1632        0       6       95             0 sudo
[53444.361008] [ 6661]     0  6661    80688      390      24     3526             0 python3
[53444.369181] [ 7073]    33  7073    37731      538      40     1610             0 php
[53444.376679] [ 7095]    33  7095   116612     1448      27     2019             0 python
[53444.384646] [ 7230]    33  7230   251016     6308      79     2482             0 nodejs
[53444.392575] [ 7264]    33  7264   119350     1864      31     4057             0 python
[53444.400671] [ 7566]    33  7566   151103     2156      52     9002             0 python
[53444.408442] [ 7980]    33  7980   209746     3611      38      853             0 python3
[53444.416440] [15792]     0 15792     1105       62       5       27             0 atd
[53444.424145] [15793]     0 15793      467       22       4        0             0 sh
[53444.431956] [15794]     0 15794      879       72       4        0             0 bash
[53444.439491] [16887]     0 16887    13323     3741      29        0             0 apt-get
[53444.447492] [25558]     0 25558     1841      675       6        0             0 dpkg
[53444.455280] [ 5588]    33  5588   337439     9902      64        0             0 python3
[53444.463302] [ 8696]     0  8696      467       22       4        0             0 php7.0-cli.post
[53444.472198] [ 8702]     0  8702     4649     3084      12        0             0 frontend
[53444.480120] [ 8817]     0  8817      950      143       4        1             0 ucf
[53444.487792] [15689]    33 15689    37842     1663      45     1044             0 apache2
[53444.495790] [30487]    33 30487    56585     1952      46     1035             0 apache2
[53444.504049] [26276]    33 26276    37844     1690      45     1044             0 apache2
[53444.511860] [22563]    33 22563    98149     3701      26        0             0 python
[53444.519796] [ 7955]    33  7955    37859     1663      44     1043             0 apache2
[53444.527810] [ 7956]    33  7956    38355     2136      46     1042             0 apache2
[53444.535973] [ 6733]    33  6733    37843     1494      45     1044             0 apache2
[53444.543864] [23520]    33 23520    38335     2044      45     1044             0 apache2
[53444.551910] [ 6426]    33  6426    37800     1586      44     1044             0 apache2
[53444.559903] [ 5340]    33  5340    37803     1611      44     1044             0 apache2
[53444.567916] [ 3554]    33  3554    37803     1668      44     1044             0 apache2
[53444.576096] [19084]    33 19084    57829     8502      48        0             0 php
[53444.583622] [19417]    33 19417   299199   257721     544     5499             0 php
[53444.591309] [32437]    33 32437    37741      957      40     1053             0 apache2
[53444.599319] [32454]    33 32454    18608      849      28        0             0 php
[53444.606968] [32473]     0 32473      908       39       4        0             0 bluepy-helper
[53444.615710] [32498]     0 32498     1529       85       6       11             0 cron
[53444.623298] [32499]     0 32499     1529       84       6       11             0 cron
[53444.631059] [32500]     0 32500     1529       84       6       11             0 cron
[53444.638814] [32531]     0 32531      467       22       4        0             0 sh
[53444.646386] [32532]     0 32532      467       23       4        0             0 sh
[53444.654220] [32533]    33 32533      467       22       4        0             0 sh
[53444.661591] [32538]    33 32538    11729      324      15        0             0 php
[53444.669264] [32539]     0 32539    11729      322      13        0             0 php
[53444.676916] [32540]     0 32540     1568       94       6        0             0 su
[53444.684538] [32542]     0 32542     1529       84       6       11             0 cron
[53444.692471] [32543]     0 32543     1529       91       6       11             0 cron
[53444.700057] [32544]     0 32544     1529       91       6       11             0 cron
[53444.707810] [32575]     0 32575      467       22       4        0             0 sh
[53444.715384] [32576]     0 32576      467       22       4        0             0 sh
[53444.723002] [32579]    33 32579      467       22       4        0             0 sh
[53444.730764] [32584]     0 32584     1568       94       6        0             0 su
[53444.738175] [32585]     0 32585    11729      315      15        0             0 php
[53444.745834] [32590]    33 32590    11729      314      15        0             0 php
[53444.753533] [32606]     0 32606     1529       84       6       11             0 cron
[53444.761298] [32607]     0 32607     1529       84       6       11             0 cron
[53444.769220] [32608]     0 32608     1529       84       6       11             0 cron
[53444.776795] [32613]    33 32613    10751      310      12        0             0 php
[53444.784495] [32616]    33 32616    37741      803      40     1058             0 apache2
[53444.792519] [32624]     0 32624      467       22       4        0             0 sh
[53444.800278] [32625]     0 32625      467       22       4        0             0 sh
[53444.807691] [32626]    33 32626      467       22       5        0             0 sh
[53444.815262] [32628]    33 32628    10751      307      13        0             0 php
[53444.822968] [32629]    33 32629    11729      328      15        0             0 php
[53444.830637] [32630]     0 32630     1568       93       6        0             0 su
[53444.838388] [32631]     0 32631    11729      325      15        0             0 php
[53444.845879] [32639]     0 32639     1529       84       6       11             0 cron
[53444.853672] [32640]     0 32640     1529       84       6       11             0 cron
[53444.861442] [32641]     0 32641     1529       89       6       11             0 cron
[53444.869191] [32648]     0 32648      467       23       4        0             0 sh
[53444.877009] [32649]     0 32649      467       22       4        0             0 sh
[53444.884374] [32650]    33 32650      467       22       4        0             0 sh
[53444.891966] [32652]     0 32652     1568       93       6        0             0 su
[53444.899549] [32653]     0 32653    11729      314      15        0             0 php
[53444.907205] [32654]    33 32654    11729      333      14        0             0 php
[53444.914942] [32655]    33 32655    37739      784      38     1058             0 apache2
[53444.923100] [32659]     0 32659     1529       84       6       11             0 cron
[53444.930692] [32660]     0 32660     1529       84       6       11             0 cron
[53444.938449] [32661]     0 32661     1529       84       6       11             0 cron
[53444.946191] [32665]    33 32665    37739      722      38     1065             0 apache2
[53444.954384] [32667]    33 32667    10749      107      11        0             0 php
[53444.961914] [32674]     0 32674      467       23       4        0             0 sh
[53444.969493] [32675]     0 32675      467       22       3        0             0 sh
[53444.977065] [32678]    33 32678      467       23       3        0             0 sh
[53444.984846] [32680]     0 32680     1568       93       6        0             0 su
[53444.992273] [32681]     0 32681    11729      326      13        0             0 php
[53444.999944] [32682]    33 32682    11729      333      13        0             0 php
[53445.007614] [32686]     0 32686     1529       84       6       11             0 cron
[53445.015362] [32688]     0 32688     1529       85       6       11             0 cron
[53445.023329] [32689]     0 32689     1529       90       6       11             0 cron
[53445.030913] [32691]    33 32691    37739      722      38     1065             0 apache2
[53445.038933] [32693]    33 32693    10749      106      12        0             0 php
[53445.046588] [32700]     0 32700      467       22       3        0             0 sh
[53445.054349] [32701]    33 32701      467       22       4        0             0 sh
[53445.061793] [32703]     0 32703      467       22       4        0             0 sh
[53445.069376] [32706]    33 32706    11729      334      15        0             0 php
[53445.077030] [32707]     0 32707    11729      322      15        0             0 php
[53445.084730] [32708]     0 32708     1568       93       6        0             0 su
[53445.092475] [32712]     0 32712     1529       91       6       11             0 cron
[53445.100085] [32713]     0 32713     1529       84       6       11             0 cron
[53445.107848] [32714]     0 32714     1529       84       6       11             0 cron
[53445.115584] [32716]    33 32716    37709      659      36     1089             0 apache2
[53445.123793] [32719]    33 32719    37666      515      35     1133             0 apache2
[53445.131655] [32720]    33 32720    37711      681      37     1080             0 apache2
[53445.139679] [32722]    33 32722     1299       71       5        0             0 bash
[53445.147411] [32724]    33 32724    37695      661      36     1094             0 apache2
[53445.155461] [32726]     0 32726      467       23       4        0             0 sh
[53445.163053] [32727]     0 32727      467       23       4        0             0 sh
[53445.170847] [32728]    33 32728      467       22       4        0             0 sh
[53445.178222] [32729]    33 32729    37658      475      35     1158             0 apache2
[53445.186222] [32730]     0 32730    10751      115      12        0             0 php
[53445.193924] [32731]     0 32731     1568       93       6        0             0 su
[53445.201599] [32735]    33 32735    10751      106      11        0             0 php
[53445.209325] [32738]    33 32738    37652      446      35     1184             0 apache2
[53445.217186] [32739]    33 32739    37652      438      35     1184             0 apache2
[53445.225234] [32740]    33 32740    37652      441      35     1184             0 apache2
[53445.233253] [32745]     0 32745     1529       84       6       11             0 cron
[53445.241016] [32746]     0 32746     1529       84       6       11             0 cron
[53445.248952] [32747]     0 32747     1529       84       6       11             0 cron
[53445.256517] [32748]    33 32748    37652      437      35     1184             0 apache2
[53445.264568] [32752]    33 32752    37652      455      35     1184             0 apache2
[53445.272583] [32753]    33 32753     1296       76       5        0             0 bash
[53445.280492] [32756]    33 32756    37652      451      35     1184             0 apache2
[53445.288361] [32763]    33 32763      467       23       4        0             0 sh
[53445.295935] [32764]     0 32764      467       22       4        0             0 sh
[53445.303550] [32765]     0 32765      467       22       4        0             0 sh
[53445.311139] [  301]    33   301    37652      450      35     1184             0 apache2
[53445.319250] [  302]     0   302    10749      125      11        0             0 php
[53445.326811] [  303]     0   303     1561      111       6        0             0 su
[53445.334425] [  304]    33   304    10749      125      11        0             0 php
[53445.342104] [  315]    33   315    37650      439      35     1184             0 apache2
[53445.350125] [  316]    33   316     1294       85       4        0             0 bash
[53445.358052] [  318]    33   318    37644      420      35     1189             0 apache2
[53445.365885] [  319]    33   319    37652      451      35     1184             0 apache2
[53445.373933] [  320]    33   320    37644      421      35     1189             0 apache2
[53445.381952] [  321]    33   321    37644      421      35     1189             0 apache2
[53445.389969] [  322]    33   322    37644      421      35     1189             0 apache2
[53445.398174] [  323]    33   323    37644      421      35     1189             0 apache2
[53445.405987] [  326]     0   326     1529      109       6       18             0 cron
[53445.413778] [  327]     0   327     1529      103       6       18             0 cron
[53445.421545] [  328]     0   328     1529      103       6       18             0 cron
[53445.429300] Out of memory: Kill process 19417 (php) score 522 or sacrifice child
**[53445.436775] Killed process 19417 (php) total-vm:1196796kB, anon-rss:1017528kB, file-rss:13324kB**
[53475.770837] ERROR::handle_hc_nak_intr:1307: Can not read device info from hub.We take it error

[53477.419347] bluepy-helper[2373]: unhandled level 1 translation fault (11) at 0x550000000b, esr 0x92000005
[53477.426490] pgd = ffffffc002f3f000
[53477.429317] [550000000b] *pgd=0000000000000000

J’ai déjà essayé de :

  • Désactiver les plugins 1 par 1 sans réel succès

  • Nettoyer les virtuels non utilisé et supprimer les plugins non utilisés

  • Déporter tous ce qui est multimédia, info conso elec, scénarios sur un autre Jeedom pour libérer de la ressource

  • Mettre un hub usb avec alimentation externe pour écarter un problème périphérique

  • Faire un recovery et restaurer ma sauvegarde

  • Passer tous les logs en erreur pour limiter les ressources

Maintenant je n’ai plus d’idée …
Est ce que vous auriez des idées ?
Merci d’avance

Bonjour,
Si tu lances un backups il se passe quoi au niveau du swap ?

Bonjour Loic et merci de l’intêret.

J’ai fait 2 opérations :

  • backup
  • reboot puis backup

Dans la pièce jointe, les screenshot de santé, log backup, dmesg à chaque étape.

Swap Leak

Je remarque juste des erreurs dans le log de backup que je n’avais pas remarqué avant.

Mise à jour en 3.3.35 mais toujours même problème.

Là je sais pas désolé…

J’ai ouvert un ticket au support qui a été fermé et je n’ai pas reçu de réponse.
J’ai bien reçu l’accusé d’ouverture de ticket donc ma boite mail fonctionne, rien dans les spams.

Y’aurait il quelqu’un de l’équipe Jeedom pour récuperer mon ticket ?
Merci d’avance.

Toujours en recherche de la raison du plantage. Je suis tombé sur 2 taches système qui ont lieu un peu avant l’heure : CRON Cache / Persist & CRON History / Archive.
Lorsque je lance la premiere, j’ai l’espace disque du SMART qui s’effondre et lorsque je lance le second, il dure très longtemps et fini l’espace disque avant de faire s’écrouler le swap et cela entraine les kill de process …
Y a t il de la logique dans cette constatation ?

Bonjour @fredlamour

avez vous trouvé une solution à votre problème, depuis 3 jours, je dois vider manuellement mon swap qui explose après 24H

Bonjour,

J’ai procédé à une nouvelle installation et c’est redevenu stable. Entre temps il y avait eu une mise à jour du core.
Impossible de trouver la cause …