

+1 vote
270 views 6 comments
by

We have a production environment with around 100 RUT955 devices. A couple of months ago we updated several devices to FW version RUT9_R_00.07.01.4.

After about 50 days of uptime we are getting high ping response times from those devices. Remote SSH is not available (connection refused or timeout), and WebGUI login fails with "device busy".

Device logs show out-of-memory problems with the process port_eventsd as the source:

[4268791.398774] oom-kill:constraint=CONSTRAINT_NONE,nodemask=(null),task=port_eventsd,pid=2632,uid=0 
[4268791.407894] Out of memory: Killed process 2632 (port_eventsd) total-vm:67244kB, anon-rss:65532kB, file-rss:4kB, shmem-rss:0kB, UID:0 pgtables:80kB oom_score_adj:0 
[4268792.032643] oom_reaper: reaped process 2632 (port_eventsd), now anon-rss:0kB, file-rss:0kB, shmem-rss:0kB
[4586670.162124] oom-kill:constraint=CONSTRAINT_NONE,nodemask=(null),task=port_eventsd,pid=2632,uid=0
[4586670.171269] Out of memory: Killed process 2632 (port_eventsd) total-vm:71784kB, anon-rss:70076kB, file-rss:4kB, shmem-rss:0kB, UID:0 pgtables:88kB oom_score_adj:0
[4586670.965407] oom_reaper: reaped process 2632 (port_eventsd), now anon-rss:0kB, file-rss:0kB, shmem-rss:0kB
After the oom-kill, device response time is back to normal, but before that we see 24-72 hours of reduced performance and lost connectivity.
Is this a firmware-related bug? Is there any workaround?
by

As far as we can see this is connected to firmware v7 and occurs on both RUT955 and RUT956 devices. The memory usage of port_eventsd keeps increasing over time.

Firmware v7, RUT955:

[email protected]:~# cat /etc/version && uptime && free && top -n 1 |grep eventsd

RUT9_R_00.07.01.4

 10:09:29 up 48 days, 16:13,  load average: 0.03, 0.04, 0.00

              total        used        free      shared  buff/cache   available

Mem:         124832       93832       21396         280        9604        1484

Swap:             0           0           0

 4599     1 root     S    66844  53%   0% /usr/bin/port_eventsd --suppress-topol

[email protected]:~#

Firmware v7, RUT956:

[email protected]:~# cat /etc/version && uptime && free && top -n 1 |grep eventsd

RUT9M_R_00.07.01.7

 12:08:34 up 32 days, 22:11,  load average: 0.25, 0.30, 0.34

              total        used        free      shared  buff/cache   available

Mem:         123268       73828       25900         220       23540       12988

Swap:             0           0           0

 2575     1 root     S    45924  37%   9% /usr/bin/port_eventsd --suppress-topol

[email protected]:~#

Firmware v6, RUT955:

[email protected]:~# cat /etc/version && uptime && free && top -n 1 |grep eventsd

RUT9XX_R_00.06.06.1

 10:10:48 up 27 days,  5:01,  load average: 0.27, 0.06, 0.02

              total        used        free      shared  buff/cache   available

Mem:         125984       25728       74764         492       25492       99040

Swap:             0           0           0

25175 25148 root     S     1536   1%   0% grep eventsd

[email protected]:~#

1 Answer

0 votes
by
Hello,

Would it be possible to get a full troubleshoot file in which these logs, or more of them, are visible? That would allow me to build a better case for our RnD department to look into this more deeply.

Thank you
by
Any updates on this issue? Over the last week the situation has become a lot worse, since we are now around 50 days from the rollout of RUTOS v7. We have implemented a temporary workaround: we check the free memory and, if the value drops below 30000 kB, we run killall port_eventsd (a sketch of this check is included at the end of this comment).

What is the purpose of the port_eventsd process?

Could it be disabled?
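
A minimal sketch of the watchdog mentioned above, assuming a BusyBox shell on the device; the script name, log tag and the 30000 kB threshold are our own choices, and whether port_eventsd is restarted automatically after the kill depends on how it is managed on the device:

#!/bin/sh
# port_eventsd_watchdog.sh (hypothetical helper, not part of RUTOS):
# kill port_eventsd when free memory drops below a threshold.
THRESHOLD_KB=30000
# The 4th field of the "Mem:" line from free is the free memory in kB.
FREE_KB=$(free | awk '/^Mem:/ {print $4}')
if [ "$FREE_KB" -lt "$THRESHOLD_KB" ]; then
    logger -t port_eventsd_watchdog "free=${FREE_KB}kB, killing port_eventsd"
    killall port_eventsd
fi

We run it periodically from cron; the interval can be adjusted as needed.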
by
Could you post the output of cat /proc/(pid of port_eventsd)/status ?
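
For example (assuming pidof is available on the device, as it usually is with BusyBox), the PID can be looked up inline:

cat /proc/$(pidof port_eventsd)/status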
by
[email protected]:~# cat /proc/2558/status
Name:   port_eventsd
Umask:  0022
State:  S (sleeping)
Tgid:   2558
Ngid:   0
Pid:    2558
PPid:   1
TracerPid:      0
Uid:    0       0       0       0
Gid:    0       0       0       0
FDSize: 32
Groups:
NStgid: 2558
NSpid:  2558
NSpgid: 1
NSsid:  1
VmPeak:    37844 kB
VmSize:    37844 kB
VmLck:         0 kB
VmPin:         0 kB
VmHWM:     37020 kB
VmRSS:     37020 kB
RssAnon:           36128 kB
RssFile:             892 kB
RssShmem:              0 kB
VmData:    36156 kB
VmStk:       132 kB
VmExe:        16 kB
VmLib:      1536 kB
VmPTE:        52 kB
VmSwap:        0 kB
CoreDumping:    0
THP_enabled:    0
Threads:        1
SigQ:   0/953
SigPnd: 00000000000000000000000000000000
ShdPnd: 00000000000000000000000000000000
SigBlk: 00000000000000000000000000000000
SigIgn: 00000000000000000000000000001000
SigCgt: 00000000000000000000000000024002
CapInh: 0000000000000000
CapPrm: 0000003fffffffff
CapEff: 0000003fffffffff
CapBnd: 0000003fffffffff
CapAmb: 0000000000000000
NoNewPrivs:     0
Seccomp:        0
Speculation_Store_Bypass:       unknown
Cpus_allowed:   1
Cpus_allowed_list:      0
voluntary_ctxt_switches:        2305029
nonvoluntary_ctxt_switches:     1480045
[email protected]:~#
by

The most interesting field to monitor is VmRSS; it already looks high, which probably indicates a memory leak. Re-check in a few hours and post the new cat output. The value of cat /proc/2558/oom_score could also be of interest.

htop gives a better view of the memory usage than top:

opkg update; opkg install htop
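
To capture the growth over time, a minimal logging sketch (assumptions: BusyBox shell, pidof available, a single port_eventsd process; the log path and file name are arbitrary, and /tmp is tmpfs, so the log is lost on reboot):

#!/bin/sh
# Hypothetical sketch: record VmRSS and oom_score of port_eventsd so the
# growth can be reviewed later, e.g. when run hourly from a cron job.
PID=$(pidof port_eventsd)
[ -n "$PID" ] || exit 0
RSS_KB=$(awk '/^VmRSS:/ {print $2}' /proc/$PID/status)
SCORE=$(cat /proc/$PID/oom_score)
echo "$(date '+%Y-%m-%d %H:%M:%S') pid=$PID VmRSS=${RSS_KB}kB oom_score=$SCORE" >> /tmp/port_eventsd_mem.log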

by
Hello,

We have found the memory leak on our side; it will be gone with the 7.2.6 FW release (as currently planned).