HansSchulze
12-13-2011, 04:41 PM
I have been debugging an issue with my DGN2200 DSL/Wifi/router that causes it to erroneously emit DHCPNACK 12 hours after granting a lease. WinXP explorer actually crashes when that happens, but as soon as WinXP or I manually repair the network link I get a new lease, good for 12 hours more network traffic.
The problem is that the Javelin seems to also slowly hang up its SMB server threads, such that after a few 12 hour periods, there is no longer a response from the SMB, and I get "network path not found" for the WinXP mapped drive letters. If I reboot the Javelin S4 through the still-working web interface, everything is fine.
CPU Utilization=0, Version=02.01.4000.20, event log doesn't show any issues, osdrive is 250GB free, almost empty. "top" shows some interesting stacking of Dlna and php processes, as if something tried to start other threads.
Mem: 194816K used, 61440K free, 0K shrd, 11136K buff, 67712K cached
Load average: 0.05, 0.05, 0.07 (State: S=sleeping R=running, W=waiting)
PID USER STATUS RSS PPID %CPU %MEM COMMAND
2796 root S 172 1 3.1 0.0 alert_agent
23001 root R 144 22243 3.1 0.0 busybox
1485 root S < 1136 1456 0.0 0.4 php
1481 root S < 968 1455 0.0 0.3 php
2632 root S 728 1 0.0 0.2 mlnet
3192 root S N 728 3191 0.0 0.2 mlnet
3191 root S 728 2632 0.0 0.2 mlnet
17570 root S 660 1 0.0 0.2 DlnaServer
17585 root S 660 17577 0.0 0.2 DlnaServer
17577 root S 660 17570 0.0 0.2 DlnaServer
17586 root S 660 17577 0.0 0.2 DlnaServer
17587 root S 660 17577 0.0 0.2 DlnaServer
17588 root S 660 17577 0.0 0.2 DlnaServer
17593 root S 484 17591 0.0 0.1 MediaSpider
17578 root S 484 1 0.0 0.1 MediaSpider
17591 root S 484 17578 0.0 0.1 MediaSpider
2647 root S 236 1 0.0 0.0 smbd
2651 root S 224 1 0.0 0.0 nmbd
2515 root S 204 2508 0.0 0.0 mysqld
2514 root S 204 2508 0.0 0.0 mysqld
2508 root S 204 2506 0.0 0.0 mysqld
2506 root S 204 2300 0.0 0.0 mysqld
2511 root S 204 2508 0.0 0.0 mysqld
2509 root S 204 2508 0.0 0.0 mysqld
2510 root S 204 2508 0.0 0.0 mysqld
2512 root S 204 2508 0.0 0.0 mysqld
2516 root S 204 2508 0.0 0.0 mysqld
2517 root S 204 2508 0.0 0.0 mysqld
1454 root S < 200 1 0.0 0.0 lighttpd
2650 root S 196 2647 0.0 0.0 smbd
22242 root S 144 1308 0.0 0.0 in.telnetd
22243 root S 140 22242 0.0 0.0 sh
17457 nobody S 136 1 0.0 0.0 mdnsd
2798 root S 124 1 0.0 0.0 chknetd
1480 root S < 120 1478 0.0 0.0 php
1484 root S < 120 1482 0.0 0.0 php
1478 root S < 120 1455 0.0 0.0 php
1482 root S < 120 1456 0.0 0.0 php
1455 root S < 120 1454 0.0 0.0 php
1456 root S < 120 1454 0.0 0.0 php
1479 root S < 120 1478 0.0 0.0 php
1483 root S < 120 1482 0.0 0.0 php
3190 nobody S 104 1 0.0 0.0 proftpd
17406 root S 104 1 0.0 0.0 fagent
17411 root S 104 1 0.0 0.0 fagent
17460 root S 104 1 0.0 0.0 mDNSResponderPo
2795 root S 96 1 0.0 0.0 i2eventd
30051 root S 84 1 0.0 0.0 syslogd
1316 root S 80 1 0.0 0.0 cron
1 root S 80 0 0.0 0.0 init
1308 root S 80 1 0.0 0.0 inetd
1422 root S 68 1 0.0 0.0 lld2d
17401 root S 68 1 0.0 0.0 dhcpcd
1305 daemon S 60 1 0.0 0.0 portmap
2300 root S 56 1 0.0 0.0 mysqld_safe
1457 root S < 56 1 0.0 0.0 stunnel
2681 root S 56 1 0.0 0.0 rpc.statd
3269 root S 56 1 0.0 0.0 afpd
2975 root S 48 1 0.0 0.0 getty
2672 root S 44 1 0.0 0.0 rpc.mountd
1328 lp S 44 1 0.0 0.0 lpd
2783 root S 36 1 0.0 0.0 buttonctl
2670 root S 24 1 0.0 0.0 rpc.rquotad
236 root SW 0 2 0.0 0.0 kswapd0
1083 root SW< 0 2 0.0 0.0 loop0
1086 root SW< 0 2 0.0 0.0 loop1
325 root SW 0 2 0.0 0.0 xfsdatad/0
1176 root SW 0 2 0.0 0.0 kdmflush
1242 root SW 0 2 0.0 0.0 xfsbufd
1236 root SW 0 2 0.0 0.0 xfsbufd
1239 root SW 0 2 0.0 0.0 xfsbufd
1080 root SW 0 2 0.0 0.0 flush-1:0
324 root SW 0 2 0.0 0.0 xfslogd/0
1360 root SW 0 2 0.0 0.0 flush-254:2
1186 root SW 0 2 0.0 0.0 kdmflush
4 root SW 0 2 0.0 0.0 events/0
1238 root SW 0 2 0.0 0.0 xfssyncd
3 root SW 0 2 0.0 0.0 ksoftirqd/0
I am hoping for a resolution of the Netgear issue, but it's slow. RMA'd router does the same as the original one, so it must be a protocol stack issue.
It would be nice to get some debug info for this. How?
The problem is that the Javelin seems to also slowly hang up its SMB server threads, such that after a few 12 hour periods, there is no longer a response from the SMB, and I get "network path not found" for the WinXP mapped drive letters. If I reboot the Javelin S4 through the still-working web interface, everything is fine.
CPU Utilization=0, Version=02.01.4000.20, event log doesn't show any issues, osdrive is 250GB free, almost empty. "top" shows some interesting stacking of Dlna and php processes, as if something tried to start other threads.
Mem: 194816K used, 61440K free, 0K shrd, 11136K buff, 67712K cached
Load average: 0.05, 0.05, 0.07 (State: S=sleeping R=running, W=waiting)
PID USER STATUS RSS PPID %CPU %MEM COMMAND
2796 root S 172 1 3.1 0.0 alert_agent
23001 root R 144 22243 3.1 0.0 busybox
1485 root S < 1136 1456 0.0 0.4 php
1481 root S < 968 1455 0.0 0.3 php
2632 root S 728 1 0.0 0.2 mlnet
3192 root S N 728 3191 0.0 0.2 mlnet
3191 root S 728 2632 0.0 0.2 mlnet
17570 root S 660 1 0.0 0.2 DlnaServer
17585 root S 660 17577 0.0 0.2 DlnaServer
17577 root S 660 17570 0.0 0.2 DlnaServer
17586 root S 660 17577 0.0 0.2 DlnaServer
17587 root S 660 17577 0.0 0.2 DlnaServer
17588 root S 660 17577 0.0 0.2 DlnaServer
17593 root S 484 17591 0.0 0.1 MediaSpider
17578 root S 484 1 0.0 0.1 MediaSpider
17591 root S 484 17578 0.0 0.1 MediaSpider
2647 root S 236 1 0.0 0.0 smbd
2651 root S 224 1 0.0 0.0 nmbd
2515 root S 204 2508 0.0 0.0 mysqld
2514 root S 204 2508 0.0 0.0 mysqld
2508 root S 204 2506 0.0 0.0 mysqld
2506 root S 204 2300 0.0 0.0 mysqld
2511 root S 204 2508 0.0 0.0 mysqld
2509 root S 204 2508 0.0 0.0 mysqld
2510 root S 204 2508 0.0 0.0 mysqld
2512 root S 204 2508 0.0 0.0 mysqld
2516 root S 204 2508 0.0 0.0 mysqld
2517 root S 204 2508 0.0 0.0 mysqld
1454 root S < 200 1 0.0 0.0 lighttpd
2650 root S 196 2647 0.0 0.0 smbd
22242 root S 144 1308 0.0 0.0 in.telnetd
22243 root S 140 22242 0.0 0.0 sh
17457 nobody S 136 1 0.0 0.0 mdnsd
2798 root S 124 1 0.0 0.0 chknetd
1480 root S < 120 1478 0.0 0.0 php
1484 root S < 120 1482 0.0 0.0 php
1478 root S < 120 1455 0.0 0.0 php
1482 root S < 120 1456 0.0 0.0 php
1455 root S < 120 1454 0.0 0.0 php
1456 root S < 120 1454 0.0 0.0 php
1479 root S < 120 1478 0.0 0.0 php
1483 root S < 120 1482 0.0 0.0 php
3190 nobody S 104 1 0.0 0.0 proftpd
17406 root S 104 1 0.0 0.0 fagent
17411 root S 104 1 0.0 0.0 fagent
17460 root S 104 1 0.0 0.0 mDNSResponderPo
2795 root S 96 1 0.0 0.0 i2eventd
30051 root S 84 1 0.0 0.0 syslogd
1316 root S 80 1 0.0 0.0 cron
1 root S 80 0 0.0 0.0 init
1308 root S 80 1 0.0 0.0 inetd
1422 root S 68 1 0.0 0.0 lld2d
17401 root S 68 1 0.0 0.0 dhcpcd
1305 daemon S 60 1 0.0 0.0 portmap
2300 root S 56 1 0.0 0.0 mysqld_safe
1457 root S < 56 1 0.0 0.0 stunnel
2681 root S 56 1 0.0 0.0 rpc.statd
3269 root S 56 1 0.0 0.0 afpd
2975 root S 48 1 0.0 0.0 getty
2672 root S 44 1 0.0 0.0 rpc.mountd
1328 lp S 44 1 0.0 0.0 lpd
2783 root S 36 1 0.0 0.0 buttonctl
2670 root S 24 1 0.0 0.0 rpc.rquotad
236 root SW 0 2 0.0 0.0 kswapd0
1083 root SW< 0 2 0.0 0.0 loop0
1086 root SW< 0 2 0.0 0.0 loop1
325 root SW 0 2 0.0 0.0 xfsdatad/0
1176 root SW 0 2 0.0 0.0 kdmflush
1242 root SW 0 2 0.0 0.0 xfsbufd
1236 root SW 0 2 0.0 0.0 xfsbufd
1239 root SW 0 2 0.0 0.0 xfsbufd
1080 root SW 0 2 0.0 0.0 flush-1:0
324 root SW 0 2 0.0 0.0 xfslogd/0
1360 root SW 0 2 0.0 0.0 flush-254:2
1186 root SW 0 2 0.0 0.0 kdmflush
4 root SW 0 2 0.0 0.0 events/0
1238 root SW 0 2 0.0 0.0 xfssyncd
3 root SW 0 2 0.0 0.0 ksoftirqd/0
I am hoping for a resolution of the Netgear issue, but it's slow. RMA'd router does the same as the original one, so it must be a protocol stack issue.
It would be nice to get some debug info for this. How?