UDOO locks up after a while

Discussion in 'Troubleshooting' started by dhylands, Nov 5, 2013.

  1. dhylands

    dhylands New Member

    Joined:
    Nov 4, 2013
    Messages:
    6
    Likes Received:
    0
    So I just received my UDOO (quad) a little while ago.

    I used the linux sdcard and updated following the directions.

    My UDOO boots up fine. However, if I leave it, then it will lock up after several hours and require a reset/power cycle to become usable again.

    I have wired ethernet, USB keyboard and mouse and an HDMI TV/monitor.

    On my UDOO, I booted it up and opened a terminal window, so that I could verify if typing anything on the keyboard would respond.
    On my desktop machine, I started: ping -i 60 IP-Address-of-UDOO

    and it lasted for 3845 seconds (about 58 hours) before the pings started to fail. The keyboard and mouse were also unresponsive. A previous time I tried this (but without the pings), it lasted less than 12 hours before locking up.

    The heatsink on the UDOO was warm to the touch, but not overly so.

    I thought I'd ask if this was a known issue, and whether there was any additional information I can grab that would be useful.
     
  2. DracoLlasa

    DracoLlasa UDOOer

    Joined:
    Oct 15, 2013
    Messages:
    419
    Likes Received:
    3
    This is a new issue to me, i would like to see what is shown on the serial/debug console when this "lock up occurs"
    Do you know how to access the serial console?

    I will also run the same test on my quad, i have never left it on that long.
     
  3. dhylands

    dhylands New Member

    Joined:
    Nov 4, 2013
    Messages:
    6
    Likes Received:
    0
    I've powered the board up again, and started the ping process.

    I connected the debug console this time, and will save that.

    In my previous email, I reported 3845 seconds before it hung. It was actually 3485 minutes (since I set the ping interval to 60 seconds, and there were 3485 successfull pings). 3485 / 60 = 58 hours.

    I also found my my contactless IR thermometer, so I can watch the temperature on the heatsink.
     
  4. DracoLlasa

    DracoLlasa UDOOer

    Joined:
    Oct 15, 2013
    Messages:
    419
    Likes Received:
    3
    just to update
    Code:
    ubuntu@imx6-qsdl:~$ uptime
     00:45:27 up 1 day, 12:02,
    
    I have 36 hours at this point. I think i can let it run another day or so, but i will have to start doing some work on it soon.

    After the first 20 hours or so when i first opened a serial connection i really had to tap enter a few times to wake it up and get a console but it came up. i was then able to SSH in without problem. I do not however have a working gui. it could be because i booted with no HDMI connection and left it a bit without, i think that causes issues with Xserver, or it could be something else... regardless, overall 36 hours into it and no issues. Im not pinging. just keeping a serial connection open and sometimes and SSH session.

    I will update more tomorrow, or whenever the time comes that i have to shut down/reboot to do work with it.
     
  5. dhylands

    dhylands New Member

    Joined:
    Nov 4, 2013
    Messages:
    6
    Likes Received:
    0
    Mine has been running for 52 hours so far.

    I haven't seen anything additional printed on the debug console since it booted.

    I'm running the ping once per minte and have a script running in t a terminal on the monitor which shows the temp once every 10 minutes.

    The temp normally sits around 44 occaisonally going up to 46 and then back down to 44.

    I'll keep letting mine run for a few more days anyways.

    If I do:

    dmesg | tail

    then I see messages that aren't showing up on the debug console.

    I started:

    sudo tail -f /var/log/syslog.log

    on the debug serial (unless I hear of something better to do from you)
     
  6. DracoLlasa

    DracoLlasa UDOOer

    Joined:
    Oct 15, 2013
    Messages:
    419
    Likes Received:
    3
    it sounds like the system is being responsive for you. you are running commands its responding. you are able to interact with the system right?
    Are you concerned that there doesn't alear to be active logging?
    When im logged in on the console i dont normally see random output so i think thats ok.. it may be different not logged in not sure.

    just for starters though, its /var/log/syslog
    The file is named syslog not syslog.log. When i review mine i and there isnt much there except for a dhcpclient entry every 30 minutes

    Code:
    
    ubuntu@imx6-qsdl:/var/log$ sudo tail -n 20 syslog
    Nov  7 15:47:38 imx6-qsdl dhclient: DHCPREQUEST of 192.168.0.11 on wlan6 to 192.168.0.1 port 67
    Nov  7 15:47:39 imx6-qsdl dhclient: DHCPACK of 192.168.0.11 from 192.168.0.1
    Nov  7 15:47:39 imx6-qsdl dhclient: bound to 192.168.0.11 -- renewal in 1752 seconds.
    Nov  7 16:16:51 imx6-qsdl dhclient: DHCPREQUEST of 192.168.0.11 on wlan6 to 192.168.0.1 port 67
    Nov  7 16:16:52 imx6-qsdl dhclient: DHCPACK of 192.168.0.11 from 192.168.0.1
    Nov  7 16:16:52 imx6-qsdl dhclient: bound to 192.168.0.11 -- renewal in 1463 seconds.
    Nov  7 16:17:01 imx6-qsdl CRON[8177]: (root) CMD (   cd / && run-parts --report /etc/cron.hourly)
    Nov  7 16:41:15 imx6-qsdl dhclient: DHCPREQUEST of 192.168.0.11 on wlan6 to 192.168.0.1 port 67
    Nov  7 16:41:16 imx6-qsdl dhclient: DHCPACK of 192.168.0.11 from 192.168.0.1
    Nov  7 16:41:16 imx6-qsdl dhclient: bound to 192.168.0.11 -- renewal in 1705 seconds.
    Nov  7 17:09:41 imx6-qsdl dhclient: DHCPREQUEST of 192.168.0.11 on wlan6 to 192.168.0.1 port 67
    Nov  7 17:09:41 imx6-qsdl dhclient: DHCPACK of 192.168.0.11 from 192.168.0.1
    Nov  7 17:09:41 imx6-qsdl dhclient: bound to 192.168.0.11 -- renewal in 1450 seconds.
    Nov  7 17:17:01 imx6-qsdl CRON[8198]: (root) CMD (   cd / && run-parts --report /etc/cron.hourly)
    Nov  7 17:33:51 imx6-qsdl dhclient: DHCPREQUEST of 192.168.0.11 on wlan6 to 192.168.0.1 port 67
    Nov  7 17:33:52 imx6-qsdl dhclient: DHCPACK of 192.168.0.11 from 192.168.0.1
    Nov  7 17:33:52 imx6-qsdl dhclient: bound to 192.168.0.11 -- renewal in 1752 seconds.
    Nov  7 18:03:04 imx6-qsdl dhclient: DHCPREQUEST of 192.168.0.11 on wlan6 to 192.168.0.1 port 67
    Nov  7 18:03:05 imx6-qsdl dhclient: DHCPACK of 192.168.0.11 from 192.168.0.1
    Nov  7 18:03:05 imx6-qsdl dhclient: bound to 192.168.0.11 -- renewal in 1750 seconds.
    
    At this point my uptime is 53 hours and not a single issue for me, but i haev to move around some power cables so im not sure i will be able to keep it running, im going to try and switch over to battery real time without rebooting so if that works i will keep it running while i re-arrange my cables but so far things are good on my test

    Code:
    ubuntu@imx6-qsdl:/var/log$ uptime
     18:16:59 up 2 days,  5:33,  1 user,  load average: 0.01, 0.05, 0.05
    
     
  7. dhylands

    dhylands New Member

    Joined:
    Nov 4, 2013
    Messages:
    6
    Likes Received:
    0
    Sometime before 103 hours, I noticed that my GUI had frozen.

    In this particular case, the pings were still working. I observed a series of kernel crashes. I'll put the first one here, which appears to be some type of kernel heap corruption, while executing a system call on behalf of the wpa_supplicant. I was using wired ethernet, but the wireless adapter is plugged into my UDOO and connecting to my wireless access poiint.

    [351858.173619] wlan4: deauthenticated from 00:1c:10:1b:57:24 (Reason: 7)
    [351858.601744] cfg80211: Calling CRDA to update world regulatory domain
    [351859.990121] wlan4: authenticate with 00:1c:10:1b:57:24 (try 1)
    [351860.000819] wlan4: authenticated
    [351860.008615] wlan4: associate with 00:1c:10:1b:57:24 (try 1)
    [351860.022543] wlan4: RX ReassocResp from 00:1c:10:1b:57:24 (capab=0x411 status=0 aid=1)
    [351860.030543] wlan4: associated
    [351861.020251] kernel BUG at mm/slub.c:2949!
    [351861.024537] Unable to handle kernel NULL pointer dereference at virtual address 00000000
    [351861.032960] pgd = ba844000
    [351861.035808] [00000000] *pgd=4fb79831, *pte=00000000, *ppte=00000000
    [351861.042264] Internal error: Oops: 817 [#2] PREEMPT SMP
    [351861.047501] Modules linked in: vivante drm
    [351861.051750] CPU: 0 Tainted: G D W (3.0.35 #1)
    [351861.057178] PC is at __bug+0x18/0x24
    [351861.060855] LR is at __bug+0x14/0x24
    [351861.064535] pc : [<80041af8>] lr : [<80041af4>] psr: 600f0013
    [351861.064545] sp : ba39d9d0 ip : 00000000 fp : 000000c0
    [351861.076219] r10: 00000000 r9 : 00000000 r8 : ba39da54
    [351861.081545] r7 : 000001f4 r6 : 00000000 r5 : 80341034 r4 : 8edb0000
    [351861.088173] r3 : 00000000 r2 : 00000000 r1 : b1ab9220 r0 : 00000034
    [351861.094806] Flags: nZCv IRQs on FIQs on Mode SVC_32 ISA ARM Segment user
    [351861.102044] Control: 10c53c7d Table: 4a84404a DAC: 00000015
    [351861.107892] Process wpa_supplicant (pid: 5697, stack limit = 0xba39c2f0)
    [351861.114695] Stack: (0xba39d9d0 to 0xba39e000)
    [351861.119157] d9c0: 8edb0000 800ee248 000000c0 8023ab90
    [351861.127446] d9e0: bfe59c00 8edb0000 80341034 00000000 000001f4 ba39da54 00000000 00000000
    [351861.135734] da00: 000000c0 8023ab90 8edb0000 ba39da18 00000000 80341e3c 00000000 00000000
    [351861.144022] da20: 00000000 ba39da24 ba39da24 00000000 ba0d8400 bfd19e40 00000004 00000007
    [351861.152310] da40: 00006884 8034205c bfe59c00 80000580 00000003 00000004 a00f0013 00000001
    [351861.160598] da60: 000001f4 000000c0 80000580 00000007 00000004 00006884 00000000 80330770
    [351861.168886] da80: 00000000 00006884 ba080b80 00000004 000001f4 8004c018 bfc7782c ba0d8400
    [351861.177174] daa0: bfa7b000 bfa7b1fc ffdf0f38 bfa7b000 00000004 00000000 000000c0 00000007
    [351861.185462] dac0: 000001f4 00006884 ba39db5c 80330880 00000000 ba080b80 00000004 000001f4
    [351861.193750] dae0: ba39dbd8 00000004 00000000 00006884 ba39db5c bfa7b000 00000007 000000c0
    [351861.202038] db00: 000001f4 80330a98 ba39db5c 00000004 000001f4 00000009 ba0d8908 bfa7b254
    [351861.210326] db20: 80330278 ba39db8c ba39dbc0 b9d1c184 00006888 00000021 ba72b040 ba39db8c
    [351861.218614] db40: 00000058 803380ac ba39db5c 00000004 000001f4 80330614 80330278 baab98c0
    [351861.226902] db60: bfa7b000 80337620 ba39de3c 80b2ffb4 80b2ffe0 8020bde0 00000000 80b2ffb4
    [351861.235190] db80: 80aab8a4 80b2ffb4 80b2ffb8 8023de44 80b2ffb4 bfa7b000 00000000 b9d1c184
    [351861.243477] dba0: ba39dbc0 00000001 ba72b040 00000000 00000058 8032c494 ba72b238 ba0a1830
    [351861.251765] dbc0: 00000004 00000000 ba72b240 00000000 bb9fb1b7 44d61153 671fef6d d1041d2f
    [351861.260052] dbe0: 00000000 00000000 00000000 00000000 b9d1c000 ba72b238 00000000 ba72b000
    [351861.268341] dc00: 00000001 805518f4 b9d1c184 b9d1c18e b9d1c000 b9d1c000 ba0a1420 80551e70
    [351861.276629] dc20: b9d1c000 ba0a1420 ba0a1420 b9d1c240 00000001 b9d1c000 00000000 b9d1c210
    [351861.284916] dc40: ba0a1428 8054a920 00000006 b9d1c234 bfcd2900 ba39dc70 ba0a1000 bfa7a000
    [351861.293204] dc60: b9d1c240 80524d30 b9d1c240 ba39dc70 b9d1c218 b9d1c234 00000010 00000006
    [351861.301492] dc80: 000fac04 00000000 00000001 00000000 80b61cbc 80b61cbc b9d1c200 bfcd2600
    [351861.309780] dca0: 80b61e04 00000000 ba39dcb8 80480ed8 80988d50 00000000 0000579b 00001641
    [351861.318069] dcc0: b9d1c200 b9d1c210 b9d1c214 bfca9600 bfa7a000 ba0a1000 b9d1c200 bfcd2600
    [351861.326356] dce0: 80480cf4 bfa35c00 80bab880 bfca922c ba39dd14 8047f520 bfcd2600 00000000
    [351861.334644] dd00: bfcd2600 80480cec bfca9200 8047f178 00000058 7fffffff 00000000 ba39df74
    [351861.342932] dd20: bfcd2600 ba39de00 bfa35c00 00000000 00000058 00000000 00000000 8047fab4
    [351861.351219] dd40: bfbf2500 bfcd2900 ba6a8180 bfe39900 00000000 00001641 00000000 00000000
    [351861.359506] dd60: bfbf255c ba39dd88 00000058 00000000 00000000 00000058 ba39dee0 00000000
    [351861.367794] dd80: 00000008 8044d624 4fd197c0 ba8ed480 00000000 00000001 ffffffff 00000000
    [351861.376081] dda0: 00000000 00000000 00000000 00000000 ba80d960 80341098 00000000 00000000
    [351861.384369] ddc0: 00001718 bfd197c0 ba39de00 00001718 bfd197c0 ba8ed480 ba0d8468 ffdf0f00
    [351861.392657] dde0: 8033860c 803300fc baab98c0 bfa7b000 00000000 bfe59c00 baab98c0 bfa7b000
    [351861.400945] de00: 00000000 80330418 bfb7c480 00000058 bc303520 4a7c2f00 ba39dd48 ba39df74
    [351861.409233] de20: ba39de60 ba39dee0 00000001 ba39dee0 8003e704 ba39dee0 00000008 00000000
    [351861.417521] de40: ba39df74 ba39df74 bc303520 00000000 ba39dee0 8044e334 ba39df20 ba39df20
    [351861.425808] de60: 00000010 00000000 00000000 ba091680 bfe59ce8 00000001 00000000 bfe59ce8
    [351861.434096] de80: 00000000 ba09168c 80b06080 8034fe14 ba0a1000 7e87e0e8 00000020 00000000
    [351861.442384] dea0: 00008933 00000144 ffdf9510 00000510 00000a25 00000000 bfe59d38 00000a25
    [351861.450672] dec0: 00000fff bfe59d30 6e616c77 b9d1c3e0 600f0013 00000000 bfe59ce8 c09b0340
    [351861.458960] dee0: 000aada0 00000000 00000000 bfe59ce8 c09b0340 80352ef0 0084c940 80105664
    [351861.467248] df00: 80b19a64 bc757cc0 bc61da20 bfe59c00 a00f0193 00000001 00000000 00000048
    [351861.475535] df20: 00000000 00000001 00000001 00000080 00000048 80b6c1f0 80b08500 800a8b1c
    [351861.483823] df40: 00000001 bc303520 00000000 7e87e068 00000128 8003e704 ba39c000 00000000
    [351861.492111] df60: 00000000 8044e458 00000000 800a8c3c 80b08500 ba39de60 0000000c ba39dee0
    [351861.500398] df80: 00000001 00000000 00000000 00000000 fffffff7 00000000 2ad28000 000a7d40
    [351861.508685] dfa0: 00099690 8003e580 2ad28000 000a7d40 00000006 7e87e068 00000000 00000000
    [351861.516972] dfc0: 2ad28000 000a7d40 00099690 00000128 00000000 00000003 00000000 00000000
    [351861.525261] dfe0: 00000000 7e87e04c 2ad1a727 2ac040cc 400f0010 00000006 ba39dff4 ba3add00
    [351861.533575] [<80041af8>] (__bug+0x18/0x24) from [<800ee248>] (kfree+0x138/0x144)
    [351861.541104] [<800ee248>] (kfree+0x138/0x144) from [<8023ab90>] (kref_put+0x4c/0x7c)
    [351861.548890] [<8023ab90>] (kref_put+0x4c/0x7c) from [<80341e3c>] (usb_start_wait_urb+0x7c/0xc4)
    [351861.557621] [<80341e3c>] (usb_start_wait_urb+0x7c/0xc4) from [<8034205c>] (usb_control_msg+0xc8/0xec)
    [351861.566969] [<8034205c>] (usb_control_msg+0xc8/0xec) from [<80330770>] (rt2x00usb_vendor_request+0xb8/0x128)
    [351861.576918] [<80330770>] (rt2x00usb_vendor_request+0xb8/0x128) from [<80330880>] (rt2x00usb_vendor_req_buff_lock+0x68/0x114)
    [351861.588255] [<80330880>] (rt2x00usb_vendor_req_buff_lock+0x68/0x114) from [<80330a98>] (rt2x00usb_vendor_request_buff+0x70/0xb0)
    [351861.599944] [<80330a98>] (rt2x00usb_vendor_request_buff+0x70/0xb0) from [<803380ac>] (rt2x00usb_register_read+0x34/0x44)
    [351861.610936] [<803380ac>] (rt2x00usb_register_read+0x34/0x44) from [<80337620>] (rt2800_config_pairwise_key+0x78/0xec)
    [351861.621664] [<80337620>] (rt2800_config_pairwise_key+0x78/0xec) from [<8032c494>] (rt2x00mac_set_key+0x11c/0x17c)
    [351861.632049] [<8032c494>] (rt2x00mac_set_key+0x11c/0x17c) from [<805518f4>] (ieee80211_key_enable_hw_accel+0x9c/0x130)
    [351861.642778] [<805518f4>] (ieee80211_key_enable_hw_accel+0x9c/0x130) from [<80551e70>] (ieee80211_key_link+0xb0/0x164)
    [351861.653525] [<80551e70>] (ieee80211_key_link+0xb0/0x164) from [<8054a920>] (ieee80211_add_key+0xb0/0x124)
    [351861.663227] [<8054a920>] (ieee80211_add_key+0xb0/0x124) from [<80524d30>] (nl80211_new_key+0xf4/0x120)
    [351861.672659] [<80524d30>] (nl80211_new_key+0xf4/0x120) from [<80480ed8>] (genl_rcv_msg+0x1e4/0x22c)
    [351861.681736] [<80480ed8>] (genl_rcv_msg+0x1e4/0x22c) from [<8047f520>] (netlink_rcv_skb+0xbc/0xd8)
    [351861.690724] [<8047f520>] (netlink_rcv_skb+0xbc/0xd8) from [<80480cec>] (genl_rcv+0x1c/0x24)
    [351861.699189] [<80480cec>] (genl_rcv+0x1c/0x24) from [<8047f178>] (netlink_unicast+0x2a8/0x300)
    [351861.707829] [<8047f178>] (netlink_unicast+0x2a8/0x300) from [<8047fab4>] (netlink_sendmsg+0x1f8/0x304)
    [351861.717267] [<8047fab4>] (netlink_sendmsg+0x1f8/0x304) from [<8044d624>] (sock_sendmsg+0x94/0xb4)
    [351861.726263] [<8044d624>] (sock_sendmsg+0x94/0xb4) from [<8044e334>] (__sys_sendmsg+0x2cc/0x2dc)
    [351861.735080] [<8044e334>] (__sys_sendmsg+0x2cc/0x2dc) from [<8044e458>] (sys_sendmsg+0x3c/0x68)
    [351861.743825] [<8044e458>] (sys_sendmsg+0x3c/0x68) from [<8003e580>] (ret_fast_syscall+0x0/0x30)
    [351861.752550] Code: e92d4010 e59f000c eb14aaaf e3a03000 (e5833000)
    [351861.759846] ---[ end trace 4c126b63ac9c831e ]---
     
  8. ektor5

    ektor5 Administrator Staff Member

    Joined:
    Jul 1, 2013
    Messages:
    97
    Likes Received:
    48
    dhylands,

    If you are planning to use UDOO in a server use, you should use a more server oriented operating system (Debian, Arch Linux, Ubuntu minimal, ecc..) without video output / GUI.

    Take a look here: http://www.udoo.org/forum/viewtopic.php?f=19&t=251
     
  9. dhylands

    dhylands New Member

    Joined:
    Nov 4, 2013
    Messages:
    6
    Likes Received:
    0
    I'm not quite sure how that's relevant to my issue, since the problem seems to be related to wifi, and not the display.
    I will tak e alook though - thanks for the link.

    If I use my UDOO, I expect I'll be using it for a robot. Using a headless distro is mostly what I use.

    Do any of the distro have recent (i.e. 3.12) kernels?
     
  10. DracoLlasa

    DracoLlasa UDOOer

    Joined:
    Oct 15, 2013
    Messages:
    419
    Likes Received:
    3
    i would suspect not

    i recall reading that the Freescale released kernel is 3.05 with 3.10 available as an alpha release
    But im not an expert on these things so maybe its possible to put 3.12 on there even though Freescale doesnt offer it

    (for those that arent aware, Freescale makes the i.MX6 series chips used in the UDOOs)
     
  11. Lifeboat_Jim

    Lifeboat_Jim New Member

    Joined:
    Sep 16, 2013
    Messages:
    399
    Likes Received:
    1
    3.0.35 is the basis, as that's what Freescale is basing their efforts on.
     
  12. DracoLlasa

    DracoLlasa UDOOer

    Joined:
    Oct 15, 2013
    Messages:
    419
    Likes Received:
    3
    There we go, i was close, was going from memory.
    I do recall there was an alpha release of a newer version and i thought it was 3.10 or 3.0.10 maybe?
     

Share This Page