Hello DevZone,
I'm developing a battery-powered Zigbee Sleepy End Device, and I have a randomly "loose network connection" issue with my device. By randomly I mean that the same device can work for 2 weeks in the test and in the next test with the "same" testing environment it will lose network in one day. And by "loose network connection" I mean that device messages can be seen by ZC but commands from ZC to ED are sent but not received. ZC still think that ED is in the network and ED thinks that it is in the same network but something in their communication is wrong.
Hardware:
- My device is based on HOLYIOT-18010 module(nRF52840 in it).
- I use Home Assistant with Zigbee2MQTT addon as my Zigbee network coordinator.
- ZC is the SONOFF USB dongle.
Software:
- SEGGER IDE
- SDK 4.2 for Thread and Zigbee
- Wireshark and ZBOSS Sniffer with TI CC2531 USB Dongle
-------------------
Test Setup:
I started ZBOSS Sniffer to write data to the file and connected 3 version of the tested device to the Home Assistant.
Three device are:
- My device battery powered(like normal)
- My second device battery powered(like normal)
- Nordic development board PCA10056 with same firmware version as my device(to check if the problem maybe in the HOLYIOT module)
-----------------
Test results:
After 17 hours I checked all device if they are still working and one of them was not. In the Home Assistant I check reporting from the device(when device sends attribute change) and sending command from HA to the device(when HA sends attribute change to the device), reporting was okish(HA receives about 50% of them) but command send is failed (HA sended 30 commands and device didn't responded once).
In HA device still in the network and after I connected device to the SEGGER with debugger I saw that ZB_JOINED is true and network PAN is correct.
-----------------
Logs:
In the logs file three devices addresses are:
- My device battery powered - 0x6EF0 - this device has "lost network connection"
- My second device battery powered - 0xC28A
- Nordic dev board - 0x0445
I saw on DevZone issues about missing network key change in SED or not responding to any messages SED, so I started to look for those.
Nordic dev board 0x0445 communicated to HA and received messages from it until I stopped the logs.
Second device 0xC28A communicated to HA and received messages from it until I stopped the logs.
And First device 0x6EF0 (that lost network connection) stoped responding to HA messages after network rejoin.
HA still think that first device(0x6EF0) is in the network and sends find route request.
The first device still send attribute reporting and OTA request to HA.
They are both in the same network have same Network security Key but don't hear each other.
I can't understand what causes this issue and I need to know where to search next.
Please suggest what this issue can be and how it can be fixed. This is the last major issue with the device until the device release.
Thanks in advance.