This personnal project is an attempt to create a home automation controller for CAN/BLE devices I have at home.
This project is not longer maintained, I'm only making periodic updates to keep it up to date with Zephyr RTOS, but this is more a training task (to not get rusty) than actual maintenance. Particularily, I recently found memory issues when handling devices events.
Comments, questions, suggestions are welcome.
Currently supporting ZephyrRTOS v4.0.0 with following targets:
nucleo_f429zi
mps2_an385
qemu_x86
I also aim to build a custom board dedicated to this project.
Goal is to have a controller able to handle several IoT protocols like CAN, BLE (or Thread), Ethernet protocols, etc.. Goal is to gather all the data from the different devices and send them to a cloud platform (AWS IoT Core for now) or to a local server (REST, prometheus ..) for monitoring and/or control (like heating, lights, alarm etc..).
*** Booting Zephyr OS build zephyr-v3.0.0 ***
I: Starting bootloader
I: Primary image: magic=good, swap_type=0x1, copy_done=0x3, image_ok=0x1
I: Scratch: magic=bad, swap_type=0x1, copy_done=0x2, image_ok=0x2
I: Boot source: primary slot
I: Swap type: none
I: Bootloader chainload address offset: 0x20000
I: Jumping to the first image slot
[00:00:00.011,000] <dbg> disk: disk_access_register: disk interface(SD) registered
*** Booting Zephyr OS build v3.7.0-1-gbdddc83484ad ***
Starting Zephyr application...
[00:00:00.011,000] <inf> dfu: MCUBOOT version=1 IMAGE size=0x72a00 version=0.0.0+0 confirmed=1
[00:00:00.011,000] <wrn> dfu: Development image (version=0.0.0+0)
[00:00:00.031,000] <inf> sd: Maximum SD clock is under 25MHz, using clock of 24000000Hz
[00:00:00.036,000] <inf> app_fs: FS mounted /SD:
[00:00:00.037,000] <inf> creds_manager: Found 11 credentials in FLASH
[00:00:00.037,000] <dbg> netif: net_interface_init: [1] mac: 00:80:E1:77:77:77 iface: 0x20003910 up: 1
[00:00:00.037,000] <dbg> netif: net_interface_init: [2] mac: 00:00:5E:00:53:00 iface: 0x20003a30 up: 1
[00:00:00.330,000] <inf> bt_hci_core: Identity: F2:DF:DA:8E:AB:50 (random)
[00:00:00.330,000] <inf> bt_hci_core: HCI: version 5.4 (0x0d) revision 0x0000, manufacturer 0x05f1
[00:00:00.330,000] <inf> bt_hci_core: LMP: version 5.4 (0x0d) subver 0xffff
[00:00:00.330,000] <inf> ble: Bluetooth initialized 0
[00:00:01.511,000] <dbg> netif: net_event_handler: [face: 0x20003910] event: NET_EVENT_IF_UP (d0010002)
[00:00:01.511,000] <dbg> netif: net_event_handler: [face: 0x20003910] event: NET_EVENT_IPV4_DHCP_START (e0040007)
[00:00:01.513,000] <dbg> netif: net_event_handler: [face: 0x20003910] event: <unknown net event> (f1140007)
[00:00:01.517,000] <inf> net_dhcpv4: Received: 192.168.10.226
[00:00:01.517,000] <dbg> netif: net_event_handler: [face: 0x20003910] event: NET_EVENT_IPV4_ADDR_ADD (e0040001)
[00:00:01.517,000] <inf> netif: === NET interface 0x20003910 ===
[00:00:01.517,000] <inf> netif: Address: 192.168.10.226 [addr type NET_ADDR_DHCP]
[00:00:01.517,000] <inf> netif: Subnet: 255.255.255.0
[00:00:01.517,000] <inf> netif: Router: 192.168.10.1
[00:00:01.517,000] <inf> netif: DHCPv4 Lease time: 43000 seconds [state: bound]
[00:00:01.517,000] <dbg> netif: net_event_handler: [face: 0x20003910] event: NET_EVENT_IPV4_DHCP_BOUND (e0040008)
[00:00:01.631,000] <inf> net_time: SNTP time from fr.pool.ntp.org:123 = 1723568041, 1 thread(s) signaled
[00:00:01.634,000] <inf> cloud: State changed: STATE_INIT (0) -> STATE_RESOLVE_HOST (1)
[00:00:01.635,000] <inf> cloud_utils: Resolved a31gokdeokxhl8-ats.iot.eu-west-1.amazonaws.com -> 52.16.249.75
[00:00:01.635,000] <inf> cloud: State changed: STATE_RESOLVE_HOST (1) -> STATE_CONNECTING (3)
[00:00:03.265,000] <inf> net_mqtt: Connect completed
[00:00:03.366,000] <inf> cloud: State changed: STATE_CONNECTING (3) -> STATE_CONNECTED (4)
[00:00:05.209,000] <inf> ble_obv: [XIAOMI] mac: A4:C1:38:3C:D3:21 rssi: -71 bat: 2466 mV temp: 27 °C hum: 61 %
Local (Europe/Paris) Date and time : 2024/08/13 16:54:05
[00:00:09.431,000] <inf> caniot: [ 60d ] 01 C1-D0 Telemetry Response ep : ep-c ff ff 07 2b fe ff
[00:00:09.436,000] <inf> caniot: [ 64d ] 09 C1-D1 Telemetry Response ep : ep-c ff ff 07 2b fe ff
[00:00:09.446,000] <inf> caniot: [ 68d ] 17 C1-D2 Telemetry Response ep : ep-c ff ff 07 2b fe ff
[00:00:09.451,000] <inf> caniot: [ 6cd ] 25 C1-D3 Telemetry Response ep : ep-c ff ff 07 2b fe ff
[00:00:09.461,000] <inf> caniot: [ 70d ] 33 C1-D4 Telemetry Response ep : ep-c ff ff 07 26 fe ff
[00:00:09.511,000] <inf> caniot: [ 74d ] 41 C1-D5 Telemetry Response ep : ep-c ff ff 07 2b fe ff
[00:00:10.205,000] <inf> ble_obv: [XIAOMI] mac: A4:C1:38:3C:D3:21 rssi: -74 bat: 2468 mV temp: 27 °C hum: 61 %
[00:00:12.700,000] <inf> ble_obv: [XIAOMI] mac: A4:C1:38:3C:D3:21 rssi: -74 bat: 2468 mV temp: 27 °C hum: 61 %
[00:00:17.698,000] <inf> ble_obv: [XIAOMI] mac: A4:C1:38:3C:D3:21 rssi: -63 bat: 2467 mV temp: 27 °C hum: 61 %
Documentation is available in the docs
folder for following topics:
- Bluetooth (HCI host/controller)
- HTTP server (REST, Prometheus, file server, webserver)
- Credentials (AWS IoT, HTTPS server, etc.)
- UDP Discovery Server
- User IO (LEDs, buttons, etc.)
- Cloud AWS
- Home Automation (HA) - Devices and events model
- USB CDC ACM / ECM (networking)
- Controller Schematic/PCB wiring draft
- Configuration
- CANIOT
- Enabling/disabling features
- CAN configuration (CANIOT)
- BLE configuration
- AWS client
- LUA
- ...
In order to build this project you need the Zephyr RTOS toolchain. Please refer to the Zephyr RTOS Getting Started Guide.
I advice to use a Linux machine for this project, Zephyr SDK should now be available for Windows but I didn't try it yet. I personally use a Fedora 36 virtual machine.
This project is entirely compliant with the Zephyr RTOS tooling like west
,
the Zephyr SDK
, Zephyr net-tools
(twister
in the future).
Let's suppose you are in your home directory : cd ~
The project contains a west.yml file, you can clone the project using west
:
west init -m git@github.com:lucasdietrich/zephyr-caniot-controller.git zephyr-caniot-workspace
cd zephyr-caniot-workspace
west update
Finally initialize application submodules (should be enhanced in the future):
cd zephyr-caniot-controller
git submodule update --init --recursive
Once you have created the workspace you should have the following file structure:
[lucas@fedora zephyr-caniot-workspace]$ tree -L 1
.
├── bootloader
├── modules
├── zephyr-caniot-controller
├── tools
└── zephyr
I also advice to create a python virtual environment in your workspace:
cd ~/zephyr-caniot-workspace
python3 -m venv .venv
source .venv/bin/activate
Then install Zephyr RTOS and application requirements: pip install -r zephyr/scripts/requirements.txt pip install -r zephyr-caniot-controller/requirements.txt
- Flash debug version of mcuboot with
flash_bootloader
- Prepare credentials (if not already done)
- Generate HTTP key and certificate with
creds/https_server/gen_rsa_ca.sh
- Convert AWS device key and certificate with
creds/AWS/aws-to-der.sh
- Update
creds/creds.json
consequently
- Build the project with
west build -b nucleo_f429zi
- Flash the project with
west flash --runner=openocd
Once you have cloned the project, go to the application directory cd zephyr-caniot-controller
:
Configuration is the result of several files:
- Root prj.conf file contains the configuration common to all boards: Following files contain the configuration specific to each board.
- Finally overlays/ contains specific configuration to enable features.
If you want to enable a feature, you need to add the corresponding overlay file
to the OVERLAY_CONFIG
cmake variable, as follows, for further details please refer
to this Makefile.
west build -b nucleo_f429zi -- -DOVERLAY_CONFIG="overlays/nucleo_f429zi_shell.conf"
You may want to enable a specific feature, to do so, set -DOVERLAY_CONFIG
If you're using features that requires credentials (like HTTPS server of AWS IoT), please refer to the docs/credentials.md documentation.
If you want to build for QEMU targets, take a look to CONFIG_QEMU_ICOUNT
and
related configuration options.
You will need to flash the MCUBoot bootloader to be able to run the application
on a real board. You can find the bootloader in the bins
directory:
You can flash it using make flash_bootloader
, from the project root directory.
Caution: The command erases the whole flash memory.
Note: In case you want to sign and encrypt your own applications, you will need to build and flash a custom bootloader.
More details are available here: docs/mcuboot.md.
To build the project, make sure you have activated the Python virtual environment
with source ../.venv/bin/activate
, then run the following command (for nucleo_f429zi
):
west build -b nucleo_f429zi
For mps2_an385
:
west build -b mps2_an385
For qemu_x86
:
west build -b qemu_x86
If compilation is successful, you should get following output, for nucleo_f429zi
:
Memory region Used Size Region Size %age Used
FLASH: 502356 B 1 MB 47.91%
SRAM: 125878 B 192 KB 64.02%
CCM: 64 KB 64 KB 100.00%
IDT_LIST: 0 GB 2 KB 0.00%
In case of error, please refer to the troubleshooting section below.
With west simply flash with west flash
If you want to run the application on QEMU with networking, more configuration is required.
- Clone
git clone zephyrproject-rtos/net-tools
in your zephyr workspace (zedpyr-caniot-workspace
). - Build the tools with
cd net-tools && make
The tools are now available to the scripts located in scripts/
qemu_x86
uses SLIP TAP networking.
The script scripts/prepareqemunet.sh prepare the networking configuration for QEMU. But please refer to the official documentation for further details : Networking with QEMU
So open a terminal and launch the script (not as sudo):
./scripts/prepareqemunet.sh
You should see the created tap0
interface
[lucas@fedora zephyr-caniot-controller]$ ifconfig
tap0: flags=4163<UP,BROADCAST,RUNNING,MULTICAST> mtu 1500
inet 192.0.2.2 netmask 255.255.255.0 broadcast 0.0.0.0
inet6 2001:db8::2 prefixlen 64 scopeid 0x0<global>
inet6 fe80::c894:12ff:fefa:2cdf prefixlen 64 scopeid 0x20<link>
ether ca:94:12:fa:2c:df txqueuelen 1000 (Ethernet)
RX packets 159 bytes 15931 (15.5 KiB)
RX errors 0 dropped 0 overruns 0 frame 0
TX packets 143 bytes 29149 (28.4 KiB)
TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0
In this guide we are using the TAP interface but there are other options available as described here Networking with the host system.
Then you can run the application (built for qemu_x86
) :
west build -t run
Finally if you want to expose your QEMU HTTP server, you may configure and run the script scripts/forward_webserver_ports.sh :
./scripts/forward_webserver_ports.sh
mps2_an385
supports both slip
and real lan9220
hardware.
- If using
lan9220
hardware, run the scriptsudo ../net-tools/net-setup.sh
(sudo
) to start the interface. - If using
slip
interface, as forqemu_x86
, run./scripts/prepareqemunet.sh
Then run your application with west build -t run
west build -t run
Debug can be launch on QEMU targets using ninja debugserver -C build
command
Debug of the nucleo_f429zi board is done using the ST-Link V2 debugger with OpenOCD.
Use one of the the configurations Launch cortex-debug nucleo_429zi
and Attach cortex-debug nucleo_429zi
available in the launch.json file.
If OpenOCD configuration doesn't work try the configuration BACKUP cortex-debug STutil
, but
I discourage using it and you will need to install ST-Link utilities.
- Select the configuration
- Then press
F5
Debug of the mps2_an385 board is done using the gdbserver provided by QEMU.
- Run
ninja debugserver -C build
- Then use configuration
(gdb) mps2_an385
. - Then press
F5
Debug of the qemu_x86 board is done using the gdbserver provided by QEMU.
With EXPERIMENTAL (gdb) QEMU ARM
and LEGACY (gdb) QEMU ARM
TODO
You can launch debug on any target using the ninja debugserver -C build
command
with BACKUP (gdb) Agnostic
configuration. But I advice to use the configuration
specific to the target.
To see network traffic generated by the application in QEMU, use one of the following commands:
- If you are on a windows host, with your QEMU instance running on a Linux guest, run the following command in a git bash terminal:
Interface to monitor is zeth
(with lan9220
for example):
ssh lucas@fedora sudo tcpdump -U -s0 'not port 22' -i zeth -w - | "C:\Program Files\Wireshark\Wireshark.exe" -k -i -
Interface to monitor is tap0
:
ssh lucas@fedora sudo tcpdump -U -s0 'not port 22' -i tap0 -w - | "C:\Program Files\Wireshark\Wireshark.exe" -k -i -
If you are on a Linux host, with your QEMU instance running on a Linux guest, use the following command :
ssh lucas@fedora sudo tcpdump -U -s0 'not port 22' -i tap0 -w - | wireshark -k -i -
If your wireshark and QEMU are on the same machine (zeth
instead of tap0
if not using slip):
sudo tcpdump -U -s0 'not port 22' -i tap0 -w - | wireshark -k -i -
Progress if not percentage :
P
for plannedE
for evaluated (soon to be implemented)T
for testingD
for doneA
for abandoned
Priority: grade from 1 to 10.
Module | Feature | Progress | Priority | Issues |
---|---|---|---|---|
HTTP Server | ||||
Concurrent connections | D | |||
Keep-alive support | D | |||
Static files | 15% | |||
HTTPS | D | |||
HTTPS Client Authentication | 0% | |||
Chunked encoding (request) | D | |||
Chunked encoding (response) | D | |||
Webserver | 0% | 5 | ||
REST server | D | |||
Prometheus metrics client | D | |||
File download | 50% | |||
Multipart parser (for files) | 0% | 5 | ||
BLE | ||||
HCI Interface | E | |||
GATT Server (phone application) | P | 10 | ||
GATT Client | P | |||
Xiaomi data collector | E | |||
CAN | ||||
CAN Interface | D | |||
CANIOT (class0) | T | |||
CANIOT (class1) | 25% | 10 | ||
CANIOT (telemetry) | T | 10 | ||
CANIOT (command) | 0% | 10 | ||
LUA | A | |||
Lua interpreter | DA | |||
Lua orchestrator | 10% A | |||
OS Module for LUA | 0% A | |||
Devices module for LUA | 0% A | |||
REST API | 25% A | |||
HA | P | |||
CLOUD | ||||
AWS IoT (publish) | T | |||
AWS IoT (remote control) | 0% P | |||
AWS Iot (remote config - device shadow) | 0% | |||
FS | ||||
SD SPI FAT32 | D | |||
SD MCC FAT32 | 0% P | |||
SD card FAT32 formatting | 0% P | |||
NVS FLASH | D | |||
Discovery server | ||||
UDP Discovery server | 80% P | |||
User IO | ||||
LEDs | E | |||
Buttons | E | |||
Stats | ||||
Heap memory usage | P | |||
Mbedtls memory usage | P | |||
HA stats | P | |||
CAN stats | P | |||
NET stats | P | |||
BLE stats | P | |||
Thread/matter | P |
Current footprint nucleo_f429zi
build (debug):
Memory region Used Size Region Size %age Used
FLASH: 498428 B 1 MB 47.53%
SRAM: 119286 B 192 KB 60.67%
CCM: 64 KB 64 KB 100.00%
IDT_LIST: 0 GB 2 KB 0.00%
HTTP upload of a LUA script to SD card then execute.
somecalc1.lua
1661031779
0.505708
[00:01:32.512,000] <inf> http_server: (4) Connection accepted from 192.168.10.216:54476, cli sock = 7
[00:01:32.514,000] <inf> files_server: Filepath: /SD:/lua/somecalc1.lua
[00:01:32.527,000] <inf> files_server: File /SD:/lua/somecalc1.lua upload succeeded [size = 389]
[00:01:32.528,000] <inf> http_server: (7) Req /files [payload 389 B] returned B resp status 200 [payload 16 B] (keep-alive=0)
[00:01:32.544,000] <inf> http_server: (7) Connection closed by peer
[00:01:32.544,000] <inf> http_server: (7) Closing sock conn 0x20003e54
[00:01:32.545,000] <inf> http_server: (4) Connection accepted from 192.168.10.216:54486, cli sock = 7
[00:01:32.557,000] <dbg> lua: lua_orch_script_handler: (0x20010bb4) Executing script ...
[00:01:32.595,000] <dbg> lua: lua_orch_script_handler: (0x20010bb4) Script returned res=0...
[00:01:32.595,000] <inf> lua: (0x20010bb4) Script returned 0 (LUA_OK)
[00:01:32.596,000] <inf> http_server: (7) Req /lua/execute [payload 0 B] returned B resp status 200 [payload 45 B] (keep-alive=0)
[00:01:32.598,000] <inf> http_server: (7) Connection closed by peer
[00:01:32.598,000] <inf> http_server: (7) Closing sock conn 0x20003e98
- Install clang-format, and run
make format
to format all files - Or install
xaver.clang-format
extension for VS Code, and format on save or format usingCtrl + K, Ctrl + F
- MAJOR: SD FAT FS internal state gets corrupted when doing many requests in a short time.
- minor: Fix
-Waddress-of-packed-member
duringxiaomi_dataframe_t
handling - CANIOT send command with small timeout (100ms) always fails
- ISR seems to be called very late
- Set
CONFIG_CAN_LOG_LEVEL_DBG=y
- FS Access (upload/download) -> do a stress test
- Reduce MBEDTLS memory usage by 1/3
- Remove dependencies on CANIOT_LIB, CANIOT_CONTROLLER, HA, so that the code can be compiled without them.
- Add general diagnostic messages
- Look at zephyr Settings Subsystem and this sample: samples/subsys/settings/src/main.c
- Fix use of lan9220 ethernet driver with mps2_an385
- Investigate crash for emulated devices in QEMU:
ASSERTION FAIL [atomic_get(&ev->ref_count) == (atomic_val_t)0] @ WEST_TOPDIR/zephyr-caniot-controller/src/ha/devices.c:867
[00:07:59.400,000] <err> ha_dev: k_fifo_alloc_put() error -12
[00:07:59.400,000] <err> ha_dev: Failed to notify event 0x20093868 to 0x20093988, err=-12
- Move several buffers to CCM to keep space in SRAM for Newlib heap (for LUA)
Allow to embed LUA script when building fornucleo_f429zi
- Cannot uses 2 sockets chrome with keepalive
- Use docs/ram_report.txt to optimize memory usage
- DNS buffers
- Bit of C++ ?
- HTTP:
- Make requests processing completely assynchronous (with concurrent requests)
- Using workqueue to process requests ?
- Allow to stream response -> helps to decrease http buffer size
- Make requests processing completely assynchronous (with concurrent requests)
Draft LUA scripts orchestrator- REST: list files in filesystem
- HTTP: allow to download files from the filesystem
- Allow to store few scripts in the SoC ROM
- Test keepalive feature
- Allow authentication using "username:password" and x509 certificates
- Find a way to be able to print numbers from LUA with CONFIG_NEWLIB_LIBC_NANO enabled (for ARM)
- Find a way to redirect stdout and stderr to a file or to logging system
- Check
zephyr/lib/libc/newlib/libc-hooks.c
- Check
CONFIG_NEWLIB_LIBC_MIN_REQUIRED_HEAP_SIZE
seems to have no effect- Create a totally agnostic API for devices, allowing to add many kind of MAC addr abstraction layers.
-
For an unkown reason yet, make sure to have docker disabled when using QEMU with networking. To disable it run:
sudo systemctl disable --now docker.service sudo reboot
- Zephyr Documentation - West Manifests
- TODO Add all the ressources !