Of course, this is still a first, limited proof of concept, and it lacks some of the components that are mandatory for a state-of-the-art voice assistant, such as elaborate NLP (Natural Language Processing), good ASR (Automatic Speech Recognition), and probably some neural network processing for deep learning…
Interesting: “Elivia”: towards an /e/ smart assistant?