Feedback from customers and potential customers pointed to a gap in the market for an LLM-capable chip in the 2- to 2.5-W power range, Danon said.

“This isn’t achievable with any other device,” he said. “At the edge, the majority of people are looking to run workloads between 1 and 3 billion parameters. This is the popular configuration from a performance perspective, from a memory capacity perspective, and also from a cost perspective.”

Orr Danon (Source: Hailo)