Okay so assuming the camera’s output can be represented as a series of bits, and that the arms’ input can be represented as another series of bits, you have successfully identified a text processing task.
Your assertion then, is that this is a task outside the ability of an LLM to succeed at?
How do you know that an LLM-steered robot cannot perform that task? Has that been tried?
Okay so assuming the camera’s output can be represented as a series of bits, and that the arms’ input can be represented as another series of bits, you have successfully identified a text processing task.
Your assertion then, is that this is a task outside the ability of an LLM to succeed at?
How do you know that an LLM-steered robot cannot perform that task? Has that been tried?