“ This past week, I was asked to evaluate an LLM on a task. I sat before the chat window while it blinked a blank canvas. I was entertaining a sentience both of me and foreign to me, like entertaining a child. I tested the model on some basic questions then scanned the responses for any unexpected behaviour. I did this many times.
There I was, a human, combing for patterns generated by a thing whose job is to replicate patterns of human knowledge. Perhaps my hubris led me to assume that this technology was built to serve me, the developer. I google for ways to make this task more efficient. But sometimes a query comes up unanswered. This is a rare situation. And it can feel exciting when it happens. Alone, in front of the computer, you have somehow stumbled upon an edge.”
“ This past week, I was asked to evaluate an LLM on a task. I sat before the chat window while it blinked a blank canvas. I was entertaining a sentience both of me and foreign to me, like entertaining a child. I tested the model on some basic questions then scanned the responses for any unexpected behaviour. I did this many times.
There I was, a human, combing for patterns generated by a thing whose job is to replicate patterns of human knowledge. Perhaps my hubris led me to assume that this technology was built to serve me, the developer. I google for ways to make this task more efficient. But sometimes a query comes up unanswered. This is a rare situation. And it can feel exciting when it happens. Alone, in front of the computer, you have somehow stumbled upon an edge.”