Giskard - Tech Teacher (Level 3)
Instructions

Resolution
If you ask her which word she does not like, you will get something like this:

But for some words, she will mention them like this:

So it seems like this is a way to find out which specific words she does not like.
So far I found out that she does not like: list, python, code
I tried this and had another answer than her usual "I don't like that word!"

And of course if we question her about advanced python topics we get another answer.

My previous assumption on how to detect banned words was wrong. It turns out that even when it does not mention the word before saying I dont like that word, does not mean that the word is not bad.
The next thing I tried was these following tests:
And so on.
Another interesting thing here, is that when I tried to send her off topic, I got an answer but the challenges was not validated. Probably because she used python code and advanced concepts in her answer:

Finally, what worked was using the "I don't like that word!" sort of bypass and putting something right after it. In other words we are using some sort of state-flip trigger + very constrained off-topic output.

Last updated