LLM-as-a-Judge: Toward World Models for Slate Recommendation Systems