New Benchmark Tests AI on Pure Logic, Not Pattern Matching