New 3D benchmark leaves AI in knots