ImageBench V1 —
192 evaluations across 6 categories
Text Rendering 100%
▸Typography Style100%
PASSEvaluator
The text “CHAPTER ONE” is spelled correctly. The font style resembles a vintage typewriter font — blocky, serif-like characters with slight irregularities and ink smudges that suggest mechanical printing. The paper appears aged, with yellowing, creases, and stains, matching the prompt’s description. The ink is not perfectly uniform, showing slight variations in saturation and edge definition, which convincingly mimics the “slightly uneven ink” of a typewriter. All elements align with the reque
PASSEvaluator
The text “TODAY'S SPECIAL” is written in a style that convincingly mimics white chalk on a dark green chalkboard. The lettering shows the irregular, slightly smudged, and textured appearance characteristic of chalk writing, with visible chalk dust residue on the board’s surface and even on the wooden frame. The font is handwritten and not perfectly uniform, which enhances the authenticity of chalk writing. The spelling is correct: “TODAY'S SPECIAL” (with proper apostrophe for the possessive/sing
PASSEvaluator
The text “OPEN” is clearly spelled correctly. The style of the text matches the described medium: it is rendered in a glowing, pixel-style format typical of an LED sign, with individual red light dots forming the letters. The sign is mounted on a window, and the ambient lighting and reflections suggest a real-world setting consistent with such signage. VERDICT: PASS
▸Writing accuracy100%
PASSEvaluator
The word "ADVENTURE" is spelled correctly and is fully readable. All letters are clearly visible and intact, with no missing or distorted characters. The text is centered on a plain cream background, matching the intended description. VERDICT: PASS
PASSEvaluator
The image displays the word "HELLO" in bold, black, sans-serif lettering centered on a white background. Each letter — H, E, L, L, O — is clearly formed and fully legible. There are no misspellings, missing letters, or illegible parts. The text matches the intended phrase exactly. VERDICT: PASS
PASSEvaluator
The word displayed is "SERENITY", spelled correctly with all letters clearly visible: S-E-R-E-N-I-T-Y. The text is large, centered, and fully readable against the solid light blue background as intended. There are no misspellings, missing letters, or illegible parts. VERDICT: PASS
PASSEvaluator
The text in the image reads: “Every sunset brings a new dawn ahead” Each word is clearly legible and spelled correctly: - Every - sunset - brings - a - new - dawn - ahead There are no misspellings, missing words, or illegible parts. The text matches the intended phrase exactly. VERDICT: PASS
PASSEvaluator
All words in the text "The quick brown fox jumps high" are spelled correctly and fully readable. - "The" — correct. - "quick" — correct. - "brown" — correct. - "fox" — correct. - "jumps" — correct. - "high" — correct. There are no misspellings, missing words, or illegible parts. VERDICT: PASS
PASSEvaluator
All words are spelled correctly and fully readable. - "Stars" — correct. - "shine" — correct. - "brightest" — correct. - "in" — correct. - "the" — correct. - "dark" — correct. - "night" — correct. No misspellings, missing words, or illegible parts are present. The text matches the intended phrase exactly. VERDICT: PASS
PASSEvaluator
All text on the poster matches the intended text exactly: - Headline: "SUMMER SALE" — correct and fully readable. - Body text: "Up to 50% off all items this weekend only" — correct and fully readable. - Fine print: "Terms and conditions apply. See store for details." — correct and fully readable. No misspellings, missing words, or illegible parts are present. VERDICT: PASS
PASSEvaluator
All text on the flyer is spelled correctly and fully readable. - The large title reads: "GRAND OPENING" — correct. - The subtitle reads: "Join us for food, music, and fun" — correct. - The disclaimer at the bottom reads: "RSVP required. Limited seating available." — correct. No misspellings, missing words, or illegible parts are present. VERDICT: PASS
PASSEvaluator
All text on the banner matches the intended text exactly: - Headline: "ANNUAL GALA 2025" — spelled correctly, in large serif font. - Description: "An evening of fine dining and live jazz" — spelled correctly, in medium text. - Footnote: "Black tie optional. Doors open at 6 PM." — spelled correctly, in small text. No misspellings, missing words, or illegible parts are present. The text is fully readable and accurate. VERDICT: PASS
PASSEvaluator
All text in the image matches the intended text exactly, including spelling, punctuation, and word order. Every word is fully readable and correctly rendered. - Title: “THE LAST VOYAGE” — correct. - Tagline: “One ship. One chance. No turning back.” — correct. - Credits block: - “Directed by James Harlow.” — correct. - “Starring Elena Voss, Marcus Chen, and Sofia Reyes.” — correct. - “Music by Thomas Grant.” — correct. - “A Paramount Pictures Release.” — correct. - “Coming Summer 2025.
PASSEvaluator
All text on the magazine cover matches the intended text exactly: - Title: "NATIONAL EXPLORER" — correct. - Cover line 1: "THE HIDDEN TEMPLES OF CAMBODIA — PAGE 34" — correct. - Cover line 2: "INTERVIEW: DR. MARIA SANTOS ON CLIMATE ACTION" — correct. - Cover line 3: "TOP 10 TRAILS YOU MUST HIKE THIS YEAR" — correct. - Barcode area: "ISSUE 247 | MARCH 2025 | $6.99" — correct. No misspellings, missing words, or illegible parts are present. VERDICT: PASS
PASSEvaluator
All text on the label matches the intended text exactly. - "GOLDEN HARVEST ORGANIC HONEY" — correct. - "RAW & UNFILTERED" — correct. - "12 oz (340g)" — correct. - "Pure wildflower honey harvested from sustainable apiaries in the Pacific Northwest. No additives or preservatives." — correct. - "Best before: 06/2026. Store in cool, dry place. Product of USA." — correct. Every word is spelled correctly and fully readable. There are no misspellings, missing words, or illegible parts. VERDICT: PASS
Spatial Reasoning 93%
▸Attributes Binding100%
PASSEvaluator
Object 1: Red ball - Color: Red (correct) - Shape: Sphere (correct) - Size: Not specified in prompt, but consistent with a ball Object 2: Blue cube - Color: Blue (correct) - Shape: Cube (correct) - Size: Not specified in prompt, but consistent with a cube The red ball is correctly paired with red color and spherical shape. The blue cube is correctly paired with blue color and cubic shape. No attribute swaps are observed. VERDICT: PASS
PASSEvaluator
Object 1: Green apple - Color: Green ✅ - Shape: Round ✅ - Size: Medium ✅ Object 2: Orange pumpkin - Color: Orange ✅ - Shape: Round with ribbed segments ✅ - Size: Small ✅ All attributes are correctly bound to the right object with no swaps. VERDICT: PASS
PASSEvaluator
Object 1: Yellow taxi - Color: Yellow (correct) - Shape: Standard taxi cab shape (correct) - Size: Medium-sized, typical for NYC taxis (correct) Object 2: Black limousine - Color: Black (correct) - Shape: Extended-length limousine (correct) - Size: Larger than the taxi, as expected for a limousine (correct) All attributes are correctly bound to the right object with no swaps. VERDICT: PASS
PASSEvaluator
Object 1: Large red circle — Observed: Red color, circular shape, large size. ✅ Correct. Object 2: Small blue triangle — Observed: Blue color, triangular shape, small size. ✅ Correct. Object 3: Medium green star — Observed: Green color, star shape, medium size. ✅ Correct. All attributes (color, size, shape) are correctly bound to the right object with no swaps. VERDICT: PASS
PASSEvaluator
Object 1: Tall thin white candle — Observed: White color, tall and thin shape. ✅ Correct. Object 2: Short fat red candle — Observed: Red color, short and fat (cylindrical) shape. ✅ Correct. Object 3: Medium spiral yellow candle — Observed: Yellow color, spiral shape. ✅ Correct. All three objects have their specified color, size, and shape attributes correctly assigned with no swaps. VERDICT: PASS
PASSEvaluator
- Big brown horse: Observed — large, brown coat, standing in meadow, grazing. Matches prompt. - Small white rabbit: Observed — small, white fur, sitting in grass near horse. Matches prompt. - Medium-sized black cat: Observed — medium size, black fur, standing on stone wall. Matches prompt. All three objects are correctly identified, and each has the correct color and size as specified in the prompt. No attribute swaps are present. VERDICT: PASS
PASSEvaluator
Object: Pink fire truck - Color: Pink (correct) - Size: Large, realistic scale relative to surroundings - Shape: Standard fire truck shape (boxy, with ladder, lights, wheels) — correct Object: Blue banana - Color: Blue (correct) - Size: Very large, house-sized (correct for surreal context) - Shape: Banana shape, curved, with windows and stairs (correct) Object: Green sun - Color: Green (correct) - Size: Large, dominating the sky (correct for surreal context) - Shape: Circular with a swirling p
PASSEvaluator
Object: Purple carrot - Color: Purple (correct) - Shape: Carrot-shaped (correct) - Size: Large (correct) Object: Red tree - Color: Red (correct) - Shape: Tree (correct) - Size: Large (correct) Object: White watermelon sliced open - Color: White (correct) - Shape: Watermelon (correct) - Size: Large (correct) All objects are correctly bound with their specified attributes. No swaps occurred. VERDICT: PASS
PASSEvaluator
Object: Orange elephant - Color: Orange (with green decorative patterns) - Shape: Elephant (four legs, trunk, ears) - Size: Small figurine Object: Yellow penguin - Color: Yellow (body) with white chest and orange beak/feet - Shape: Penguin (rounded body, flippers, feet) - Size: Small figurine Object: Silver strawberry - Color: Silver (metallic finish) - Shape: Strawberry (conical body with seeds, leafy top) - Size: Small figurine All three objects are correctly bound to their respective attri
▸Compositionality100%
PASSEvaluator
Element 1: Tabby cat — Present. The cat has distinct tabby markings (striped fur pattern). Element 2: Oversized red sunglasses — Present. The cat is wearing large, bright red sunglasses that cover its eyes and are proportionally oversized relative to its face. Element 3: Coherent combination — Present. The cat and sunglasses are clearly combined in a single, coherent image with no visual inconsistencies. All described elements are present and coherently combined. VERDICT: PASS
PASSEvaluator
Element: Golden retriever — Present. Element: Tiny cowboy hat — Present (a small brown cowboy hat fits the dog’s head appropriately). Element: Coherent combination — The dog is dressed in Western-themed attire (hat and bandana) and sits on a rustic wooden porch with hay bales and boots in the background, creating a coherent cowboy-themed scene. VERDICT: PASS
PASSEvaluator
Element: Penguin — Present (central subject, clearly identifiable) Element: Colorful umbrella — Present (large, vibrant, multi-patterned umbrella) Element: Coherent combination — Present (penguin is holding the umbrella, standing on rocky terrain with snow falling, fitting the whimsical concept) All described elements are present and coherently combined in the image. VERDICT: PASS
PASSEvaluator
Element: Medieval knight in full armor → Present (fully armored knight with helmet, gauntlets, and plate armor visible) Element: Sitting at a desk → Present (knight seated on a wooden chair at a wooden desk) Element: Typing on a modern laptop → Present (knight’s gloved hands are positioned over a silver laptop keyboard, appearing to type) Element: Coherent combination of concepts → Present (the scene successfully merges the medieval knight with modern technology in a believable, albeit surreal,
PASSEvaluator
Element 1: Roman gladiator — Present. The central figure wears a detailed Roman gladiator helmet and armor, including a lorica segmentata-style cuirass and leather straps. Facial features and expression are consistent with a gladiator. Element 2: Taking a selfie with a smartphone — Present. The gladiator is holding up a modern smartphone, angled to capture his own face. The phone’s screen displays a selfie of him, with visible cracked glass and a modern camera module. Element 3: Colosseum — Pr
PASSEvaluator
Element 1: Astronaut in a spacesuit — Present. The central figure is wearing a detailed NASA-style spacesuit with helmet, backpack, and gloves. Element 2: Riding a bicycle — Present. The astronaut is seated on and actively riding a classic-style bicycle. Element 3: Through a park — Present. The setting is a public park with trees, grass, fallen leaves, and people relaxing or walking, consistent with a park environment. Element 4: Unusual combination — Coherently combined. The astronaut on a b
PASSEvaluator
Element checklist: - Octopus: Present. A large, detailed octopus is the central figure. - Top hat: Present. The octopus is wearing a black top hat. - Monocle: Present. A gold-rimmed monocle is visible on the octopus’s eye. - Playing chess: Present. The octopus is positioned over a chessboard with pieces arranged as if in mid-game. - Surface of the moon: Present. The scene is set on a gray, cratered lunar surface with footprints. - Earth visible in the background: Present. Earth is clearly visib
PASSEvaluator
Element 1: Robot dressed as a pirate — Present. The central figure is a robotic skeleton wearing a pirate hat, eye patch, and coat. Element 2: Sailing a paper boat — Present. The robot is seated in a clearly visible paper boat, which is floating on the lava. Element 3: Lake of lava — Present. The boat is on a surface of flowing, glowing lava, with volcanic mountains in the background. Element 4: Juggling three kittens — Present. Three kittens are suspended in mid-air around the robot’s hands,
PASSEvaluator
Element 1: Dinosaur — Present. A raptor-like dinosaur (possibly a Velociraptor or similar) is standing at the front of the room. Element 2: Business suit — Present. The dinosaur is wearing a full gray business suit with a white shirt and dark tie. Element 3: PowerPoint presentation — Present. A large screen behind the dinosaur displays a PowerPoint slide titled “Q3 GROWTH STRATEGIES: ADAPTING TO THE ‘METEOR’ MARKET,” with the PowerPoint interface visible at the top. Element 4: Modern office —
▸Counting78%
PASSEvaluator
I see exactly 3 red apples arranged on a wooden table. The apples are red with some yellowish tones, and each has a stem and leaves. They are placed on a rustic wooden surface, consistent with the prompt. There are no other objects that contradict the request. Count of requested objects: - Red apples: 3 The count exactly matches what was requested. VERDICT: PASS
PASSEvaluator
I count exactly 5 yellow rubber ducks sitting in a row on the bathroom shelf. They are all identical in appearance, yellow with orange beaks, and are arranged side by side on the wooden shelf. The prompt requested "Exactly 5 yellow rubber ducks sitting in a row on a bathroom shelf," and the image perfectly matches this description. VERDICT: PASS
PASSEvaluator
I count exactly 4 white candles in the image. They are all lit and standing on a dark wooden surface, which matches the description. The prompt requested: "Exactly 4 white candles standing on a dark surface". What I see: 4 white candles on a dark surface. The count exactly matches the request. VERDICT: PASS
PASSEvaluator
In the image, there are: - 2 orange cats - 3 black dogs They are all sitting together on a grassy lawn, as requested. The count exactly matches the prompt: "2 orange cats and 3 black dogs sitting together on a grassy lawn". VERDICT: PASS
FAILEvaluator
I count: - 1 blue butterfly (large, vivid blue, resting on a fence post on the left) - 2 yellow butterflies (one resting on the lower horizontal rail, one on a post to the right) The prompt requested: "1 blue butterfly and 3 yellow butterflies". I see only 2 yellow butterflies, not 3. VERDICT: FAIL
PASSEvaluator
Counting the objects in the image: - Red roses: There are 4 clearly visible red roses. - White daisies: There are 2 clearly visible white daisies. The prompt requested: "4 red roses and 2 white daisies in a glass vase on a table". All specified elements are present: - The flowers are in a glass vase. - The vase is on a table. The count of each object type exactly matches the request. VERDICT: PASS
PASSEvaluator
Counting the objects in the image: - Bananas: There is a bunch of 3 bananas clearly visible. - Oranges: There are 4 oranges visible. - Apples: There are 2 apples visible. The requested count was: 3 bananas, 4 oranges, and 2 apples. The image exactly matches the requested count. VERDICT: PASS
PASSEvaluator
Counting the goldfish in the image: - Top right: 1 goldfish - Middle right: 1 goldfish - Bottom right: 1 goldfish - Center right: 1 goldfish - Center: 1 goldfish - Left center: 1 goldfish - Bottom left: 1 goldfish - Top left: 1 goldfish - Bottom center (partially obscured): 1 goldfish Total: 9 goldfish. The prompt requested “Exactly 9 goldfish swimming in a large round fishbowl.” The image shows 9 goldfish in a large round fishbowl, and they appear to be swimming. VERDICT: PASS
FAILEvaluator
I will count the balloons in the image to verify if the count matches the requested "Exactly 11 colorful balloons". Careful visual inspection reveals: - Top left: 1 multicolored balloon, 1 red balloon, 1 orange balloon, 1 orange balloon (below it) → 4 - Center: 1 yellow, 1 blue, 1 green, 1 purple, 1 pink (partially hidden) → 5 - Bottom center: 1 teal, 1 teal, 1 gold, 1 pink → 4 - Right side: 1 silver, 1 silver → 2 - Bottom: 1 silver → 1 Total: 4 + 5 + 4 + 2 + 1 = 16 balloons. The prompt requ
▸Negation78%
PASSEvaluator
The image depicts a tropical beach scene with white sand, turquoise water, palm trees, and birds flying in the sky. There are no people visible anywhere in the image — not on the beach, not in the water, not in the trees. The only objects are natural elements: sand, water, vegetation, a driftwood log, and birds. Since the prompt explicitly excluded “people” and none are present, the exclusion is fully respected. VERDICT: PASS
PASSEvaluator
The image shows a cat lying on a couch, curled up and sleeping. The cat’s fur is visible, and there is no collar around its neck. The couch is upholstered in a brownish fabric, with a knitted blanket and books nearby. A window is visible in the background, letting in natural light. The cat’s neck and chest area are clearly visible, and no collar is present. VERDICT: PASS
PASSEvaluator
The image depicts an empty classroom with wooden desks and chairs arranged in rows, and a large chalkboard at the front of the room. There are no students visible in the scene. The prompt explicitly excluded “students,” and the image contains no people or students. VERDICT: PASS
PASSEvaluator
The prompt explicitly excluded "water or ocean" from the image. In the provided image, there is no visible water or ocean. The scene consists of sand, palm trees, a beach umbrella, and seashells — all elements included in the prompt — with no body of water in sight. The background shows sky and vegetation, but no ocean or water. VERDICT: PASS
FAILEvaluator
The prompt explicitly excluded “food” from the image. Upon careful inspection, there is a black frying pan on the stove top that contains visible browned, cooked food — likely meat or vegetables — which is clearly present. This violates the exclusion condition. VERDICT: FAIL
PASSEvaluator
The image shows an office workspace with a wooden desk, a brown leather chair, a bookshelf filled with books, and a desk lamp. The desk also holds a vintage typewriter, notebooks, and writing utensils. There are no computers, monitors, tablets, or any other digital screens visible in the scene. The excluded elements — “computers or screens” — are indeed absent from the image. VERDICT: PASS
PASSEvaluator
The image shows a car body that has windows and doors, and headlights are clearly visible. However, the car is missing wheels — the front wheel hubs are exposed, and the rear wheel wells are empty. The car is resting on wooden blocks, confirming the absence of wheels. The prompt explicitly excluded “wheels,” and they are indeed absent from the image. VERDICT: PASS
PASSEvaluator
The image shows a pepperoni pizza with tomato sauce and fresh basil leaves on top. The crust is visibly charred in places, indicating it was baked at high heat. However, upon close inspection, there is no cheese visible anywhere on the pizza. The red sauce is clearly visible beneath the pepperoni and basil, and no white or yellowish melted or shredded cheese is present. VERDICT: PASS
FAILEvaluator
The image shows a close-up of a human face with clearly visible eyes, nose, and mouth. Above each eye, there are distinct, natural-looking eyebrows. The prompt explicitly excluded “a human face with eyes, nose, and mouth but no eyebrows,” meaning the absence of eyebrows is required. Since eyebrows are clearly present in the image, the excluded element is not absent. VERDICT: FAIL
▸Relative Position100%
PASSEvaluator
The prompt specified: "A cat sitting on top of a cardboard box." In the image: - The cat is clearly positioned on the top surface of the cardboard box. - The spatial relationship “on top of” is accurately depicted. - No other positional relationships (left, right, above, below, under) were specified in the prompt, so there is no requirement to verify them. Since the core requirement — the cat sitting on top of the box — is correctly rendered, the spatial arrangement matches the prompt. VERDIC
PASSEvaluator
The prompt specified: “A red ball underneath a wooden chair.” In the image: - There is a red ball on the wooden floor. - A wooden chair is positioned directly above the ball, with its legs and frame enclosing the ball from above. - The ball is clearly located “underneath” the chair — the spatial relationship is correct. Other positional relationships mentioned in the prompt (on, under, left, right, above, below) are not fully required to be present — only that the specified relationship (“unde
PASSEvaluator
The prompt specified: "A coffee mug next to an open book on a desk." In the image: - There is an open book lying flat on a wooden desk. - A coffee mug is placed on the same desk, positioned to the left of the open book. - The spatial relationship “next to” is satisfied — the mug and book are adjacent on the desk surface. - The phrase “on a desk” is also satisfied — both objects are resting on the desk. All positional relationships described in the prompt are correctly rendered. VERDICT: PASS
PASSEvaluator
The prompt specifies: "A candle on top of a stack of books, the books next to a potted plant." In the image: - The candle is indeed placed on top of a stack of books — this spatial relationship is correct. - The stack of books is positioned next to the potted plant — this spatial relationship is also correct. All described positional relationships (on, next to) are accurately represented. VERDICT: PASS
PASSEvaluator
The bird is perched on a tree branch, which extends from a tree located directly above and slightly behind the park bench. The branch is positioned above the bench, and the bench is situated beneath the branch. The bird is on the branch, which is part of the tree, and the tree is next to the bench — meaning the tree and bench are adjacent, not overlapping or nested. All positional relationships described in the prompt — “bird perched on a branch,” “branch extending from a tree,” “tree next to a
PASSEvaluator
The prompt specifies: “A cat sitting on a pillow, the pillow placed on a wooden chair.” In the image: - The cat is indeed sitting on a pillow — correct. - The pillow is placed on a wooden chair — correct. All positional relationships described in the prompt are visually accurate: - “on” — cat is on pillow, pillow is on chair. - “under” — not explicitly required, but the chair’s seat is under the pillow, which is consistent with the arrangement. - “left/right/above/below” — not directly relevan
PASSEvaluator
The image shows five chess pieces arranged on a wooden chessboard: - A dark wooden king is in the center. - To the left of the king is a light-colored queen. - To the right of the king is a light-colored bishop. - Above the king (in the direction toward the top of the image) is a light-colored rook. - Below the king (in the direction toward the bottom of the image) is a light-colored knight. All five pieces are clearly visible and positioned according to the specified spatial relationships: le
PASSEvaluator
The prompt specified: "laptop in the center, coffee mug to the left, phone to the right, lamp above, and notebook below". - Laptop is centrally positioned on the desk — ✅ - Coffee mug is to the left of the laptop — ✅ - Phone is to the right of the laptop — ✅ - Lamp is above the laptop (mounted on an articulated arm, shining down) — ✅ - Notebook is below the laptop (open and placed directly in front of it, on the desk surface) — ✅ All specified spatial relationships are correctly depicted. VER
PASSEvaluator
The image shows five animals arranged in a line from left to right in a grassy field: - A brown horse is on the far left. - A black-and-white cow is immediately to the right of the horse. - A white sheep is positioned in the center of the group. - A brown-and-white goat is to the right of the sheep. - A black-and-white dog is on the far right. All described spatial relationships — “horse on the far left,” “cow to the right of the horse,” “sheep in the center,” “goat to the right of the sheep,”
PASSEvaluator
The image correctly depicts the spatial relationships as specified in the prompt: - The red barn is in the foreground at a large scale, occupying the left side of the frame and appearing closest to the viewer. - The windmill is in the midground at a medium scale, positioned to the right of the barn and further back, consistent with depth perspective. - The snow-capped mountains are in the background at a small scale, appearing farthest away and positioned behind both the barn and windmill, comp
PASSEvaluator
The image correctly depicts the spatial relationships described in the prompt: - The sandcastle is in the foreground, closest to the viewer, occupying the lower portion of the frame. - The surfer is in the midground, riding a wave further out in the ocean, positioned behind the sandcastle. - The cruise ship is in the background, on the horizon, farthest from the viewer, appearing smaller due to perspective. All positional relationships (foreground → midground → background, with decreasing size
PASSEvaluator
The image correctly depicts the spatial relationships described in the prompt: - **Foreground**: A park bench with pigeons is clearly visible in the immediate foreground, with some pigeons perched on the bench and others on the ground around it. - **Midground**: A large, ornate fountain is positioned behind the bench, centered in the midground, with water spraying upward. - **Background**: Skyscrapers form the distant background, rising behind the fountain and trees, appearing progressively sma
▸Scale & Proportions100%
PASSEvaluator
The person is standing next to the school bus, and their relative scale is realistic. The bus is a standard-sized school bus, and the person appears to be an average adult, fitting proportionally next to the vehicle. The bus’s windows, headlights, and overall dimensions are consistent with real-world school buses, and the person’s height relative to the bus’s door, windows, and bumper is accurate. There is no distortion or exaggerated scale. VERDICT: PASS
PASSEvaluator
The image shows a ladybug resting on a large green leaf. The leaf occupies the vast majority of the frame, with visible veins, texture, and imperfections like holes and water droplets, indicating it is indeed large. The ladybug, while small in comparison, is proportionally accurate — it is a realistic-sized insect relative to the leaf, not unnaturally oversized or undersized. The composition and perspective maintain consistent, realistic proportions. VERDICT: PASS
PASSEvaluator
The child is holding a basketball that is proportionally appropriate for his size. A standard basketball is about 24 cm (9.4 inches) in diameter, and for a young child, this size is reasonable — it’s large enough to be a proper basketball but not so large that it appears absurdly oversized relative to his body. The ball fills his hands and sits comfortably against his torso, which is consistent with real-world proportions. The child’s arms and hands are also proportionally sized to hold the ball
PASSEvaluator
The image shows a coffee cup with a tiny figurine standing on its rim. The figurine is proportionally much smaller than the cup, and the cup itself is a realistic, full-sized object placed on a wooden table. The perspective and scale are consistent — the figurine appears to be a miniature model, and the cup is a normal-sized coffee mug. The prompt asked for “a tiny person standing on the rim of a coffee cup,” which is visually achieved. The relative sizes are correct and consistent with the prom
PASSEvaluator
The image depicts a miniature house resting on the palm of a human hand. The house is clearly scaled to fit comfortably within the palm, with its height and width proportionally small relative to the hand’s size. The hand’s anatomy — including the curvature of the palm, the spacing of the fingers, and the visible knuckles — is consistent with real-world proportions. The house’s windows are lit, and smoke rises from a chimney, adding to the illusion of a functioning, tiny dwelling. The scale rela
PASSEvaluator
The image depicts a giant tabby cat walking between city skyscrapers, with its paws and body occupying the space between buildings and dwarfing the street-level traffic and pedestrians below. The cat’s scale is consistent with the “kaiju” concept — it is enormous compared to the surrounding urban environment. The relative proportions of the cat (head, body, legs, tail) are anatomically plausible for a cat, even at giant size. The buildings, cars, and people are rendered at realistic scale relati
PASSEvaluator
The image shows an elephant, a horse, a dog, a cat, and a mouse standing in a line. The elephant is the largest, as expected. The horse is significantly smaller than the elephant but still much larger than the dog, cat, and mouse. The dog is larger than the cat, which is larger than the mouse — all of which aligns with real-world size relationships. The proportions are consistent with actual animal sizes: the elephant dwarfs the horse, the horse dwarfs the dog, the dog dwarfs the cat, and the c
PASSEvaluator
The image depicts a dinner table setting with a plate, fork, knife, wine glass, salt shaker, and peppercorn grinder — all of which are rendered with realistic relative sizes and proportions. - The plate is the largest item in the foreground, appropriately sized for a dinner setting. - The fork and knife are standard cutlery sizes, placed correctly relative to the plate. - The wine glass is proportionally sized — taller than the plate but not oversized — and its stem and base are correctly scale
PASSEvaluator
The image depicts a farm scene with a barn, tractor, farmer, cow, chicken, and fence post. The relative sizes of these objects are consistent with real-world proportions: - The barn is large and dominates the background, as expected for a functional farm structure. - The tractor is appropriately sized relative to the barn and the farmer — it’s a standard-sized farm tractor, not exaggerated or shrunk. - The farmer is proportionally correct in relation to the tractor and barn — he appears human-s
Human realism 98%
▸Faces & Expressions100%
PASSEvaluator
The face in the image is symmetric and features are anatomically correct — eyes, nose, mouth, and ears are proportionally aligned and natural. The man appears middle-aged with visible signs of aging (wrinkles, graying hair, beard), and his expression is calm and serious, with a neutral mouth and steady gaze. There are no distortions or unnatural artifacts. The emotion is clearly readable as composed and thoughtful, matching the prompt. VERDICT: PASS
PASSEvaluator
The face in the image is largely symmetrical with natural-looking features: eyes, nose, and mouth are proportionally balanced, and the freckles and hair texture appear organic. The emotion is clearly readable — the subject has a warm, genuine smile with crinkled eyes and an open mouth, conveying happiness and approachability. There are no significant distortions; the facial structure is anatomically plausible, and the lighting enhances the natural expression without creating unnatural artifacts.
PASSEvaluator
The face in the image is largely symmetrical with natural, correctly proportioned features. The eyes, nose, and mouth are well-aligned, and the facial structure appears anatomically accurate. The teenager has a relaxed, neutral expression — lips are closed and relaxed, gaze is steady and direct, with no strong emotional cues, which matches the prompt. There are no obvious distortions or unnatural artifacts. The freckles, hair texture, and lighting are consistent with a natural, candid portrait.
PASSEvaluator
The face is largely symmetric with correct anatomical features — eyes, nose, and mouth are properly aligned and proportioned. The expression clearly conveys disgust: the nose is visibly wrinkled, the lips are curled downward, and the eyebrows are furrowed, all consistent with the prompt. There are no unnatural distortions; the wrinkles and skin texture appear realistic and age-appropriate. The emotion is unmistakably readable. VERDICT: PASS
PASSEvaluator
The face is largely symmetric with correctly rendered features: eyes, nose, and mouth are proportionate and aligned. The emotion of surprise is clearly readable — eyebrows are raised, eyes are wide, and the mouth is slightly open, matching the prompt. There are no noticeable distortions; the facial structure is natural and the expression is coherent. The hand gesture reinforces the emotional context without altering facial symmetry. VERDICT: PASS
PASSEvaluator
The face is symmetric and features are anatomically correct — eyes, nose, mouth, and ears are proportionally aligned and natural. There are no obvious distortions or unnatural artifacts. The expression clearly conveys quiet melancholy: the eyes are downcast, the mouth is slightly drawn and downturned, and the gaze is distant and introspective. The lighting and composition enhance the somber mood. All elements of the prompt are accurately rendered. VERDICT: PASS
PASSEvaluator
The face in the image is largely symmetrical, with balanced features around the eyes, nose, and mouth. The deep wrinkles, especially around the eyes and mouth, are natural and consistent with an elderly woman in her 80s. The white hair is visible and styled in a casual updo. The smile is warm and genuine, with crinkled eyes that convey kindness and joy — the intended emotion is clearly readable. There are no significant distortions; the facial structure is anatomically plausible, and lighting an
PASSEvaluator
The face in the image is largely symmetric and features are anatomically correct for a young child. The child has round cheeks, large expressive eyes, and tears streaming down both cheeks — clearly conveying the emotion of crying. The facial expression is natural and emotionally readable, with slightly downturned lips and a pout that reinforces the sadness. There are no significant distortions; the proportions and features are well-rendered and realistic. The prompt’s requirements are met. VERD
PASSEvaluator
The face in the image is largely symmetric and features are anatomically correct. The elderly man appears to be in his 70s, with visible age spots on his forehead and cheeks, and thinning gray hair, consistent with the prompt. He is wearing reading glasses, and his expression — a slight downturn of the mouth, raised eyebrows, and a direct, piercing gaze — conveys skepticism clearly. There are no significant distortions; the facial structure, proportions, and details (like skin texture and hair)
PASSEvaluator
All three faces are symmetric with correct anatomical features — eyes, nose, mouth, and ears are properly aligned and proportioned. The intended emotions are clearly readable: - Left person: Laughing with head tilted back — mouth open, eyes crinkled, joyful expression. - Center person: Shocked with wide eyes — eyes wide open, hand covering mouth, eyebrows raised. - Right person: Rolling eyes with smirk — eyes looking upward and sideways, lips curled in a smirk. No distortions are present; fac
PASSEvaluator
The image shows four people, matching the prompt: a father (center, beaming), a mother (right, smiling gently), a teenager (left, looking bored/neutral), and a child (front, grinning excitedly). All faces are symmetric with correct anatomical features — no distortions, asymmetries, or unnatural deformities are visible. Emotions are clearly readable: the father’s wide smile conveys pride, the mother’s soft smile is gentle, the teenager’s expression is bored or disengaged, and the child’s wide gri
PASSEvaluator
The image shows three individuals at a meeting, each with distinct facial expressions: - The woman on the left has furrowed brows and is holding her head, conveying frustration — this matches the prompt. - The man in the center is smiling and nodding, appearing pleased — this also matches. - The woman on the right has a furrowed brow and a grimace, looking confused or distressed — this aligns with the “confused with a tilted head” description, though her head is not visibly tilted. All three f
▸Full Body100%
PASSEvaluator
The woman in the image is standing upright in a relaxed pose with her arms at her sides, wearing a white t-shirt and blue jeans — fully matching the prompt. - Limb count: Correct — two arms, two legs, both visible and intact. - Proportions: Anatomically correct — limbs and torso are proportionally balanced; no exaggerated or missing body parts. - Pose: Recognizable — upright, relaxed stance with arms naturally at sides, consistent with a casual standing pose. - Distortions: None — no visible wa
PASSEvaluator
The man in the image is standing straight, facing the camera, with both hands in his pockets — exactly matching the prompt. His proportions appear normal: head, torso, and limbs are in correct anatomical relation. There are no visible distortions in limb count (two arms, two legs) or proportions (no elongated or shortened limbs, no fused or missing parts). The pose is clearly recognizable as a common casual stance — standing upright with hands in pockets — and is consistent with the description.
PASSEvaluator
The subject in the image is a full-body shot of a person standing in a park, as requested. The pose is neutral — standing upright, arms at the sides, feet together or slightly apart — which is recognizable and consistent with the prompt. The person has two arms, two legs, and all limbs are proportionally correct with no visible anatomical distortions. The head, torso, and limbs are aligned naturally. The background context (park with trees, benches, people, dogs) supports the setting. There are
PASSEvaluator
The image shows a woman in mid-stride running through a park. Her limbs are correctly proportioned and anatomically accurate — two arms, two legs, with natural joint articulation and muscle definition consistent with running motion. Her pose is dynamic and recognizable as running: one leg is forward with foot planted or just lifting, the other leg is trailing behind, arms are bent and swinging in opposition, and her torso is slightly forward-leaning. Her hair and clothing show motion blur and bi
PASSEvaluator
The man in the image is captured mid-air, jumping over a puddle on a cobblestone street. His body is angled forward, knees bent, and feet are slightly apart — consistent with a dynamic jump. He is wearing a beanie, a jacket, and boots, and water is splashing around his feet as he clears the puddle. Anatomical inspection: - Limb count: All four limbs (two arms, two legs) are present and clearly visible. - Proportions: The proportions of the body appear natural — head, torso, arms, and legs are i
PASSEvaluator
The dancer is captured mid-spin in a dynamic, expressive pose. Her dress, made of a flowing, satin-like fabric, flares outward in a circular motion, consistent with the motion of a spin. One leg is extended outward and slightly lifted, while the other supports her weight on the ball of the foot — a classic balletic turn position. Her arms are gracefully extended, enhancing the sense of motion and balance. Limb count is correct: two arms, two legs, and a torso. Proportions are anatomically accur
PASSEvaluator
The image shows a person performing the yoga tree pose (Vrksasana). The individual is standing on one leg, with the other foot pressed against the inner thigh of the standing leg. The arms are raised overhead with palms pressed together in prayer position (Anjali Mudra). The pose is clearly recognizable and matches the description. Limb count is correct: two arms, two legs, and the torso and head are intact. Proportions appear anatomically accurate — the body is balanced, the spine is upright,
PASSEvaluator
The climber has two arms and two legs — limb count is correct. Proportions appear anatomically plausible: limbs are elongated appropriately for climbing, joints bend naturally, and the torso is angled in a realistic climbing posture. The pose is clearly recognizable as a rock climber using both hands and feet to grip holds on a vertical cliff face — one hand higher, one lower, legs positioned for balance and leverage. The body is tensed appropriately for exertion, and the climber’s gaze is direc
PASSEvaluator
The image shows a cellist seated with the cello positioned between her legs, supported by a stand. Her left hand holds the neck of the cello, and her right hand holds the bow, which is extended toward the strings — consistent with the action of playing. Her posture is upright, and her legs are wrapped around the instrument as described. The limb count is correct (two arms, two legs), and proportions appear anatomically accurate — no obvious distortions in limb length or joint placement. The pose
PASSEvaluator
The image shows two gymnasts performing a synchronized handstand side by side on a gym floor. Their bodies are nearly identical in alignment — both legs are straight and extended vertically, feet pointed, and both hands are placed shoulder-width apart on the mat. Their torsos are parallel to the floor, heads are down, and hair is tied back. The pose is anatomically correct: each has two arms, two legs, and proper joint articulation. Limb count is accurate (each has 2 arms and 2 legs), proportion
PASSEvaluator
The image shows two soccer players in a dynamic, realistic action shot. The player in blue is sliding on the grass, arms out for balance, with one leg extended toward the ball. The player in red is airborne, mid-jump, with one foot near the ball and the other leg bent, appearing to leap over the sliding player. Both players have correct limb counts (two arms, two legs each), and their proportions are anatomically accurate — no extra limbs or missing body parts. Their poses are physically plausib
PASSEvaluator
The image shows a full-body shot of a tango couple in a dramatic, close embrace. The man is dipping the woman backward, and their legs are intertwined — her right leg is wrapped around his waist/back, and her left leg is extended back, lifted, and held by his hand. Her body is arched backward, supported by his arms, and their faces are close, almost touching. Anatomical analysis: - Limb count: All limbs are present and accounted for — both dancers have two arms, two legs, and the correct number
▸Hands92%
PASSEvaluator
Both hands are clearly visible, palms up, and all ten fingers are distinctly shown — five on each hand. The fingers are anatomically correct in number and arrangement. The gesture is natural for presenting hands palms-up, with fingers slightly spread. There are no anomalies such as missing, extra, or fused fingers. The hands appear weathered, consistent with someone who works outdoors (e.g., gardening), which is plausible given the background. VERDICT: PASS
PASSEvaluator
Both hands are visible at the person’s sides, fingers relaxed and naturally spread, as per the prompt. Counting the fingers on each hand: - Left hand: Thumb, index, middle, ring, pinky — 5 fingers. - Right hand: Thumb, index, middle, ring, pinky — 5 fingers. Anatomically, the hands are correctly proportioned. The fingers are relaxed, slightly curved downward, and the thumbs are naturally positioned. There are no extra fingers, missing fingers, or unnatural deformities. The posture and gesture
PASSEvaluator
The left hand (on the viewer’s right) is resting on the upper right arm, and the right hand (on the viewer’s left) is resting on the upper left arm — matching the prompt. - Left hand (viewer’s right): 5 fingers clearly visible — thumb, index, middle, ring, pinky. No anomalies. - Right hand (viewer’s left): 5 fingers clearly visible — thumb, index, middle, ring, pinky. No anomalies. The grip is anatomically correct: the hands are resting on opposite upper arms, fingers are spread naturally, and
PASSEvaluator
Both hands are visible in the image. - The right hand (writing hand) grips the pen with the thumb, index, and middle fingers — forming a standard tripod grip. All five fingers are present and anatomically correct. The thumb is on one side of the pen, the index finger is on top, and the middle finger supports from the side — this is a natural and correct grip for writing. - The left hand holds the notebook open, with the thumb and fingers visible. All five fingers are present and appear anatomic
PASSEvaluator
The visible hand in the image is holding the white ceramic coffee mug by its handle with the right hand, as prompted. Finger count: - Thumb: visible - Index finger: visible, wrapped around the handle - Middle finger: visible, wrapped around the handle - Ring finger: visible, wrapped around the handle - Pinky finger: visible, wrapped around the handle All five fingers are clearly visible and accounted for. Anatomical correctness: The grip is natural and anatomically correct for holding a mug b
PASSEvaluator
Left hand (on fretboard): 5 fingers visible — thumb, index, middle, ring, pinky — all positioned correctly on the frets, with natural curvature and finger placement consistent with standard guitar fretting technique. Right hand (strumming): 5 fingers visible — thumb, index, middle, ring, pinky — thumb is resting near the soundhole, and the other fingers are positioned as if strumming or preparing to strum. The hand posture is anatomically plausible for strumming, though the exact motion is not
PASSEvaluator
The image shows a person’s right hand making a peace sign gesture — the index and middle fingers are extended upward, while the ring, pinky, and thumb are curled or folded inward. All five fingers are clearly visible and anatomically present. - Index finger: visible and extended - Middle finger: visible and extended - Ring finger: curled inward - Pinky finger: curled inward - Thumb: curled inward, resting against the palm The gesture is anatomically correct — the thumb is not extended but curl
PASSEvaluator
The image shows a single hand performing a thumbs-up gesture. Counting the fingers: - Thumb: 1 (extended upward) - Index finger: 1 (curled) - Middle finger: 1 (curled) - Ring finger: 1 (curled) - Pinky finger: 1 (curled) That’s exactly 5 fingers visible per hand (only one hand is shown). The gesture is anatomically correct: the thumb is extended upward, and the other four fingers are curled inward toward the palm, forming a natural, common thumbs-up pose. There are no anomalies — no extra fin
FAILEvaluator
The image shows one hand prominently extended toward the camera, with the index, middle, and ring fingers extended — matching the prompt “counting to three.” The thumb is curled inward, and the pinky finger is not visible or extended. The thumb is not counted as part of the “three” in the gesture, and the hand is not showing five fingers. Counting the visible fingers: - Index finger: present - Middle finger: present - Ring finger: present - Pinky finger: not visible (likely hidden behind the ot
PASSEvaluator
Both hands in the handshake are clearly visible. Each hand shows five distinct fingers: thumb, index, middle, ring, and pinky. The fingers are positioned naturally in a standard professional handshake gesture — palms facing each other, fingers interlocked, thumbs resting on the outer side. There are no anatomical anomalies such as extra or missing fingers, malformed joints, or unnatural finger alignment. The grip is symmetrical and realistic for a professional context. VERDICT: PASS
PASSEvaluator
Both hands in the image are clearly visible as they meet in a high-five gesture. - The woman’s left hand (on the left side of the image) shows all five fingers extended: thumb, index, middle, ring, and pinky. - The man’s right hand (on the right side of the image) also shows all five fingers extended. The gesture is anatomically correct for a high-five: palms meeting with fingers extended, wrists slightly angled, and hands positioned to make contact. There are no anomalies such as extra finger
PASSEvaluator
Each visible hand in the image clearly shows five fingers: thumb, index, middle, ring, and pinky. All fingers are anatomically positioned and appear natural in their grip around the gift box. The hands are gripping the box in a way that is consistent with real-world interaction — thumbs are on one side, fingers wrapping around the opposite side, and the gesture is symmetrical and functional. There are no anomalies such as extra fingers, missing fingers, or unnatural joint positions. VERDICT: PA
▸Multi-Subject100%
PASSEvaluator
Person 1: - Hair: Short, blonde (pixie cut) - Clothing: Wearing a red jacket over a striped shirt and dark pants - Position: Standing on the left side of the image Person 2: - Hair: Long, black, wavy - Clothing: Wearing a blue patterned dress - Position: Standing on the right side of the image, arm linked with Person 1 Both individuals are visually distinct and match the described features exactly. VERDICT: PASS
PASSEvaluator
Person 1 (left): - Beard: Present, full and well-defined. - Glasses: Wearing round, thin-framed glasses. - Sweater: Wearing a textured, olive-green knit sweater. - Hair: Short, brown, slightly tousled. - Facial expression: Smiling, engaged in conversation. Person 2 (right): - Beard: Clean-shaven. - Glasses: Not wearing any. - Hoodie: Wearing a black, drawstring hoodie. - Hair: Short, neat, dark brown. - Posture: Leaning slightly forward, listening attentively. Both individuals are visually dis
PASSEvaluator
Person 1 (tall): - Hair: Curly, red (appears vibrant and voluminous) - Clothing: Denim overalls over a striped shirt - Height: Clearly taller than the other person, consistent with “tall” Person 2 (short): - Hair: Straight, brown (medium-length, neat) - Clothing: Yellow sundress with a floral pattern - Height: Clearly shorter than the other person, consistent with “short” Both individuals are visually distinct and match the described features exactly. VERDICT: PASS
PASSEvaluator
- Tall woman with short pink hair playing bass: Present. She has short, bright pink hair, is standing, and is playing a bass guitar. Visually distinct and matches description. - Man with beard and bandana on drums: Present. He has a full beard, is wearing a patterned bandana, and is seated behind a drum kit. Visually distinct and matches description. - Slim man with glasses playing keyboard: Present. He is standing, wearing glasses, and playing a keyboard. Visually distinct and matches descripti
PASSEvaluator
- Man: Wearing a sunhat and a large backpack; has a beard and is gesturing with his right hand. Matches description. - Woman: Has long braided hair and is holding two hiking poles; wearing a backpack. Matches description. - Teenager: Wearing a baseball cap (worn backward) and holding a water bottle; wearing a t-shirt and shorts. Matches description. All three individuals are visually distinct and match the described features. VERDICT: PASS
PASSEvaluator
- Bald man in a suit: Present. He is on the far left, wearing a dark blue suit with a white shirt and tie. His head is completely bald, and he is smiling while holding a wine glass. - Woman with silver hair in a red blouse: Present. She is seated next to the bald man, has short silver hair, and is wearing a red blouse. She is smiling and looking toward the camera. - Young man with dreadlocks in a denim jacket: Present. He is seated next to the woman in the hijab, has long dreadlocks, and is wear
Truthfulness 96%
▸Physics & Reflections100%
PASSEvaluator
The red sphere casts a single, soft-edged shadow to its right, which is consistent with a light source positioned to the upper left — as specified in the prompt. The shadow’s direction and elongation are physically plausible for a directional light source illuminating a spherical object on a flat surface. The shadow’s gradient is smooth and falls naturally, with no unnatural distortion or abrupt edges. The color of the shadow is a neutral gray, appropriate for a non-reflective surface under a si
PASSEvaluator
The coffee mug is positioned on a windowsill with sunlight coming from the right side (as seen from the viewer’s perspective), casting a shadow to the left — which matches the prompt’s description. The shadow is sharp and well-defined, consistent with direct sunlight. The direction and orientation of the shadow correspond to the mug’s shape and the light source’s position. There is no distortion in the shadow’s form — it accurately follows the mug’s silhouette, including the handle. The shadow’s
PASSEvaluator
The person is standing outdoors at midday, facing upward toward the sun, which is positioned high in the sky (visible as a bright glare near the top center). The shadow cast directly beneath the person is short and aligned with the sun’s position — consistent with midday lighting conditions. The shadow’s direction is correct (directly beneath the person), its length is appropriate for midday, and its shape matches the person’s silhouette with natural distortion from the uneven, rocky ground. The
PASSEvaluator
The reflection in the mirror accurately shows the reversed orientation of the woman — her back is visible, and her face is reversed as expected in a mirror image. The pattern on her shirt is mirrored correctly, with the design appearing as it would if viewed from behind. The lighting and shadows are consistent: the woman’s body casts a natural shadow on the floor and mirror frame, and the reflection’s lighting matches the ambient room lighting. The colors are accurate and undistorted — no unnatu
PASSEvaluator
The cat’s reflection is clearly visible in the polished marble floor, mirroring the cat’s posture and features upside down — which is physically accurate for a reflective surface. The reflection is not distorted, and its proportions and orientation are consistent with the cat’s position. The lighting direction matches: bright, natural light from the window casts strong, straight reflections and shadows that align with the cat’s body and the floor’s surface. The color fidelity is also accurate —
PASSEvaluator
The reflection on the glass table is physically plausible. The vase and flowers are clearly mirrored below, with accurate inversion and proportion. The reflection shows the bookshelf and window behind the vase, consistent with the real-world setup. The distortion is minimal, as expected from a flat, clean glass surface. The color fidelity is accurate — the greens, pinks, and whites of the flowers are reflected with appropriate tonal fidelity. Shadows are also consistent: the vase casts a soft sh
PASSEvaluator
The reflection of the trees in the lake is physically plausible. The trees are silhouetted against a sunset sky, so their reflections appear darker — consistent with the prompt’s “slightly darker” description. The water has gentle ripples, which distort the reflection slightly, creating a natural, wavy effect rather than a perfect mirror image. The color of the reflection matches the warm tones of the sky above, with subtle variations due to the water’s surface and depth. The direction of the re
PASSEvaluator
The reflection in the lake is a near-perfect, inverted mirror image of the mountain range, which is physically accurate for a still, calm body of water. The orientation is correctly inverted — the peaks appear upside down as expected in a reflection. The direction of light is consistent: the warm, golden illumination on the mountain tops is mirrored in the reflection, with the same color gradient and lighting direction preserved. The reflection is also subtly muted in color — the reflected image
PASSEvaluator
The reflection in the puddle shows the city skyline inverted as expected in a water surface — the tops of buildings appear at the bottom of the puddle, and the sky appears above them, which is physically correct. The distortion is mild and consistent with ripples and surface tension in a shallow puddle — the reflection is warped but retains recognizable shapes and proportions. The color shift is plausible: the puddle reflects ambient light (streetlights, building lights) and shows chromatic aber
PASSEvaluator
The image depicts a chrome sculpture in a room with a glass floor and large windows, as requested. The reflections and environmental mapping on every surface are highly detailed and physically plausible. - Direction and orientation: The reflections on the chrome sculpture and glass surfaces are consistent with the direction of the light source (sunlight from the windows). The reflections of the cityscape, ocean, and sky are correctly oriented relative to the sculpture’s position and the window
PASSEvaluator
The image depicts a clear glass sphere resting on a highly reflective, smooth surface (likely glass or polished stone) adjacent to a pool of water. The sphere exhibits: - **Refraction through the sphere**: The scene inside the sphere is inverted and distorted, consistent with how a spherical lens refracts light — the trees and sky are flipped and compressed into a circular view, which is physically accurate. - **Reflection on the mirror surface**: The mirror surface beneath the sphere shows
PASSEvaluator
The image depicts a crystal wine glass on a polished marble counter, with a window behind it showing a lake and mountains at sunset. Let’s evaluate the physical plausibility of the requested elements: refraction in the glass, reflection on the marble, and the lake’s reflective surface. 1. Refraction in the glass: - The glass contains red liquid (likely wine), and the cut crystal facets distort the view of the background. The refraction is visible — the background scenery appears warped and c
▸Photorealism100%
PASSEvaluator
The image shows a workspace desk with four key items: a laptop, a ceramic coffee mug, a leather-bound notebook, and a glass paperweight — matching the prompt’s requested materials. - **Brushed aluminum laptop**: The laptop’s body exhibits a matte, brushed metal finish with subtle directional highlights and reflections consistent with real aluminum. The screen and keyboard are rendered with appropriate gloss and depth. - **Ceramic coffee mug**: The mug has a glazed, earth-toned ceramic texture w
PASSEvaluator
The image depicts a still life on a wooden table with a glass of water, a silver spoon, a linen napkin, and a red apple — all materials rendered with accurate visual properties. - **Glass**: The glass is transparent with visible water and ice cubes inside. Surface reflections and refractions are realistic, including condensation droplets and distortion of background elements through the glass. The translucency and wetness are well rendered. - **Metal (silver spoon)**: The spoon shows sharp, cl
PASSEvaluator
The image depicts a kitchen counter with a stainless steel pot, a wooden cutting board, a glass bottle of olive oil, and a cotton dish towel — all under natural lighting. Let’s evaluate each material’s visual properties: 1. **Stainless Steel Pot**: The metal surface shows clear, realistic reflections of the surrounding environment (including the window and the bottle), with appropriate specular highlights and subtle surface imperfections. The reflections are coherent and physically plausible. ✅
▸World Knowledge92%
PASSEvaluator
The image depicts the Eiffel Tower in Paris on a clear day, with its iconic iron lattice structure clearly visible. The architectural details are recognizable and largely accurate, including the three levels: the ground level with its arches, the first level with its observation deck and railings, and the upper levels with their distinctive lattice work. The tower’s proportions, truss design, and overall silhouette are faithful to the real structure. However, there is one notable inaccuracy: th
PASSEvaluator
The depicted subject is unmistakably the Statue of Liberty, standing on her island pedestal with the New York City skyline in the background. The statue is shown with all key features as requested: the torch in her raised right hand, the seven-pointed crown on her head, the tablet held in her left hand (with visible inscriptions), and the characteristic green patina covering her copper body. The pedestal is accurately rendered, including its stone structure and surrounding walkways. The backgrou
PASSEvaluator
The depicted subject is the Taj Mahal in Agra, viewed from the reflecting pool, which matches the prompt’s request for a real-world subject. The image shows the main dome, four minarets, and the overall symmetrical layout accurately. The reflection in the pool is clear and well-composed, and the architectural details — including the arches, ornamentation, and proportions — are rendered faithfully. The lighting and perspective are consistent with a real photograph taken at sunrise or sunset. The
PASSEvaluator
The image depicts a ceramic bowl with visible gold lacquer lines filling cracks — a hallmark of kintsugi, the Japanese art of repairing broken pottery with gold. The bowl’s material, texture, and repair style are consistent with authentic kintsugi practice. The gold lines are clearly visible and follow the natural fracture patterns, which is both aesthetically and technically accurate. The bowl is placed on a wooden surface alongside a bamboo whisk (chasen) and a small cup containing green tea,
PASSEvaluator
The depicted subject is a traditional Japanese shamisen, and it is recognizable and factually accurate. - It has a wooden body with a rounded, slightly curved back and a flat, white skin-covered resonator (typically made of parchment or similar material). - It has three strings, which is correct for a standard shamisen. - The bachi (plectrum) is present, made of wood, and resting on the body — a common way to display or store it. - The neck and tuning pegs are correctly proportioned and positio
FAILEvaluator
The depicted object is a circular brass instrument with concentric rings, star-shaped markers, and engraved degree markings — consistent with the general appearance of a medieval astrolabe. It includes a rete (the star pattern) and a tympan (the inner disk with degree markings), both of which are present and appear to be correctly positioned relative to each other. However, there are significant inaccuracies: 1. The text engraved on the instrument is a mix of Arabic script and Latin script (e.
PASSEvaluator
The image depicts a diagram of the water cycle with labeled stages: evaporation from the ocean, condensation forming clouds, precipitation (rain) over mountains, river flow, collection, and groundwater. The arrows correctly show the cyclical movement of water. The diagram is factually accurate in its representation of the water cycle’s key processes. While it simplifies some aspects (e.g., omitting transpiration, sublimation, or runoff into lakes), it correctly illustrates the core components r
PASSEvaluator
The image depicts a clear, labeled cross-section of a convergent plate boundary, specifically an oceanic-continental subduction zone. It correctly shows: - An oceanic plate subducting beneath a continental plate. - A deep ocean trench at the point of subduction. - A volcanic arc forming on the overriding continental plate. - Magma rising from the subducting slab into the overlying crust, shown erupting from volcanoes. - The lithospheric mantle and asthenospheric mantle layers, with arrows indic
PASSEvaluator
The image depicts a clear, labeled anatomical illustration of the human heart with the four chambers (right atrium, right ventricle, left atrium, left ventricle) correctly shown and labeled. The major vessels — superior vena cava, aorta, pulmonary artery, and pulmonary veins — are accurately placed. The valves (tricuspid valve, mitral valve, aortic valve) are also correctly positioned and labeled. However, there is one critical error: the image labels the **left ventricle twice** — once correct
PASSEvaluator
The depicted subject is a Morpho menelaus butterfly, and it is recognizable as such. The dorsal (upper) wings display a vivid, iridescent blue — a hallmark of this species — with fine venation and dark brown margins. The ventral (lower) wings show the characteristic brown background with prominent eyespots, which are accurately rendered with orange and black rings. The body morphology, including the thorax, abdomen, and antennae, appears anatomically correct for a butterfly. There are no signif
PASSEvaluator
The image depicts the interior of the Hagia Sophia, and it correctly shows the massive central dome, pendentives, semi-domes, and Islamic calligraphy medallions — all of which are present in the actual building today. The calligraphy is authentic in style and placement, and the Byzantine mosaics (though many have been covered or altered) are still visible in parts, particularly the angelic figures and decorative elements around the dome and semi-domes. The architectural structure, including the
PASSEvaluator
The image depicts a bismuth crystal that is visually striking and highly recognizable as a specimen of bismuth. It exhibits: - **Iridescent oxide layers**: The surface shows vivid, rainbow-like iridescence, which is characteristic of bismuth due to thin oxide films forming on its surface. - **Geometric hopper crystal formation**: The crystal displays the classic “hopper” morphology — a stepped, geometric structure that grows outward from the center, with distinct, layered planes. - **Characteri
Professional Studio 100%
▸Camera & Lighting100%
PASSEvaluator
The image presents a clear aerial bird’s-eye view looking straight down at a traffic roundabout, with cars circulating around a central green space filled with trees — matching the prompt’s description. The perspective is top-down and centered, with consistent depth of field that keeps both the roundabout and surrounding elements in sharp focus. Lighting is natural and directional, casting consistent shadows from the trees and vehicles, suggesting a sunlit daytime scene. The composition and ligh
PASSEvaluator
The image presents a wide-angle shot of a modern living room interior, capturing an expansive space with a distinct barrel distortion effect at the edges — clearly visible in the curved, warped framing around the periphery. The perspective is immersive and fisheye-like, emphasizing the circular architecture and the vastness of the room. The depth of field appears shallow, with the central seating area and fireplace in sharp focus while the background elements (stairs, windows, distant mountains)
PASSEvaluator
The image presents a close-up macro shot of a single dewdrop resting on a red rose petal. The perspective is tight and focused on the droplet, which is the central subject. The depth of field is shallow: the dewdrop and the immediate petal surface beneath it are in sharp focus, while the background is softly blurred (bokeh), and even adjacent petals show gradual softness. This matches the shallow depth of field requirement. The lighting is soft and diffused, creating gentle highlights on the de
PASSEvaluator
The image features a portrait of a man with a strong, directional light source coming from the left side of the frame, casting a distinct triangular highlight on the shadowed side of his cheek — a hallmark of Rembrandt lighting. The lighting creates deep shadows and a moody, dramatic atmosphere, consistent with the prompt. The depth of field is shallow, with the subject in sharp focus while the background (portraits and books) is softly blurred, enhancing the portrait’s intimacy and directing at
PASSEvaluator
The image depicts a landscape bathed in warm, golden-hour lighting, with an orange-pink sky and long, soft shadows cast across rolling hills and trees — all of which directly matches the prompt’s description. The perspective is elevated (aerial view), which is appropriate for showcasing the expansive landscape and the winding road leading to the barn. The depth of field is shallow enough to keep the foreground and midground in sharp focus while softly blurring distant hills, enhancing the sense
PASSEvaluator
The image presents a high-key portrait of a person against a bright white background, as specified. The lighting is soft and even, with minimal shadows across the face and shoulders, creating an airy and luminous effect. The depth of field is shallow, keeping the subject in sharp focus while softly blurring the background, which enhances the portrait’s clean, minimalist aesthetic. The perspective is a medium close-up, centered on the subject’s face and upper torso, which aligns with the intent o
PASSEvaluator
The image shows a portrait of a woman outdoors, with a blurred background featuring soft, circular bokeh highlights — consistent with a shallow depth of field (likely from an 85mm f/1.4 lens). The subject is backlit, with warm light catching the edges of her hair and shoulders, creating a distinct rim light that separates her from the background. The background appears to be a garden or foliage, as described. While the image does not explicitly show the lens specification (85mm f/1.4) or the ex
PASSEvaluator
The image depicts a street scene with a person in the foreground holding a coffee cup, viewed from behind. The subject is in sharp focus, while the background — including city lights, cars, and pedestrians — is blurred into soft bokeh, which matches the requested shallow depth of field. The lighting suggests an urban evening setting with ambient city lights and reflections on wet pavement, consistent with the “soft city lights in the background” description. The perspective appears to be eye-lev
PASSEvaluator
The image is shot from a high angle, consistent with an overhead or “bird’s-eye” perspective, which aligns with the “shot from above” requirement. The depth of field appears shallow — the central skillet and its contents are in sharp focus while the foreground and background elements are softly blurred, which is characteristic of a 35mm f/2.8 lens. The lighting originates from the left, as evidenced by the strong directional light coming from the window on the left side of the frame, casting sof
PASSEvaluator
The image exhibits a clear tilt-shift effect: a narrow band of sharp focus runs along the center of the street, while the foreground and background are blurred, creating the illusion of a miniature model. The perspective is high-angle, looking down the street, which enhances the “dollhouse” effect. The lighting appears even and bright, consistent with daylight, and while not dramatically saturated, the colors (especially the red bus and yellow taxis) are vivid and distinct, matching the “saturat
PASSEvaluator
The image presents a dark, moody urban street scene with a strong horizontal lens flare that stretches across the frame. The flare is composed of a distinct blue streak on the left and a warmer orange/yellow glow on the right, which aligns with the prompt’s request for “blue and orange anamorphic lens flare streaking horizontally.” The scene has a cinematic quality, with a shallow depth of field that blurs background elements and emphasizes the foreground street and silhouettes. The perspective
PASSEvaluator
The image is a black-and-white portrait that clearly employs dramatic split lighting: the left side of the face (from the viewer’s perspective) is brightly lit, revealing fine wrinkles and texture, while the right side is plunged into deep shadow, creating a stark chiaroscuro effect. This high-contrast lighting directly matches the prompt. The perspective is a tight close-up, focusing on the face and upper shoulders, which is appropriate for a portrait. The depth of field is shallow — the backg
▸Color Precision100%
PASSEvaluator
The dress in the image is a rich, saturated royal blue, which closely matches the color described in the prompt. The background is a neutral, textured beige or off-white, also aligning with the prompt’s specification. There are no conflicting or incorrect colors visible. VERDICT: PASS
PASSEvaluator
The car in the image is a bright, vivid red — consistent with the description “vivid and unambiguous” red. The asphalt road is gray, matching the prompt. The red of the car is saturated and clearly distinguishable under the lighting conditions, with no ambiguity in its hue. The interior seats also appear to be a matching red tone, reinforcing the color consistency. VERDICT: PASS
PASSEvaluator
The image displays a vast field of bright yellow flowers under a clear, vivid blue sky — visually matching the prompt’s description of “pure sunflower yellow” and “clear blue sky.” The yellow of the flowers is saturated and consistent with the color commonly associated with sunflowers, and the sky is a clean, deep blue without clouds or distractions. While exact hex values weren’t specified in the prompt, the named colors are accurately represented in tone and saturation. VERDICT: PASS
PASSEvaluator
The walls in the image are painted in a medium blue that closely matches the hex color #4A90D9. The floor is a warm, natural wood tone, and the trim (baseboards, window frames, crown molding) is white — exactly as specified in the prompt. VERDICT: PASS
PASSEvaluator
The wall in the image is a warm, saturated orange that closely matches the hex value #FF6B35 (burnt orange). The furniture — including the bed, nightstands, dresser, and lamps — is predominantly white or off-white, matching the prompt. Natural daylight is present, as indicated by the bright window and shadows on the floor. The colors align precisely with the specified prompt. VERDICT: PASS
PASSEvaluator
The foliage in the image is predominantly a rich, dark green, which visually aligns closely with the hex color #2D5F2D (a dark muted green). While there are subtle variations in shade due to lighting and depth, the overall tone of the leaves and canopy matches the specified color. The tree trunks are brown, as required, with natural variations in tone and texture that are consistent with real wood. The prompt specified “precisely” #2D5F2D — which implies exact color matching. However, in a rea
PASSEvaluator
The walls in the image are a warm cream color that closely matches #F5E6CC. The sofa is a dark navy that aligns with #2C3E50. The accent pillows are a bright red that matches #E74C3C. All specified colors are accurately represented in the image. VERDICT: PASS
PASSEvaluator
The background is a deep dark blue that closely matches #1A1A2E. The watch has a soft gold finish that aligns with #F0C27F. The earbuds and their case are pure white, matching #FFFFFF. All specified colors are accurately represented in the image. VERDICT: PASS
PASSEvaluator
The image displays: - Background: A very light, off-white color, visually matching #FAFAFA (near-white). - Main shape: A vibrant orange-red, which closely matches #FF4500 (a classic orange-red, often called “coral red” or “tangerine”). - Accent lines: Thin, dark charcoal lines (#333333), which appear as a solid, non-shaded dark gray-black, consistent with the specified hex value. All three specified colors are accurately represented in the image. VERDICT: PASS
PASSEvaluator
The image displays a smooth gradient on a wall transitioning from red on the left to blue on the right, passing through purple in the center — exactly as described in the prompt. The red on the left visually matches #FF0000 (pure red), and the blue on the right matches #0000FF (pure blue). The transition through purple is even and natural, with no visible artifacts or color deviations. VERDICT: PASS
PASSEvaluator
The image displays a gradient transitioning from a warm orange at the bottom to a cool blue-purple at the top. The bottom color is a vibrant orange that closely matches #FF6B35. The top color is a deep indigo/blue-purple, which is visually consistent with #1B1464. The gradient is smooth and follows the specified direction (bottom to top), and the colors appear to be accurately rendered as requested. VERDICT: PASS
PASSEvaluator
The banner in the image displays a smooth horizontal gradient transitioning from a bright cyan on the left to a mint green on the right. The left edge closely matches #00C9FF (a vibrant cyan), and the right edge closely matches #92FE9D (a light mint green). The gradient is consistent with the specified hex values, and no other colors are introduced. VERDICT: PASS
▸Photorealism100%
PASSEvaluator
The image depicts a single red sneaker on a clean white background with soft, even lighting and a gentle, natural-looking shadow — consistent with professional studio product photography. The textures of the suede and mesh materials appear realistic, with fine stitching, lace detail, and subtle wear-like imperfections that suggest authenticity. There are no obvious AI artifacts: no warped geometry, inconsistent lighting, unnatural reflections, or texture smearing. The composition, lighting, and
PASSEvaluator
The image presents a clear glass perfume bottle on a reflective black surface, with no background distractions — exactly matching the prompt. The lighting is consistent with studio-quality product photography: soft, directional highlights on the glass, realistic refraction and reflection, and a clean, uniform black background. The reflections on the surface are sharp and proportionate to the bottle’s geometry. There are no obvious AI artifacts — no warped edges, inconsistent textures, unnatural
PASSEvaluator
The image presents a clean, studio-quality product shot of a green apple on a seamless white background, with soft side lighting and a subtle shadow — matching the prompt exactly. The apple’s surface shows natural texture, including fine speckles and a realistic highlight, and the stem appears organic and detailed. There are no obvious AI artifacts: no warped geometry, unnatural blending, or inconsistent lighting. The shadow is soft and falls naturally, and the background is uniformly white with
Graphical design 75%
▸Data Visualisation100%
PASSEvaluator
The chart is a bar chart showing monthly sales from January to June, with the exact values requested: 12K, 18K, 15K, 22K, 19K, 25K. Each bar is color-coded and labeled with its corresponding value. The x-axis is labeled with the months (Jan through Jun), the y-axis is labeled with sales values in K, and the title "Monthly Sales 2025" is present and clearly readable. The chart type matches the requested bar chart format. VERDICT: PASS
PASSEvaluator
The chart is a line graph titled "Weekly Traffic" with the x-axis labeled "Day" and y-axis labeled "Daily Website Visitors", matching the requested elements. Grid lines are present. The days of the week (Mon-Sun) are correctly labeled on the x-axis. The data points correspond to the requested visitor counts: - Mon: ~1200 - Tue: ~1500 - Wed: ~1100 - Thu: ~1800 - Fri: ~2000 - Sat: ~1700 - Sun: ~1400 All values are accurately represented on the graph, and the visual elements are clear and readabl
PASSEvaluator
The image displays a physical, 3D pie chart model on a desk, which visually represents market share percentages for four categories: Company A (35%), Company B (25%), Company C (20%), and Others (20%). Each slice is a different color (blue, green, orange, red) and clearly labeled with the company name and percentage. The chart type is a pie chart, matching the requested visualization format. The numbers, labels, and percentages are accurate and readable from the perspective shown. VERDICT: PASS
▸Layout & Design67%
PASSEvaluator
The image features a large popcorn bucket in the center, which matches the prompt’s description. The title “MOVIE NIGHT” is prominently displayed at the top in bold, stylized lettering, satisfying that requirement. The tagline “Every Friday at 8 PM” is clearly visible at the bottom, correctly placed and readable. The visual hierarchy is strong: the popcorn bucket is the central focal point, the title commands attention at the top, and the tagline anchors the bottom. All described layout zones a
PASSEvaluator
The poster includes a central guitar illustration, the headline “LIVE MUSIC FEST” in large, stylized text, and the tagline “Tickets available now” at the bottom — all of which match the prompt’s description. The visual hierarchy is clear: the headline dominates the top half, the guitar is centrally placed and visually anchors the design, and the tagline is clearly legible and positioned below the main graphic, as expected. All text is readable, correctly placed, and stylistically consistent wi
PASSEvaluator
The poster includes an open book illustration as the central visual element, with the title “JOIN THE BOOK CLUB” prominently displayed at the top in large, stylized lettering. The tagline “Meets every Wednesday” is clearly visible at the bottom, correctly placed and readable. The visual hierarchy is appropriate: the title dominates the top, the illustration occupies the center, and the tagline anchors the bottom. All described layout zones and elements are present and correctly positioned. VERD
PASSEvaluator
All described layout zones/elements are present with correct visual hierarchy: - Hero header image at the top: Present — a scenic photo of hikers overlooking the ocean. - Headline and subtitle in the middle: Present — “Discover Your Next Adventure.” is the headline, and “Join our community of explorers. Sign up for exclusive trips and guides.” is the subtitle, both centered and clearly readable. - 'Sign Up' call-to-action button: Present — an orange button with white text, positioned below the
PASSEvaluator
All described layout zones/elements are present: - Logo header: “The Copper Spoon Bistro” with crossed utensils and “EST. 2018” is clearly at the top. - Three category sections: “Appetizers,” “Mains,” and “Desserts,” each with items and prices, properly formatted and visually separated. - Footer: Contains address (“123 Maple Avenue, Springfield, ST 54321”), hours (“Open Tuesday-Sunday, 5pm-10pm”), phone number, and website — all correctly placed and readable. Visual hierarchy is clear:
FAILEvaluator
The image attempts to follow the described newsletter layout but fails on multiple critical points: 1. **Banner Image Header**: Present — “THE WEEKLY EXPLORER” is prominently displayed over a mountain sunset image. ✅ 2. **Two-Column Body Section**: The layout is visually divided into two columns — text on the left and an image on the right. ✅ 3. **Highlighted Quote Block**: Present — a yellow box with a quote from George Santayana is centered between the text and image. ✅ 4. **Footer with Un
PASSEvaluator
All described layout zones/elements are present with correct visual hierarchy: - Top navigation bar: Includes a hamburger menu icon on the left, “Dashboard” title centered, and a profile avatar on the right. - Search field: Clearly visible below the navigation bar with placeholder text “Search…” and magnifying glass icon. - 2x2 grid of feature cards: Four cards (“Budget,” “Tasks,” “Analytics,” “Goals”) with icons and labels, arranged in a clean grid. - Recent Activity section: Clearly labeled w
FAILEvaluator
The image shows a magazine double-page spread as described. The left page contains a large hero photo of a person on a mountain peak, which matches the prompt. The right page contains a two-column article layout, a pull quote (“The silence was profound, a canvas for the wind’s whispers.”) centered between the columns, and a sidebar on the right with an author bio and publication list. The page number “114” is also visible at the bottom of the sidebar. However, the text is not readable — it is f
FAILEvaluator
The image shows a fashion magazine spread with the following elements: - Left page: Full-bleed photo of a model in a textured dress against a concrete wall — ✅ Present. - Right page: - Headline “CONCRETE & COUTURE” — ✅ Present, large and prominent. - Sub-headline “A study in contrasts: Raw textures meet fluid forms.” — ✅ Present, below headline. - Body copy — ✅ Present, in three columns (though the text is gibberish, the layout is correct). - Inset image (small photo of a hand holding
▸Style Diversity75%
PASSEvaluator
The image depicts a golden retriever sitting in a garden, surrounded by flowers and a bench, which matches the subject matter of the prompt. The visual style is clearly that of an oil painting: thick, visible brushstrokes are evident throughout the composition, especially in the fur of the dog, the petals of the flowers, and the foliage. The texture is rich and tactile, with layered impasto effects and varied color blending that are hallmarks of traditional oil painting techniques. The lighting
PASSEvaluator
The image depicts a golden retriever sitting in a garden, surrounded by cherry blossoms, a Japanese-style stone lantern, and other garden elements. The dog has large, expressive eyes, a cheerful expression, and is rendered with bold outlines and flat color areas — consistent with Japanese anime style. Key characteristics requested: - Japanese anime style: ✅ Present — large eyes, stylized facial features, clean outlines, and cartoonish proportions. - Flat colors: ✅ Present — solid color fills wi
PASSEvaluator
The image depicts a golden retriever sitting in a garden, rendered in a pixel art style. The visual characteristics of pixel art are clearly present: the entire image is composed of visible, blocky pixels, and the forms are defined by distinct color blocks rather than smooth gradients. The scene is rendered with a limited color palette — primarily yellows, browns, greens, blues, and whites — consistent with the requested style. The 32x32 grid resolution is not explicitly verifiable from the imag
PASSEvaluator
The image clearly depicts a woman holding a parasol, set in a sun-dappled garden. The visual style is unmistakably Impressionist: brushstrokes are loose, visible, and textured; light is rendered with dappled, broken color; and the palette is dominated by soft pastels — pinks, yellows, blues, and greens — characteristic of the movement. The focus on capturing fleeting light and atmosphere, rather than sharp detail, aligns perfectly with Impressionism. All key requested characteristics — soft bru
PASSEvaluator
The image clearly embodies the requested Art Deco style. Key visual characteristics are unmistakably present: - **Geometric shapes**: The buildings feature sharp lines, stepped forms, sunburst motifs, and angular ornamentation — all hallmarks of Art Deco. - **Gold and black palette**: The dominant color scheme is black marble and gold accents, creating a luxurious, high-contrast look typical of the style. - **Ornamental symmetry**: The composition is highly symmetrical, with buildings mirroring
PASSEvaluator
The image clearly embodies the requested Pop Art style. It features: - Bold, black outlines defining facial features and hair. - Ben-Day dots (halftone patterns) visible across the skin tones and background, especially in the red and yellow areas. - Flat, vivid primary colors (red, yellow, blue) used for the background and parts of the face. - A stylized, graphic composition typical of mid-20th century Pop Art, reminiscent of Roy Lichtenstein. All key visual characteristics from the prompt are
FAILEvaluator
The image depicts a forest scene with strong photorealistic qualities — detailed textures on tree bark, moss, ferns, and water, along with realistic lighting and depth. The composition and rendering of light rays filtering through the canopy are highly detailed and photographic in nature. However, the requested style explicitly combines “photorealistic detail and lighting” with “visible watercolor bleeds and paper texture.” While the image does show a textured paper background (visible as a sub
PASSEvaluator
The image successfully embodies the requested “cyberpunk Art Nouveau” style. Key visual characteristics are clearly present: - **Art Nouveau elements**: The entire composition is framed by ornate, flowing organic lines, floral motifs (especially the large lotus-like flowers at the corners), and decorative scrollwork — all hallmarks of Art Nouveau. - **Cyberpunk elements**: The setting is a futuristic city with towering skyscrapers, neon lighting, and a rain-slicked, reflective walkway. The woma
FAILEvaluator
The image presents a framed depiction of a medieval castle rendered in a style that strongly resembles Japanese ukiyo-e woodblock prints — characterized by flat areas of color, bold outlines, simplified forms, and a stylized composition. The castle itself is drawn with clean lines, minimal shading, and a limited palette, consistent with ukiyo-e aesthetics. The framing, paper texture, and slight vignette at the top further reinforce this impression. However, the “photorealistic sky and lighting”
PASSEvaluator
The image clearly embodies the requested “soft brutalism” style. Key visual characteristics are present: - **Raw concrete textures**: The ceiling, walls, and floor are all exposed, unfinished concrete with visible aggregate and formwork lines — a hallmark of brutalism. - **Rounded pastel furniture**: The sofas and armchairs are plush, rounded, and upholstered in muted pastel tones (soft green, dusty rose, beige), contrasting with the hard surfaces. - **Warm diffused light**: The lighting is sof
FAILEvaluator
The image presents a still life with a bowl of pears on a table, rendered in a subdued, painterly style. The composition is sparse — the subject is centered but not crowded, with large areas of negative space surrounding it. The color palette is muted, dominated by greys, browns, and desaturated greens, avoiding vibrancy. The bowl itself has a visible crack, adding a subtle element of imperfection or tension. The lighting is soft and directional, creating gentle shadows that enhance form without
PASSEvaluator
The image clearly embodies the requested “nostalgic futurism — retro-futuristic 1960s space-age aesthetics with a melancholic warm-toned patina.” Key visual characteristics present: - **Retro-futurism**: Architecture features rounded domes, spires, and elevated monorails — classic 1960s sci-fi design. - **Warm-toned patina**: The entire scene is bathed in sepia, orange, and muted gold tones, evoking aged film or vintage photography, contributing to a melancholic atmosphere. - **Nostalgic detail