Testland
Browse all skills & agents

object-model-patterns

Pure reference catalog of the canonical object-model architecture patterns for test automation frameworks - Page Object Model (Fowler), Screenplay (Marcano/Palmer/Hill), Component Object, App Actions (Cypress idiom), Service Object, Repository, and Screen Object (the desktop/mobile sibling of Page Object covering Windows UIA, macOS XCTest, Linux AT-SPI, Appium / Espresso) - each with its canonical citation, when-to-use rules, refuse-to-mix anti-patterns, and a worked example. Distinct from `test-code-conventions` (file-level §1-§10) and from per-framework skills (`playwright-testing` etc., tool-specific configuration). Preloaded by `framework-architecture-auditor` and `playwright-codegen-reviewer` as the architecture-tier reference for what each pattern actually is.

object-model-patterns

Overview

This skill is a pure reference - no execution steps; it is the canonical catalog the framework-architecture-auditor and playwright-codegen-reviewer cite to determine "what good looks like" per pattern. The catalog complements test-code-conventions (which is file-level §1-§10) with the architecture-tier vocabulary.

When to use

  • Designing a new test automation framework - pick one canonical pattern; do not mix.
  • Reviewing an existing framework's architecture - audit which pattern is in use and whether it is applied consistently.
  • Migrating from one pattern to another (the most common migration: classic POM → Screenplay or POM → App Actions).
  • Onboarding new engineers - point them at the canonical citation for the pattern the team uses.

Do not use this skill to:

  • Author per-framework tool configuration - that's the per-framework skill (playwright-testing, cypress-testing, etc.).
  • Pick the framework itself - that's framework-choice-advisor.
  • Audit a running codebase against the chosen pattern - that's framework-architecture-auditor, which preloads this skill.

Pattern 1 - Page Object Model (POM)

Canonical source: Martin Fowler's PageObject definition (the bliki article is the cross-language canonical reference) + Selenium HQ documentation on Page Object Models.

Fowler's definition: "A page object wraps an HTML page, or fragment, with an application-specific API, allowing you to manipulate page elements without digging around in the HTML."

Selenium HQ's elaboration: "A page object is an object-oriented class that serves as an interface to a page of your AUT… There is a clean separation between the test code and page-specific code, such as locators."

The three load-bearing rules:

  1. No assertions in the POM body. Fowler: "Page objects are most commonly used in testing, but should not make assertions themselves." Selenium HQ: "Page objects themselves should never make verifications or assertions. This is part of your test and should always be within the test's code, never in a page object." One narrow exception (Selenium HQ): a verification at instantiation that the page loaded.
  2. Navigation methods return the next POM. Fowler: "If you navigate to another page, the initial page object should return another page object for the new page." This enables compile-time detection of broken workflows.
  3. POM exposes the page's services, not its widgets. Methods are named after the user-meaningful action (addToCart, submitOrder), not the DOM mechanic (clickButton, typeIntoField).

When to use POM

  • The SUT is page-oriented (traditional multi-page web app, server-rendered).
  • The team has 3+ engineers and needs locator deduplication.
  • The framework is Selenium / WebdriverIO / classic Playwright codegen output.

Anti-patterns (canonical)

Anti-patternWhy it fails
Assertions inside the POMCouples the page model to test outcomes; reuse across tests becomes brittle
void-returning navigation methodsLoses the compile-time check Fowler explicitly identifies as the pattern's benefit
clickAddToCartButton() instead of addToCart()Couples the test vocabulary to UI mechanics - when the UI changes, every test changes
Exposing the underlying WebDriver / Page instance through public POM methodsLeaks framework details into tests; defeats the encapsulation
One God-POM serving five pagesViolates single-responsibility; bigger refactor cost than the POM was supposed to prevent

Pattern 2 - Screenplay

Canonical source: Antony Marcano, Andy Palmer, and Jan Molak - Serenity BDD documentation on Screenplay; origin paper Marcano & Hill 2007 "Page Objects Refactored: SOLID Steps to the Screenplay Pattern."

The Screenplay vocabulary (Serenity BDD docs):

TermDefinition
ActorThe user or system performing tasks. "In Screenplay we model actors who interact with an application in various ways to perform tasks that help them achieve their goals."
AbilityA capability that enables actors to perform tasks (e.g., BrowseTheWeb, CallAnApi).
TaskA higher-level domain concept that groups Interactions (e.g., Login, AddToCart).
InteractionA low-level operation (click, type, fetch).
QuestionA query about system state used in assertions (e.g., TheCartTotal.value()).

Why Screenplay vs POM: Screenplay separates what the user does (Tasks, Interactions) from what the user can do (Abilities) from what the user observes (Questions). The result is a SOLID-aligned object model that survives UI refactors better than POM in large suites.

When to use Screenplay

  • The framework will exceed ~200 tests; the SOLID separation pays off at scale.
  • The team has Java / Kotlin / TypeScript / Python engineers who appreciate dependency-injection-style composition.
  • The SUT has multiple actor types (admin user, anonymous user, API client) that share underlying UI/API interactions.
  • The team uses Serenity BDD, SerenityJS, Boa Constrictor (.NET), or Screenplay-style implementations.

Anti-patterns

Anti-patternWhy it fails
Mixing Screenplay and POM in the same codebaseDoubles the maintenance surface; engineers can't tell which to write
Tasks that do not call Interactions (Task = re-named POM method)Loses the Screenplay benefit; the team got Page Object Model under a different name
Question classes that mutate stateViolates the Question's "pure observation" contract; assertions on observations fail unpredictably
Abilities used as a junk-drawer for utilitiesThe Ability should grant a real capability; using it as a service-locator defeats the dependency-injection benefit

Pattern 3 - Component Object

Canonical source: Selenium HQ docs (Page Components are part of the official POM extension) + practitioner consensus (testing-library, Storybook, Playwright Component Testing). Treated as a refinement of POM, not a competing pattern.

Definition: A Component Object is a Page Object scoped to a UI component (header, nav, form, modal, card) rather than a whole page. Where a page contains a re-used component (the navbar appears on every page), the Component Object models that component once; each Page Object that contains it composes it in.

When to use Component Object

  • The SUT is component-architected (React, Vue, Svelte, Angular, Web Components).
  • A small number of components appear on every page (navbar, footer, search box, modal).
  • Storybook / per-component visual testing is part of the suite.

Anti-patterns

Anti-patternWhy it fails
Modelling every DOM element as a Component ObjectComponent Objects are for re-used components, not every <div>
Component Objects that hold cross-component stateViolates the encapsulation; the component should not know which page contains it
Page Objects that bypass the Component Object and target its internalsThe Component Object's locators get duplicated; refactor leakage

Pattern 4 - App Actions (Cypress idiom)

Canonical source: Kent C. Dodds and the Cypress team - "Stop using Page Objects and Start using App Actions" (Cypress blog).

Definition: App Actions bypass the UI for setup steps by exposing application functions (Redux dispatches, store mutations, API calls) directly via cy.window().its('app') or equivalent. The test still asserts via the UI; only the Arrange phase is short-circuited.

Why App Actions vs POM: "Logging in" is not what the test is about - it's overhead. App Actions skip the login UI flow and inject a session directly, making the test 10× faster and removing flake from the login form.

When to use App Actions

  • The framework is Cypress (the pattern is named after the Cypress idiom; other frameworks adapt it).
  • The SUT exposes a deterministic state-setting API (Redux store, programmatic auth).
  • Setup is dominated by repeated UI flows (login, seed cart, navigate-to-deep-page).
  • The team is willing to accept the lock-in to the SUT's internal API surface.

Anti-patterns

Anti-patternWhy it fails
App Actions for the Act phase (the thing under test)The test no longer verifies the UI path under test
App Actions that aren't documented as test-only surfaceProduction code accidentally depends on the test-only API
Mixing App Actions and POM without conventionEngineers can't tell which to use; the suite forks
App Actions for end-to-end smoke / critical-path testsCritical paths must exercise the full UI; App Actions skip the very thing the smoke proves

Pattern 5 - Service Object

Canonical source: Ruby on Rails / Java enterprise testing patterns + practitioner blog consensus. Refinement of POM for non-UI test layers.

Definition: A Service Object is the API-test equivalent of a Page Object - it wraps a remote service (REST endpoint, GraphQL query, gRPC method, message-queue producer) with a domain API the test consumes. Methods like cartService.addItem(sku, qty) rather than httpClient.post('/api/cart/items', { sku, qty }).

When to use Service Object

  • API / contract / integration tests where the test code calls an HTTP / gRPC / queue interface repeatedly.
  • The SUT has 5+ services and the test code would otherwise be drowning in HTTP boilerplate.
  • The team uses Pact, schemathesis, RestAssured, Karate, or any framework with raw HTTP at the test layer.

Anti-patterns

Anti-patternWhy it fails
Service Object that re-implements the production service (mocks-in-disguise)Tests against a fake instead of the real service; misses contract drift
Service Object with assertions insideSame anti-pattern as POM assertions - couples model to test outcomes
Single Service Object for 10 different servicesViolates single-responsibility; the object becomes a god-client
Service Object that handles retries / circuit breakers identical to productionTests pass because the Service Object hides the failures the test should catch

Pattern 6 - Repository (test-data access)

Canonical source: Martin Fowler's Repository pattern (originally for domain-driven design) adapted for test-data setup. Practitioner adoption in 2020+ test frameworks (factory libraries layer on top).

Definition: A Repository in test context is the data-access abstraction that hides the storage mechanism (DB, fixture file, factory call) behind a domain API: userRepo.createTestUser({ role: 'admin' }).

When to use Repository

  • The framework needs deterministic test-data setup across DB / fixture file / factory.
  • Multiple test types (unit / integration / E2E) need the same setup logic.
  • The data-source mechanism is likely to change (DB schema migration, factory library swap).

Anti-patterns

Anti-patternWhy it fails
Repository that mixes test setup with production data fetchingProduction code accidentally adopts test-only quirks
Repository methods that return mutable objects shared across testsTest cross-coupling; one test mutates and breaks another
Repository that creates "magic" data the test doesn't seeTests pass for inscrutable reasons; debugging is impossible

Pattern 7 - Screen Object (desktop / mobile sibling of Page Object)

Canonical source: Martin Fowler's PageObject article - the current bliki entry opens with the note that "An object that wraps an HTML page, or fragment, with an application-specific API." The earlier name WindowDriver (Fowler, 2004) covered desktop GUI windows under the same encapsulation principle before the term migrated to web. The desktop / mobile community reuses the structurally-identical pattern under the name Screen Object (one class per logical screen, locators + actions encapsulated, no assertions inside). No single owner formally documents the rename - screen object is community-canonical across FlaUI, XCUITest, Appium / Espresso practitioner literature.

The mobile sibling is documented inside Google's Android testing guidance as Screen Robot (Jake Wharton - Instrumentation Testing Robots (2016)) and inside Square's mobile literature as well; both reproduce the same encapsulation contract.

The three load-bearing rules transfer unchanged from POM:

  1. No assertions in the Screen Object body. Same rationale Fowler gives for POM ("page objects … should not make assertions themselves"). The desktop test asserts on window.Title, element.IsEnabled, control-pattern state; the Screen Object exposes those via getters but does not verify them.
  2. Navigation methods return the next Screen Object. login.SubmitsCredentials() returns MainScreen. Compile-time detection of broken workflows survives the migration from web POM to desktop Screen Object.
  3. Screen Object exposes the screen's services, not its widgets. login.SubmitsCredentials(creds) not login.LoginButton.Click(). Methods are named after the user-meaningful action - same vocabulary rule as POM.

When to use Screen Object

  • Desktop / mobile SUT routed through any accessibility-tree backend per desktop-test-strategy-reference: Windows UIA (FlaUI, WinAppDriver, Appium-Windows), macOS XCTest (XCUIApplication / XCUIElementQuery per Apple's Testing with Xcode UI Testing chapter), Linux AT-SPI (dogtail / pyatspi).
  • Mobile-native SUT (Appium, Espresso, XCUITest on iOS) - same encapsulation, sometimes branded "Screen Robot" per the Wharton citation above.
  • Cross-platform desktop frameworks (Avalonia, .NET MAUI) where the same screen exists across OSes but the accessibility backend differs per host.

Anti-patterns (Screen Object-specific in addition to the POM list)

Anti-patternWhy it fails
Screen Object that hard-codes AutomationId strings inline in every method (e.g. cf.ByAutomationId("LoginButton") repeated)Refactor cost when the developer renames the AutomationId; centralise the constant at the top of the Screen class
Screen Object that wraps a single accessibility-tree call without adding a domain methodSame anti-pattern as the POM clickAddToCartButton() smell - Screen exposes mechanic, not service
Screen Object that asserts on accessibility properties (role, label) it controlsAsserting on internal state defeats the no-assertions rule; assertions belong in the test
Screen Object that calls Thread.Sleep / Task.Delay between actionsHides flakiness; route through the driver's retry primitive (FlaUI Retry.WhileNull, XCTest waitForExistence)
Screen Object that depends on absolute window coordinatesDefeats the accessibility-tree abstraction; multi-monitor / DPI / locale breaks the test
One Screen Object class per dialog AND per main view in the same screenModal sub-screens are nested Screen Objects; do not flatten

Worked desktop example (FlaUI / xUnit)

Bad (mechanical leakage into the test body - same shape as the web POM anti-pattern):

[StaFact]
public void Logs_in_with_valid_credentials() {
    var window = _fx.App.GetMainWindow(_fx.Automation);
    window.FindFirstDescendant(cf => cf.ByAutomationId("Username")).AsTextBox().Enter("alice@example.com");
    window.FindFirstDescendant(cf => cf.ByAutomationId("Password")).AsTextBox().Enter("hunter2");
    window.FindFirstDescendant(cf => cf.ByAutomationId("LoginButton")).AsButton().Invoke();
    Assert.Equal("Invoices", _fx.App.GetMainWindow(_fx.Automation).Title);
}

Good (Screen Object at the business layer):

[StaFact]
public void Logs_in_with_valid_credentials() {
    var login = new LoginScreen(_fx.App.GetMainWindow(_fx.Automation));
    var main  = login.SubmitsCredentials("alice@example.com", "hunter2");
    Assert.Equal("Invoices", main.Title);
}

The mechanics live inside LoginScreen (constants for AutomationIds, retry-wrapped element fetches, SubmitsCredentials returns the next Screen Object). The test reads as a specification.

Pattern selection matrix

The patterns are not equally good for every project. The matrix:

PatternBest forAvoid for
POMPage-oriented web SUT, 3-50 engineers, classic frameworksComponent-first React/Vue (use Component Object); Cypress (consider App Actions)
ScreenplayLarge suites (200+ tests), multiple actor types, SOLID enthusiastsSmall projects (overhead exceeds benefit); teams allergic to dependency injection
Component ObjectReact/Vue/Svelte component-architected SUT, Storybook-integratedServer-rendered traditional pages (use POM)
App ActionsCypress + Redux/store-architected SUT, setup-heavy testsCritical-path / smoke tests (must exercise UI); SUT without programmatic state API
Service ObjectAPI / integration / contract tests with 5+ servicesUI-only tests (no service calls); contract tests via schemathesis (the tool generates its own client)
RepositoryMulti-data-source projects, DB + fixture + factory in one suiteSingle-source projects (overhead exceeds benefit)
Screen ObjectDesktop / mobile SUT through any accessibility-tree backend (UIA, XCTest, AT-SPI, Appium / Espresso)Pure web SUT (use POM); pure API tests (use Service Object)

Cross-cutting anti-patterns (apply to any pattern)

Anti-patternWhy it fails
Mixing two object-model patterns in the same codebaseEngineers can't tell which to write; vocabulary drift accelerates
Inheritance hierarchies >2 levels deep (BasePage → AppPage → DomainPage → SpecificPage)Per framework-architecture-auditor §A2, depth-3+ chains break unpredictably on root-level changes
Page / Component / Task / Service Objects holding mutable test dataCross-test coupling; parallel-execution breakage
Public getter-style methods that expose locators (get loginButton())Defeats encapsulation; locators leak into test code
Object-model methods that wait, retry, or handle SUT errorsHides flakiness; tests pass when they should fail loudly
Object-model classes that import test-framework assertion librariesImplies assertions are happening inside; smell for the no-assertion rule

Hand-off targets

References

  • Martin Fowler - PageObject (canonical cross-language definition, the load-bearing reference for all POM rules; the earlier WindowDriver name covered desktop GUI before the term migrated to web): https://martinfowler.com/bliki/PageObject.html
  • Jake Wharton - Instrumentation Testing Robots (2016) - the canonical "Screen Robot" reference for the mobile sibling of the Screen Object pattern, also applicable to desktop: https://jakewharton.com/testing-robots/
  • Apple - Testing with Xcode - UI Testing chapter, the XCUIApplication / XCUIElementQuery / XCUIElement reference for macOS Screen Objects: https://developer.apple.com/library/archive/documentation/DeveloperTools/Conceptual/testing_with_xcode/chapters/09-ui_testing.html
  • desktop-test-strategy-reference - the OS-backend reference for Screen Object's accessibility-tree substrate (UIA / XCTest / AT-SPI).
  • Selenium HQ - Page Object Models (official Selenium documentation; quotes the no-assertions and navigation-return-shape rules verbatim): https://www.selenium.dev/documentation/test_practices/encouraged/page_object_models/
  • Antony Marcano, Andy Palmer, Jan Molak - Screenplay Fundamentals (Serenity BDD documentation; the canonical Actor/Ability/Task/Interaction/Question vocabulary): https://serenity-bdd.github.io/docs/screenplay/screenplay_fundamentals
  • Marcano & Hill (2007) - Page Objects Refactored: SOLID Steps to the Screenplay Pattern (the origin paper for the Screenplay name and SOLID rationale; cited via Serenity BDD): https://serenity-bdd.github.io/docs/screenplay/
  • Kent C. Dodds + Cypress team - Stop using Page Objects and Start using App Actions (Cypress blog, the canonical App Actions reference): https://www.cypress.io/blog/stop-using-page-objects-and-start-using-app-actions/
  • Martin Fowler - Repository pattern (originally domain-driven design; cited for test-data Repository adaptation): https://martinfowler.com/eaaCatalog/repository.html
  • ISTQB glossary - Page Object (the canonical ISTQB entry confirming the pattern is industry-standard): https://glossary.istqb.org/en_US/term/page-object
  • ISTQB glossary - Service Virtualisation (related concept; the Service Object is the test-side counterpart): https://glossary.istqb.org/en_US/term/service-virtualization
  • test-code-conventions - file-level companion (§1-§10).
  • framework-architecture-auditor - the reviewer that audits codebases against these patterns; preloads this skill.
  • playwright-codegen-reviewer, spec-to-e2e-test-scaffolder - agents that apply these patterns.