<< All versions
Skill v1.0.1
currentAutomated scan100/100majiayu000/claude-skill-registry/browser-automation-core
3 files
──Details
PublishedMay 22, 2026 at 06:52 PM
Content Hashsha256:4990197a2b14ede9...
Git SHA87774d02808d
Bump Typepatch
──Files
Files (1 file, 4.7 KB)
SKILL.md4.7 KBactive
SKILL.md · 86 lines · 4.7 KB
version: "1.0.1" name: "Core Browser Automation" description: "General-purpose patterns for reliable browser automation (selectors, waiting, scrolling, overlays, HITL)." tools:
- playwright
Selector Strategy (Stability First)
- Priority 1:
data-testid,data-test,data-cy,id(if stable). - Priority 2: Accessible roles with names (e.g.,
getByRole('button', { name: 'Submit' })). - Priority 3: Text content (e.g.,
getByText('Submit')) - use with caution if text is dynamic. - Avoid: Brittle CSS selectors (e.g.,
div > div:nth-child(3)), XPath, or selectors tied to visual layout.
Waiting Strategy (No Flaky Sleeps)
- Explicit Waits: Wait for elements to be attached, visible, and enabled before interacting.
- State Changes: Wait for clear UI signals (spinners disappearing, success messages appearing).
- Bounded Polling: If no clear signal exists, use a loop with a short sleep (1-2s) and a max retry count.
- Avoid: Long blind sleeps (e.g.,
sleep(5000)).
Evidence Capture Hygiene
- When a tool accepts a
filename, prefer a simple filename (e.g.,tab2-detail-limit-200.png) rather than passing a full/relative directory path; some environments will prepend their own output directory and can accidentally create nested paths.
Scroll Strategy
- Window vs. Container: Determine if the scrollbar belongs to the
windowor a specific container element. - Incremental Scan: Scroll in small chunks (e.g., half viewport) to trigger lazy-loading or reveal elements.
- Check after Scroll: Re-evaluate the page state after scrolling (elements might become visible).
Handling Overlays & Modals
- Detection: Watch for common overlay selectors (dialogs, cookie banners, "interstitial" layers).
- Dismissal: Look for "Close", "X", "Accept", "Reject", "No thanks" buttons.
- Click Intercepted: If a click fails due to an overlay, find the overlay, dismiss it, and retry the click.
Frames & Iframes
- Detection: If an element is not found, check if it resides within an
iframe. - Switching: Switch context to the iframe before querying elements inside it.
Embedded BI Dashboards (Tableau-like)
Common traits
- The interactive visualization is often inside an
iframe(e.g.,iframe[title="Data Visualization"]). - Many charts/tables are canvas-rendered: the underlying DOM won’t contain the visible text, so
locator('text=...')may fail even when the label is on screen. - The accessibility tree (roles like
treegrid,row,gridcell,textbox,button) is frequently the most reliable way to target UI.
Reliable interaction patterns
- Target inside the iframe using role-based selectors (preferred):
frameLocator(...).getByRole(...). - Prefer table-row evidence over brittle filters when filters are multi-select or unclear.
- If a category row is not present due to a “top N” limit, increase a visible limit control (e.g.,
Detail Limit) to expose the desired rows. - Confirm state from the UI: after applying a filter or changing a limit, re-snapshot and verify a clear change (row appears, numbers change, undo/revert enables, etc.).
Quick filter pitfalls + recovery
- Typing into a quick-filter search box may only filter the list of options and may not apply the filter until a checkbox/value is explicitly selected.
- Some quick-filter widgets expose a textbox in the accessibility tree but don’t expose a stable DOM
inputselector. - When a filter isn’t collapsing the view as expected, use a deterministic fallback:
- Increase the visible row limit (e.g.,
Detail Limit) and read the exact row’s values directly.
Platform note (macOS)
- Keyboard modifiers differ:
- Select all:
Meta(Command) +Aon macOS (notControl+A).
Human-in-the-Loop (HITL) Policy
- Auth: Stop for Login/SSO/MFA/CAPTCHA. Ask user to complete and type "Done".
- Irreversible Actions: ALWAYS ask for explicit confirmation before clicking:
- Submit / Complete / Finish
- Attest / Certify
- Approve / Confirm / Yes
- Send / Pay
- Ambiguity: If unsure if an action is irreversible, ASK first.
Recovery Rules
- Element not found:
- Check for iframes.
- Check for shadow DOM.
- Check if the element is behind an overlay.
- Scroll to bring it into view.
- Click intercepted:
- Identify the obscuring element.
- Dismiss it (if it's a modal/banner).
- Wait for it to disappear (if it's a toast/spinner).
- Stale element:
- Re-query the element from the DOM before interacting.
- Visible text not found (canvas / viz rendering):
- Assume the text may not exist in the DOM.
- Re-target via accessibility roles (e.g.,
treegrid/gridcell) or use a “show more rows”/limit control to surface the needed row.