Performance tuning¶
This guide covers practical knobs and patterns for high-volume usage.
Pagination sizing¶
- Prefer larger `limit` values for throughput (fewer requests), but keep response sizes reasonable for your workload.
- If you hit 429s, lower concurrency first (see below), then consider reducing `limit`.
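For example, a larger page size on a paginated read might look like the sketch below; the `client.persons.list(limit=...)` call is illustrative, not the SDK's exact pagination API:

```python
from affinity import Affinity

# Illustrative only: the resource name and pagination interface are
# assumptions, not the SDK's documented API.
with Affinity(api_key="your-key") as client:
    # A larger limit means fewer requests for the same number of rows.
    for person in client.persons.list(limit=250):
        ...
```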
Concurrency (async)¶
- Run independent reads concurrently, but cap concurrency (e.g., 5–20 in flight depending on rate limits and payload size).
- When you see 429s, reduce concurrency and let the SDK respect `Retry-After`.
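A minimal sketch of capped concurrency with an `asyncio.Semaphore`; the async client name (`AsyncAffinity`) and the `client.persons.get` call are assumptions for illustration:

```python
import asyncio

from affinity import AsyncAffinity  # assumed name of the async client

MAX_IN_FLIGHT = 10  # tune to your rate limits and payload sizes

async def fetch_people(person_ids: list[int]) -> list:
    semaphore = asyncio.Semaphore(MAX_IN_FLIGHT)

    async with AsyncAffinity(api_key="your-key") as client:

        async def fetch_one(person_id: int):
            # The semaphore caps how many requests are in flight at once.
            async with semaphore:
                return await client.persons.get(person_id)  # illustrative read

        return await asyncio.gather(*(fetch_one(pid) for pid in person_ids))

results = asyncio.run(fetch_people([1, 2, 3]))
```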
Connection pooling¶
The SDK uses httpx connection pooling. For high-throughput clients:
- Reuse a single client instance for many calls (don’t create a new `Affinity` per request).
- Close clients when done (use a context manager).
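For example (the `client.persons.get` read is illustrative):

```python
from affinity import Affinity

person_ids = [1, 2, 3]

# One client instance shares a single httpx connection pool across all calls.
with Affinity(api_key="your-key") as client:
    for person_id in person_ids:
        client.persons.get(person_id)  # illustrative read call

# Anti-pattern: creating a client per request defeats connection reuse.
# for person_id in person_ids:
#     with Affinity(api_key="your-key") as client:
#         client.persons.get(person_id)
```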
HTTP/2¶
If your environment supports it, enabling HTTP/2 can improve performance for many small concurrent requests:
```python
from affinity import Affinity

client = Affinity(api_key="your-key", http2=True)
```
Timeouts and deadlines¶
- Use the global `timeout` to set a sensible default for API requests.
- For large file downloads, use per-call `timeout` and `deadline_seconds` to bound total time spent (including retries/backoff).
```python
from affinity import Affinity
from affinity.types import FileId

with Affinity(api_key="your-key", timeout=30.0) as client:
    for chunk in client.files.download_stream(FileId(123), timeout=60.0, deadline_seconds=300):
        ...
```
Caching¶
The SDK provides optional in-memory caching for metadata-style responses (field definitions, list configurations). This reduces API calls for frequently-accessed, slowly-changing data.
Enabling cache¶
```python
from affinity import Affinity

# Enable with default 5-minute TTL
client = Affinity(api_key="your-key", enable_cache=True)

# Custom TTL (in seconds)
client = Affinity(api_key="your-key", enable_cache=True, cache_ttl=600.0)
```
Long-running applications¶
For long-running processes (web servers, background workers), be aware that cached data may become stale:
- Field definitions may change if admins add/modify fields
- List configurations may change if lists are reconfigured
- Default TTL is 5 minutes (300 seconds)
Recommendations:
- Choose appropriate TTL: Match your TTL to how often metadata changes in your organization
- Invalidate on known changes: Clear cache after operations that modify metadata
- Periodic refresh: For very long processes, consider periodic cache clears
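A sketch of the periodic-refresh pattern for a long-running worker, using `clear_cache()` (see Manual cache invalidation below); the worker loop itself is illustrative:

```python
import time

from affinity import Affinity

CACHE_REFRESH_INTERVAL = 15 * 60  # seconds; tune to how often metadata changes

def do_work(client: Affinity) -> None:
    ...  # your worker's actual unit of work

client = Affinity(api_key="your-key", enable_cache=True, cache_ttl=600.0)
last_refresh = time.monotonic()

while True:
    do_work(client)

    # Drop cached metadata periodically so field/list changes are picked up
    # even if individual cache entries have not yet expired.
    if time.monotonic() - last_refresh > CACHE_REFRESH_INTERVAL:
        client.clear_cache()
        last_refresh = time.monotonic()
```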
Manual cache invalidation¶
```python
# Clear all cached entries
client.clear_cache()
```
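For example, in an application that also modifies metadata (the modifying call is left abstract here):

```python
from affinity import Affinity

client = Affinity(api_key="your-key", enable_cache=True)

# ... an operation that modifies field definitions or list configuration
# (the exact call depends on your workflow) ...

# Invalidate so the next metadata read reflects the change.
client.clear_cache()
```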
When to use caching¶
| Scenario | Recommendation |
|---|---|
| Short-lived scripts | Caching optional (few repeated calls) |
| CLI tools | Enable caching (reduces latency for field lookups) |
| Web servers | Enable with appropriate TTL |
| Background workers | Enable, consider periodic cache refresh |
Cache isolation¶
Cache is isolated per API key and base URL combination, so multiple clients with different credentials won't share cached data.
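For example, two clients with different API keys each maintain a separate cache:

```python
from affinity import Affinity

# Each client keeps its own cache: metadata cached under one API key is
# never returned to a client using a different key (or base URL).
client_a = Affinity(api_key="key-for-team-a", enable_cache=True)
client_b = Affinity(api_key="key-for-team-b", enable_cache=True)
```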
CLI session caching¶
For CLI pipelines, use session caching to share metadata across invocations:
```bash
export AFFINITY_SESSION_CACHE=$(xaffinity session start)
xaffinity list export "My List" | xaffinity person get
xaffinity session end
```
See CLI Pipeline Optimization for details.