BioMapper Test Architecture

Overview

The BioMapper test suite follows Test-Driven Development (TDD) principles with comprehensive coverage of the 37+ self-registering actions. Tests are organized by entity type mirroring the action structure, with a minimum 80% coverage requirement enforced in CI/CD.

Directory Structure

tests/
├── unit/                           # Unit tests
│   ├── core/
│   │   ├── standards/              # Standards tests
│   │   │   ├── test_file_loader.py
│   │   │   └── test_file_validator.py
│   │   ├── strategy_actions/       # Core action tests
│   │   │   ├── test_load_dataset_identifiers.py
│   │   │   ├── test_calculate_set_overlap.py
│   │   │   ├── test_merge_with_uniprot_resolution.py
│   │   │   └── utils/              # Utility tests
│   │   │       ├── test_progressive_wrapper.py
│   │   │       └── data_processing/
│   │   │           └── test_calculate_mapping_quality.py
│   │   └── test_control_flow.py   # Control flow tests
│   └── strategy_actions/          # Direct action tests
│       ├── test_build_nightingale_reference.py
│       ├── test_semantic_metabolite_match.py
│       ├── test_merge_datasets.py
│       └── test_action_aliases.py
├── scripts/                        # Script tests
│   ├── test_refactored_scripts.py
│   └── test_metabolomics_wrapper.py
├── test_identifier_normalization.py # Identifier tests
└── conftest.py                    # Test configuration and fixtures

Test Categories

Unit Tests

Action Tests (37+ actions)
- Parameter validation with Pydantic models
- execute_typed() method implementation
- Context manipulation (datasets, statistics, output_files)
- Error handling and edge cases
- Backward compatibility with dict interface

Registry Tests
- Self-registration via @register_action
- ACTION_REGISTRY population
- Dynamic action lookup
- Import-time registration

Model Tests
- ActionResult validation
- Parameter model constraints
- Field validators
- Type coercion
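
The registry checks listed above can be illustrated with a small stand-alone sketch. It assumes the decorator takes an action name and stores the class in a module-level dict; the ACTION_REGISTRY and register_action defined here are local stand-ins, and EXAMPLE_NOOP and ExampleNoopAction are hypothetical names, so the real BioMapper implementation may differ.

```python
from typing import Callable, Dict, Type

# Hypothetical stand-in for the project's registry; the real ACTION_REGISTRY
# and register_action live in the BioMapper code base and may differ.
ACTION_REGISTRY: Dict[str, Type] = {}


def register_action(name: str) -> Callable[[Type], Type]:
    """Class decorator that records the class under `name` at definition time."""
    def decorator(cls: Type) -> Type:
        ACTION_REGISTRY[name] = cls
        return cls
    return decorator


@register_action("EXAMPLE_NOOP")  # hypothetical action name
class ExampleNoopAction:
    """Placeholder action used only to exercise the registry."""


def test_action_is_registered() -> None:
    # Registration happened when the class statement above was evaluated.
    assert ACTION_REGISTRY["EXAMPLE_NOOP"] is ExampleNoopAction


def test_unknown_action_is_absent() -> None:
    # Dynamic lookup of an unregistered name should find nothing.
    assert ACTION_REGISTRY.get("DOES_NOT_EXIST") is None
```

Because registration runs when the class statement is evaluated, importing the module that defines an action is enough to populate the registry, which is what an import-time registration test would check.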

Integration Tests

Strategy Execution Tests
- Complete YAML strategy workflows
- Variable substitution (${parameters.key}, ${env.VAR})
- Multi-step action sequences
- Context flow between actions

API Tests
- /api/strategies/v2/ endpoint
- Job management with SQLite persistence
- Server-Sent Events streaming
- Background job processing
- Checkpoint recovery

Client Tests
- BiomapperClient.run() synchronous wrapper
- Error handling and retries
- Progress streaming
- Timeout management
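
To make the variable-substitution item concrete, here is a rough sketch of a resolver and a test for it. It assumes only the two placeholder forms named above, ${parameters.key} and ${env.VAR}; the substitute function and its KeyError behaviour are illustrative assumptions, not BioMapper's actual resolver.

```python
import re
from typing import Any, Mapping

# Matches ${parameters.key} and ${env.VAR}; any other text is left untouched.
_PLACEHOLDER = re.compile(r"\$\{(parameters|env)\.([A-Za-z_][A-Za-z0-9_]*)\}")


def substitute(text: str, parameters: Mapping[str, Any], env: Mapping[str, str]) -> str:
    """Replace placeholders in `text`; raise KeyError for unknown names (assumed)."""
    def replace(match: "re.Match[str]") -> str:
        namespace, key = match.group(1), match.group(2)
        source = parameters if namespace == "parameters" else env
        return str(source[key])
    return _PLACEHOLDER.sub(replace, text)


def test_substitution() -> None:
    result = substitute(
        "${env.DATA_DIR}/${parameters.dataset}.csv",
        parameters={"dataset": "proteins"},
        env={"DATA_DIR": "/tmp/data"},
    )
    assert result == "/tmp/data/proteins.csv"
```

Passing the environment as an argument rather than reading os.environ directly keeps the test free of process-level state.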

Test Patterns for Typed Actions

TDD Approach for New Actions

# Step 1: Write failing test first
import pytest
from pydantic import ValidationError


class TestProteinNormalizeAction:
    def test_parameter_validation(self):
        """Test Pydantic parameter validation."""
        params = ProteinNormalizeParams(
            input_key="raw_proteins",
            output_key="normalized",
            remove_isoforms=True,
            validate_format=True
        )
        assert params.input_key == "raw_proteins"
        assert params.remove_isoforms is True

    def test_invalid_params(self):
        """Test validation errors."""
        with pytest.raises(ValidationError) as exc:
            ProteinNormalizeParams(
                input_key="",  # Invalid: empty string
                output_key="normalized"
            )
        assert "empty" in str(exc.value)

Typed Action Execution Tests

class TestProteinNormalizeAction:
    @pytest.mark.asyncio
    async def test_execute_typed(self, mock_context):
        """Test typed execution with shared context."""
        # Arrange
        action = ProteinNormalizeAction()
        mock_context["datasets"]["raw_proteins"] = [
            {"identifier": "P12345-1"},
            {"identifier": "Q67890"}
        ]
        params = ProteinNormalizeParams(
            input_key="raw_proteins",
            output_key="normalized",
            remove_isoforms=True
        )
        
        # Act
        result = await action.execute_typed(params, mock_context)
        
        # Assert
        assert result.success
        assert "normalized" in mock_context["datasets"]
        assert len(mock_context["datasets"]["normalized"]) == 2
        assert mock_context["datasets"]["normalized"][0]["identifier"] == "P12345"
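
The assertions above expect P12345-1 to become P12345 while Q67890 stays unchanged. A minimal sketch of logic that would satisfy them follows, assuming an isoform suffix is a trailing hyphen plus digits. The names strip_isoform and normalize_proteins are hypothetical and do not reproduce the ProteinNormalizeAction implementation.

```python
import asyncio
from typing import Any, Dict, List


def strip_isoform(identifier: str) -> str:
    """Drop a trailing '-<digits>' isoform suffix, leaving other identifiers as-is."""
    base, sep, suffix = identifier.rpartition("-")
    return base if sep and suffix.isdigit() else identifier


async def normalize_proteins(context: Dict[str, Any], input_key: str, output_key: str) -> None:
    """Hypothetical stand-in for the action body: read one dataset, write another."""
    rows: List[Dict[str, str]] = context["datasets"][input_key]
    # Build new row dicts so the input dataset is not mutated.
    context["datasets"][output_key] = [
        {**row, "identifier": strip_isoform(row["identifier"])} for row in rows
    ]


context = {"datasets": {"raw_proteins": [{"identifier": "P12345-1"}, {"identifier": "Q67890"}]}}
asyncio.run(normalize_proteins(context, "raw_proteins", "normalized"))
```

Writing the result under a separate output key leaves the raw dataset available to later actions in the same strategy.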

Backward Compatibility Tests

class TestBackwardCompatibility:
    @pytest.mark.asyncio
    async def test_dict_interface(self, mock_context):
        """Test legacy dict-based interface still works."""
        action = ProteinNormalizeAction()
        
        # Legacy dict params
        params_dict = {
            "input_key": "raw_proteins",
            "output_key": "normalized",
            "remove_isoforms": True
        }
        
        # Should work via execute() wrapper
        result = await action.execute(params_dict, mock_context)
        
        assert result["success"] is True
        assert "normalized" in result["message"]
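
One way such an execute() wrapper could delegate to execute_typed() is sketched below. This is an assumption about the pattern, not the project's base class: dataclasses stand in for the Pydantic models, and EchoParams, EchoResult, and EchoAction are hypothetical names.

```python
import asyncio
from dataclasses import asdict, dataclass
from typing import Any, Dict


@dataclass
class EchoParams:
    """Hypothetical params; a dataclass stands in for the Pydantic model."""
    input_key: str
    output_key: str


@dataclass
class EchoResult:
    """Hypothetical result type returned by the typed interface."""
    success: bool
    message: str


class EchoAction:
    """Hypothetical action showing a typed core and a legacy dict wrapper."""

    async def execute_typed(self, params: EchoParams, context: Dict[str, Any]) -> EchoResult:
        # Copy the input dataset to the output key in the shared context.
        context["datasets"][params.output_key] = list(context["datasets"][params.input_key])
        return EchoResult(success=True, message=f"Wrote dataset '{params.output_key}'")

    async def execute(self, params: Dict[str, Any], context: Dict[str, Any]) -> Dict[str, Any]:
        # Legacy entry point: build typed params, delegate, return a plain dict.
        typed = EchoParams(**params)
        return asdict(await self.execute_typed(typed, context))


legacy_context = {"datasets": {"raw": ["P12345"]}}
legacy_result = asyncio.run(
    EchoAction().execute({"input_key": "raw", "output_key": "copy"}, legacy_context)
)
```

Keeping all logic in execute_typed() means the dict interface only needs tests for the conversion in and out, not for the action's behaviour a second time.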

Test Fixtures

Common Fixtures (conftest.py)

@pytest.fixture
def mock_context():
    """Mock context for action testing."""
    class MockContext:
        def __init__(self):
            self._data = {'custom_action_data': {}}
        
        def set_action_data(self, key: str, value):
            self._data['custom_action_data'][key] = value
        
        def get_action_data(self, key: str, default=None):
            return self._data.get('custom_action_data', {}).get(key, default)
    
    return MockContext()

@pytest.fixture
def temp_csv_file(tmp_path):
    """Create temporary CSV file for testing."""
    csv_file = tmp_path / "test_data.csv"
    csv_file.write_text("id,name\nP12345,Protein1\nQ67890,Protein2\n")
    return csv_file

Action-Specific Fixtures

@pytest.fixture
def sample_merged_data():
    """Sample merged data for overlap calculations."""
    return [
        {
            "source_id": "P12345",
            "target_id": "ENSP000123",
            "match_type": "direct",
            "match_confidence": 1.0
        }
    ]

Testing Strategy

Shared Context Pattern

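Actions receive and modify a shared Dict[str, Any] context. The typed-action and backward-compatibility examples above index the fixture as a dictionary (mock_context["datasets"]), so they rely on this dict-shaped fixture rather than the MockContext object shown under Common Fixtures: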

@pytest.fixture
def mock_context():
    """Standard context structure for action testing."""
    return {
        "datasets": {},           # Named datasets
        "current_identifiers": [], # Active identifiers
        "statistics": {},         # Accumulated metrics
        "output_files": [],       # Generated files
        "metadata": {}            # Strategy metadata
    }
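
A short sketch of why the context is rebuilt for every test: each call returns a fresh dict, so data written by one sequence of actions cannot leak into the next. The make_context, load_step, and count_step names are hypothetical helpers used only for illustration.

```python
from typing import Any, Dict


def make_context() -> Dict[str, Any]:
    """Build a fresh context with the keys listed above."""
    return {
        "datasets": {},
        "current_identifiers": [],
        "statistics": {},
        "output_files": [],
        "metadata": {},
    }


def load_step(context: Dict[str, Any]) -> None:
    """Hypothetical first action: store a dataset."""
    context["datasets"]["proteins"] = ["P12345", "Q67890"]


def count_step(context: Dict[str, Any]) -> None:
    """Hypothetical second action: derive a statistic from the stored dataset."""
    context["statistics"]["protein_count"] = len(context["datasets"]["proteins"])


first = make_context()
load_step(first)
count_step(first)  # reads what load_step wrote into the shared context

second = make_context()  # a new test would start from an empty context
```

A function-scoped pytest fixture behaves like make_context here, which is why the fixture returns a new literal instead of a module-level constant.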

CI/CD Integration

Tests are configured to work in CI environments:

- Manual tests skip automatically when CI=true or GITHUB_ACTIONS=true
- Integration tests that require the API server are marked with requires_api so they can be skipped
- Unit tests run independently without external dependencies
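
The CI check described above might be implemented along these lines. The helper name running_in_ci and the case-insensitive comparison are assumptions; only the two variable names, CI and GITHUB_ACTIONS, come from this document.

```python
import os
from typing import Mapping, Optional


def running_in_ci(environ: Optional[Mapping[str, str]] = None) -> bool:
    """Return True when CI or GITHUB_ACTIONS is set to 'true' (any letter case)."""
    env = os.environ if environ is None else environ
    return any(env.get(name, "").lower() == "true" for name in ("CI", "GITHUB_ACTIONS"))
```

In a test module the result could be passed to pytest.mark.skipif(running_in_ci(), reason="manual test") so that manual tests are reported as skipped rather than silently omitted.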

Test Configuration

The pytest.ini file includes:

[pytest]
asyncio_mode = auto
asyncio_default_fixture_loop_scope = function

# Test markers
markers =
    requires_api: marks tests that require the API server to be running
    integration: marks tests that require real services (Qdrant, etc.)
    performance: marks performance benchmark tests
    memory: marks memory efficiency and optimization tests
    requires_qdrant: marks tests that require Qdrant vector database
    requires_external_services: marks tests that require external APIs or services
    requires_network: marks tests that require network access
    slow: marks tests that take more than 30 seconds to run

# Test isolation and resource limits
addopts = --strict-markers -p no:cacheprovider

# Plugin path
pythonpath = . dev

Coverage Requirements

Minimum Standards

- 80% overall coverage enforced in CI/CD
- 100% coverage for TypedStrategyAction params
- 95%+ coverage for execute_typed() methods
- All error paths tested
- Edge cases verified

Action-Specific Requirements

- Each new action must have comprehensive tests
- Test both typed and dict interfaces
- Verify context manipulation
- Test parameter validation
- Cover error scenarios

Coverage Enforcement in CI/CD

# .github/workflows/test.yml
- name: Run tests with coverage
  run: |
    poetry run pytest --cov=biomapper --cov-report=html --cov-fail-under=80

Running Tests

Essential Commands

# Full test suite with coverage
poetry run pytest --cov=biomapper --cov-report=html

# Unit tests only
poetry run pytest tests/unit/

# Integration tests
poetry run pytest tests/integration/

# Specific action test with verbose output
poetry run pytest tests/unit/core/strategy_actions/entities/proteins/ -xvs

# Run specific test by name
poetry run pytest -k "test_protein_normalize"

# Debug single test with output
poetry run pytest -xvs tests/unit/core/strategy_actions/test_registry.py::test_action_registration

# Check coverage for specific module
poetry run pytest --cov=biomapper.core.strategy_actions --cov-report=term-missing

Performance Testing

The test suite includes performance considerations:

- Large dataset tests with timing validation
- Memory usage monitoring for data processing
- CSV parsing performance benchmarks
- Venn diagram generation performance

Example:

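import time

import pytest


@pytest.mark.asyncio
async def test_large_dataset_performance(self):
    """Test performance with large datasets."""
    # Test with 10,000 rows; action, params, and context are assumed to be
    # built from large_data in the enclosing test class
    large_data = generate_test_data(10000)
    start_time = time.perf_counter()

    # Execute action (await is only valid inside an async test function)
    result = await action.execute(params, context)

    execution_time = time.perf_counter() - start_time
    assert execution_time < 30.0  # Should complete within 30 seconds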
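
For the memory-monitoring item, a sketch using the standard library's tracemalloc is given below. The measure helper is hypothetical, and the document does not specify a memory budget, so none is asserted here.

```python
import time
import tracemalloc
from typing import Callable, Tuple, TypeVar

T = TypeVar("T")


def measure(func: Callable[[], T]) -> Tuple[T, float, int]:
    """Run `func` and return (result, elapsed seconds, peak traced bytes)."""
    tracemalloc.start()
    start = time.perf_counter()
    try:
        result = func()
        elapsed = time.perf_counter() - start
        _, peak = tracemalloc.get_traced_memory()  # (current, peak) in bytes
    finally:
        tracemalloc.stop()  # always stop tracing, even if func raises
    return result, elapsed, peak


# Build 10,000 rows, matching the dataset size used in the example above.
rows, elapsed, peak = measure(lambda: [{"identifier": f"P{i:05d}"} for i in range(10_000)])
```

A test can then compare elapsed and peak against whatever limits the project chooses. Note that tracemalloc only sees allocations made through Python's allocator, so memory used inside native extensions may not be counted.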

Test Organization Best Practices

Directory Structure Mirrors Code

tests/unit/core/strategy_actions/entities/proteins/
  ↓ mirrors ↓
src/actions/entities/proteins/

Test Naming Conventions

- Test files: test_<action_name>.py
- Test classes: Test<ActionName>
- Test methods: test_<specific_scenario>

Fixture Organization

- Global fixtures in the top-level conftest.py
- Entity-specific fixtures in a conftest.py inside the entity subdirectory
- Action-specific fixtures in the test file itself

Future Enhancements

In Progress

- Complete migration to TypedStrategyAction tests
- Automated strategy validation tests
- Performance benchmarking suite

Planned

- Property-based testing with Hypothesis
- Mutation testing for coverage quality
- Load testing for API endpoints
- Mocking of external services in integration tests

Verification Sources

Last verified: 2025-01-18

This documentation was verified against the following project resources:

- /home/ubuntu/biomapper/tests/unit/core/strategy_actions/ (Core action tests including test_load_dataset_identifiers.py)
- /home/ubuntu/biomapper/tests/unit/strategy_actions/ (Direct action tests: test_semantic_metabolite_match.py)
- /home/ubuntu/biomapper/tests/unit/core/standards/ (Standards tests: test_file_loader.py, test_api_validator.py, test_context_handler.py)
- /home/ubuntu/biomapper/tests/ (Test directory structure and organization)
- /home/ubuntu/biomapper/pytest.ini (Test markers, asyncio configuration, and addopts settings)
- /home/ubuntu/biomapper/src/actions/ (Action implementations with entity-based organization)
- /home/ubuntu/biomapper/CLAUDE.md (TDD approach and testing commands)