Skip to content

Lean Processor

lean_automator.lean.processor

Orchestrates Lean code generation and verification for Knowledge Base items.

This module manages the process of generating formal Lean 4 code (both statement signatures and proofs) for mathematical items (KBItem) stored in the knowledge base. It interacts with an LLM client (GeminiClient) via the llm_interface module to generate the code based on LaTeX statements, informal proofs, and dependency context.

The core logic, orchestrated by generate_and_verify_lean, involves:

  1. Checking prerequisites and dependencies (_check_prerequisites_and_dependencies).

  2. Generating a Lean statement signature (_generate_statement_shell).

  3. Generating and verifying Lean proof tactics (_generate_and_verify_proof).

  4. Updating the KBItem status and content throughout the process.

  5. Providing batch processing capabilities (process_pending_lean_items).

Classes

Functions

generate_and_verify_lean(unique_name: str, client: GeminiClient, db_path: Optional[str] = None, lake_executable_path: str = 'lake', timeout_seconds: int = 120) -> bool async

Orchestrates the full Lean processing pipeline for a single KBItem.

Calls helper functions to check prerequisites, generate statement, and generate/verify proof as needed.

Parameters:

Name Type Description Default
unique_name str

The unique name of the KBItem to process.

required
client GeminiClient

An initialized LLM client instance.

required
db_path Optional[str]

Path to the knowledge base database file. Uses DEFAULT_DB_PATH if None.

None
lake_executable_path str

Path to the lake executable.

'lake'
timeout_seconds int

Timeout for lake build commands.

120

Returns:

Type Description
bool

True if the item is successfully processed (ends in PROVEN status), False otherwise.

Source code in lean_automator/lean/processor.py
async def generate_and_verify_lean(
    unique_name: str,
    client: GeminiClient,
    db_path: Optional[str] = None,
    lake_executable_path: str = "lake",
    timeout_seconds: int = 120,
) -> bool:
    """Orchestrates the full Lean processing pipeline for a single KBItem.

    Calls helper functions to check prerequisites, generate statement, and
    generate/verify proof as needed.

    Args:
        unique_name: The unique name of the KBItem to process.
        client: An initialized LLM client instance.
        db_path: Path to the knowledge base database file. Uses
            `DEFAULT_DB_PATH` if None.
        lake_executable_path: Path to the `lake` executable.
        timeout_seconds: Timeout for `lake build` commands.

    Returns:
        True if the item is successfully processed (ends in PROVEN status),
        False otherwise.
    """
    start_time = time.time()
    effective_db_path = db_path or DEFAULT_DB_PATH
    overall_success = False

    # --- Basic readiness checks ---
    # BUG FIX: the previous check used
    # `isinstance(client, type("DummyGeminiClient", (object,), {}))`, which
    # builds a brand-new class at call time — no object can be an instance of
    # a class created on the spot, so the check was always False. Compare the
    # client's class *name* instead so a placeholder "DummyGeminiClient"
    # (presumably substituted when the real client fails to import — TODO
    # confirm against the import fallback in this module) is actually rejected.
    if not client or type(client).__name__ == "DummyGeminiClient":
        logger.error(
            f"GeminiClient missing or invalid. Cannot process Lean for {unique_name}."
        )
        return False
    # Defensive guard: any of these may be None if optional imports failed.
    if not all(
        [
            KBItem,
            ItemStatus,
            ItemType,
            get_kb_item_by_name,
            save_kb_item,
            lean_interaction,
            lean_proof_repair,
            llm_interface,
        ]
    ):
        logger.critical(
            f"Core modules/types missing for {unique_name}. Cannot process."
        )
        return False

    try:
        # --- Step 1: Check prerequisites and dependencies ---
        item, dependency_items, error_msg = _check_prerequisites_and_dependencies(
            unique_name, effective_db_path
        )

        if error_msg:
            logger.warning(f"Prerequisite check failed for {unique_name}: {error_msg}")
            # Save ERROR status only if item exists and status indicates an error
            # occurred during processing
            if item and hasattr(item, "update_status") and hasattr(ItemStatus, "ERROR"):
                # Avoid overwriting PROVEN status if that was the reason
                # for the 'error' message
                if getattr(item, "status", None) != ItemStatus.PROVEN:
                    # Check if the error requires setting ERROR status
                    if (
                        "missing required latex_statement" in error_msg
                        or "not ready" in error_msg
                        or "not accessible" in error_msg
                    ):
                        item.update_status(ItemStatus.ERROR, error_msg)
                        await save_kb_item(item, client=None, db_path=effective_db_path)
            return False  # Prerequisite failed

        if not item:  # Should not happen if error_msg is None, but check defensively
            logger.error(
                f"Prerequisite check returned no item and no error for {unique_name}."
            )
            return False

        # If item is already proven, the check function handles logging,
        # we just exit successfully.
        if (
            hasattr(ItemStatus, "PROVEN")
            and getattr(item, "status", None) == ItemStatus.PROVEN
        ):
            return True

        # --- Step 2: Determine if statement generation is needed ---
        # Regenerate when no lean_code exists yet, or the item is still in a
        # pre-Lean status (fresh LaTeX acceptance or pending Lean work).
        needs_statement_generation = not getattr(item, "lean_code", None) or getattr(
            item, "status", None
        ) in {ItemStatus.LATEX_ACCEPTED, ItemStatus.PENDING_LEAN}
        statement_shell = None

        if needs_statement_generation:
            statement_shell = await _generate_statement_shell(
                item, dependency_items, client, effective_db_path
            )
            if not statement_shell:
                logger.error(f"Failed to generate statement shell for {unique_name}.")
                # Status already set to ERROR inside helper function
                return False
            # Re-fetch item state after generation as it was modified
            item = get_kb_item_by_name(unique_name, effective_db_path)
            if not item:
                logger.critical(
                    f"Item {unique_name} vanished after statement generation!"
                )
                return False  # Or raise
        else:
            logger.info(
                f"Skipping Lean statement generation for {unique_name}, "
                f"using existing lean_code."
            )
            statement_shell = getattr(item, "lean_code", None)
            # Validate existing shell: proof generation expects a ':= sorry'
            # placeholder it can replace with tactics.
            if not statement_shell or ":= sorry" not in statement_shell.strip():
                error_msg = (
                    f"Existing lean_code for {unique_name} is invalid "
                    f"(missing ':= sorry')."
                )
                logger.error(error_msg)
                if hasattr(item, "update_status") and hasattr(ItemStatus, "ERROR"):
                    item.update_status(ItemStatus.ERROR, error_msg)
                    await save_kb_item(item, client=None, db_path=effective_db_path)
                return False

        # --- Step 3: Handle non-provable items ---
        # Definitions/remarks etc. need only a statement, not a proof.
        item_type = getattr(item, "item_type", None)
        proof_required = getattr(item_type, "requires_proof", lambda: True)()

        if not proof_required:
            logger.info(
                f"Lean Proc: Statement generated/present for non-provable "
                f"{unique_name}. Marking PROVEN."
            )
            if hasattr(item, "update_status") and hasattr(ItemStatus, "PROVEN"):
                # Only update if not already proven
                if getattr(item, "status", None) != ItemStatus.PROVEN:
                    item.update_status(ItemStatus.PROVEN)
                    await save_kb_item(item, client=None, db_path=effective_db_path)
                overall_success = True  # Mark as success
            else:
                logger.error(
                    f"Failed to mark non-provable item {unique_name} as PROVEN."
                )
                overall_success = False  # Should not happen if checks passed
        else:
            # --- Step 4: Generate and verify proof ---
            proof_success = await _generate_and_verify_proof(
                item,
                statement_shell,
                dependency_items,
                client,
                effective_db_path,
                lake_executable_path,
                timeout_seconds,
            )
            overall_success = proof_success

    except Exception as e:
        logger.exception(
            f"Unhandled exception during generate_and_verify_lean for "
            f"{unique_name}: {e}"
        )
        # Attempt to mark item as error if possible
        try:
            item_err = get_kb_item_by_name(unique_name, effective_db_path)
            if (
                item_err
                and hasattr(item_err, "update_status")
                and hasattr(ItemStatus, "ERROR")
            ):
                if (
                    getattr(item_err, "status", None) != ItemStatus.PROVEN
                ):  # Avoid overwriting success
                    item_err.update_status(
                        ItemStatus.ERROR, f"Unhandled exception: {str(e)[:100]}"
                    )
                    await save_kb_item(item_err, client=None, db_path=effective_db_path)
        except Exception as save_err:
            logger.error(
                f"Failed to save ERROR status after unhandled exception for "
                f"{unique_name}: {save_err}"
            )
        overall_success = False  # Ensure failure on exception

    # --- Final Logging ---
    end_time = time.time()
    duration = end_time - start_time
    if overall_success:
        logger.info(
            f"Lean processing SUCCEEDED for {unique_name} in {duration:.2f} seconds."
        )
    else:
        # Fetch final state for accurate logging
        final_item = get_kb_item_by_name(unique_name, effective_db_path)
        final_status_name = "UNKNOWN"
        if final_item:
            final_status = getattr(final_item, "status", None)
            final_status_name = getattr(final_status, "name", "UNKNOWN")
            # Check again if it somehow ended as PROVEN despite failure path
            if hasattr(ItemStatus, "PROVEN") and final_status == ItemStatus.PROVEN:
                logger.warning(
                    f"Item {unique_name} ended as PROVEN despite processing path "
                    f"indicating failure. Assuming success."
                )
                overall_success = True  # Correct the outcome based on final state
                logger.info(
                    f"Lean processing SUCCEEDED (final status) for {unique_name} "
                    f"in {duration:.2f} seconds."
                )
                return True  # Return success

        logger.error(
            f"Lean processing FAILED for {unique_name}. "
            f"Final Status: {final_status_name}. Total time: {duration:.2f} seconds."
        )

    return overall_success

process_pending_lean_items(client: GeminiClient, db_path: Optional[str] = None, limit: Optional[int] = None, process_statuses: Optional[List[ItemStatus]] = None, **kwargs) async

Processes multiple items requiring Lean code generation and verification.

Queries the database for items in specified statuses (defaulting to LATEX_ACCEPTED, PENDING_LEAN, LEAN_VALIDATION_FAILED). It then iterates through eligible items (checking status again before processing), calls generate_and_verify_lean for each one up to an optional limit, and logs summary statistics.

Parameters:

Name Type Description Default
client GeminiClient

An initialized LLM client instance.

required
db_path Optional[str]

Path to the database file. If None, uses DEFAULT_DB_PATH.

None
limit Optional[int]

Max number of items to process in this batch.

None
process_statuses Optional[List[ItemStatus]]

List of statuses to query for processing. Defaults to LATEX_ACCEPTED, PENDING_LEAN, LEAN_VALIDATION_FAILED.

None
**kwargs

Additional keyword arguments passed to generate_and_verify_lean.

{}
Source code in lean_automator/lean/processor.py
async def process_pending_lean_items(
    client: GeminiClient,
    db_path: Optional[str] = None,
    limit: Optional[int] = None,
    process_statuses: Optional[List[ItemStatus]] = None,
    **kwargs,  # Pass other args like lake path, timeout to generate_and_verify_lean
):
    """Processes multiple items requiring Lean code generation and verification.

    Queries the database for items in specified statuses (defaulting to
    LATEX_ACCEPTED, PENDING_LEAN, LEAN_VALIDATION_FAILED). It then iterates
    through eligible items (checking status again before processing), calls
    `generate_and_verify_lean` for each one up to an optional limit, and logs
    summary statistics.

    Args:
        client: An initialized LLM client instance.
        db_path: Path to the database file. If None, uses `DEFAULT_DB_PATH`.
        limit: Max number of items to process in this batch.
        process_statuses: List of statuses to query for processing. Defaults
            to LATEX_ACCEPTED, PENDING_LEAN, LEAN_VALIDATION_FAILED.
        **kwargs: Additional keyword arguments passed to `generate_and_verify_lean`.
    """
    # Basic dependency checks (any of these may be None if imports failed)
    if not all(
        [
            KBItem,
            ItemStatus,
            ItemType,
            get_items_by_status,
            get_kb_item_by_name,
            save_kb_item,
        ]
    ):
        logger.critical(
            "Required KB modules/functions not fully loaded. "
            "Cannot batch process Lean items."
        )
        return
    # BUG FIX: the previous check used
    # `isinstance(client, type("DummyGeminiClient", (object,), {}))`, which
    # compares against a class created on the spot and is therefore always
    # False. Compare the class name instead so a placeholder client is
    # actually rejected (same fix as in generate_and_verify_lean).
    if not client or type(client).__name__ == "DummyGeminiClient":
        logger.error("GeminiClient missing or invalid. Cannot batch process.")
        return

    effective_db_path = db_path or DEFAULT_DB_PATH
    # Determine target statuses
    try:
        default_statuses = {
            ItemStatus.PENDING_LEAN,
            ItemStatus.LATEX_ACCEPTED,
            ItemStatus.LEAN_VALIDATION_FAILED,
        }
        if process_statuses is None:
            process_statuses_set = default_statuses
        else:
            valid_statuses = {s for s in process_statuses if isinstance(s, ItemStatus)}
            # BUG FIX: comparing len(valid_statuses) (a set) to
            # len(process_statuses) (a list) falsely flagged *duplicated*
            # valid statuses as invalid. Check for non-ItemStatus members
            # directly instead.
            if any(not isinstance(s, ItemStatus) for s in process_statuses):
                logger.warning(
                    "Invalid status types provided to process_pending_lean_items."
                )
            process_statuses_set = (
                valid_statuses if valid_statuses else default_statuses
            )
            if not valid_statuses:
                logger.warning("No valid process_statuses provided, using defaults.")
    except AttributeError:
        logger.critical(
            "Cannot determine batch processing statuses: "
            "ItemStatus members not accessible."
        )
        return

    processed_count = 0
    success_count = 0
    fail_count = 0
    items_to_process_names = []

    status_names = [getattr(s, "name", "UNKNOWN") for s in process_statuses_set]
    logger.info(
        f"Starting Lean batch processing. Querying for statuses: {status_names}."
    )

    # --- Collect eligible items ---
    for status in process_statuses_set:
        if limit is not None and len(items_to_process_names) >= limit:
            break
        try:
            if not callable(get_items_by_status):
                continue  # Skip if function not loaded

            items_gen = get_items_by_status(status, effective_db_path)
            count_for_status = 0
            for item in items_gen:
                if limit is not None and len(items_to_process_names) >= limit:
                    break
                item_unique_name = getattr(item, "unique_name", None)
                if not item_unique_name:
                    continue

                # Add item if not already collected
                if item_unique_name not in items_to_process_names:
                    # Check if non-provable item with code can be marked PROVEN directly
                    item_type = getattr(item, "item_type", None)
                    item_lean_code = getattr(item, "lean_code", None)
                    needs_proof = getattr(item_type, "requires_proof", lambda: True)()
                    current_item_status = getattr(
                        item, "status", None
                    )  # Should be same as loop 'status'

                    if (
                        not needs_proof
                        and item_lean_code
                        and current_item_status == ItemStatus.LATEX_ACCEPTED
                    ):
                        logger.info(
                            f"Pre-marking non-provable item {item_unique_name} "
                            f"with code as PROVEN."
                        )
                        if hasattr(item, "update_status") and hasattr(
                            ItemStatus, "PROVEN"
                        ):
                            item.update_status(ItemStatus.PROVEN)
                            await save_kb_item(
                                item, client=None, db_path=effective_db_path
                            )
                        continue  # Skip adding to main processing list

                    # Otherwise, add to list for processing
                    items_to_process_names.append(item_unique_name)
                    count_for_status += 1

            logger.debug(
                f"Found {count_for_status} potential items with status "
                f"{getattr(status, 'name', 'UNKNOWN')}"
            )

        except Exception as e:
            logger.error(
                f"Failed to retrieve items with status "
                f"{getattr(status, 'name', 'UNKNOWN')}: {e}"
            )

    if not items_to_process_names:
        logger.info("No eligible items found requiring Lean processing.")
        return

    logger.info(
        f"Collected {len(items_to_process_names)} unique items for Lean processing."
    )

    # --- Process collected items ---
    for unique_name in items_to_process_names:
        # Re-fetch item state before processing to double-check eligibility
        # (another worker or an earlier iteration may have changed it).
        try:
            current_item_state = get_kb_item_by_name(unique_name, effective_db_path)
            if not current_item_state:
                logger.warning(
                    f"Skipping {unique_name}: Item disappeared before processing."
                )
                continue

            item_status = getattr(current_item_state, "status", None)
            # Check if status is still one of the target statuses
            if item_status not in process_statuses_set:
                status_name = getattr(item_status, "name", "None")
                logger.info(
                    f"Skipping {unique_name}: Status ({status_name}) changed, "
                    f"no longer eligible."
                )
                continue

        except Exception as fetch_err:
            logger.error(
                f"Error re-fetching state for {unique_name}: {fetch_err}. Skipping."
            )
            continue

        # Proceed with processing the eligible item
        item_id = getattr(current_item_state, "id", "N/A")
        status_name = getattr(item_status, "name", "None")
        logger.info(
            f"--- Processing Lean for: {unique_name} (ID: {item_id}, "
            f"Status: {status_name}) ---"
        )
        processed_count += 1
        try:
            # Call the main orchestrator function
            success = await generate_and_verify_lean(
                unique_name, client, effective_db_path, **kwargs
            )
            if success:
                success_count += 1
            else:
                fail_count += 1
            # Detailed outcome logged within generate_and_verify_lean
        except Exception as e:
            # Catch unexpected errors from the orchestrator
            logger.exception(
                f"Critical error during batch processing of {unique_name}: {e}"
            )
            fail_count += 1
            # Attempt to mark item as ERROR in DB
            try:
                err_item = get_kb_item_by_name(unique_name, effective_db_path)
                if (
                    err_item
                    and hasattr(err_item, "update_status")
                    and hasattr(ItemStatus, "ERROR")
                ):
                    current_err_status = getattr(err_item, "status", None)
                    is_proven = (
                        hasattr(ItemStatus, "PROVEN")
                        and current_err_status == ItemStatus.PROVEN
                    )
                    if not is_proven:  # Don't overwrite success
                        err_item.update_status(
                            ItemStatus.ERROR,
                            f"Batch processing crashed: {str(e)[:100]}",
                        )
                        await save_kb_item(
                            err_item, client=None, db_path=effective_db_path
                        )
            except Exception as save_err:
                logger.error(
                    f"Failed to save ERROR status for {unique_name} after batch crash: "
                    f"{save_err}"
                )

        logger.info(f"--- Finished processing Lean for {unique_name} ---")

    logger.info(
        f"Lean Batch Processing Complete. Total Processed: {processed_count}, "
        f"Succeeded: {success_count}, Failed: {fail_count}"
    )