
Documentation

A comprehensive guide to AI-powered collaborative drawing and the implementation behind this interactive tool

1. Project Overview

The AI Co-Drawing application is an interactive collaborative drawing tool that combines human creativity with artificial intelligence. Built with React and powered by Google's Gemini 2.0 Flash model, it allows users to create drawings and have them enhanced, transformed, or evolved through AI assistance.

Core Features

  • Interactive Canvas: High-resolution drawing surface with touch and mouse support
  • AI Enhancement: Transform drawings using natural language prompts
  • Iterative Creation: Build upon AI-generated images with additional drawing
  • Real-time Processing: Live drawing with immediate visual feedback
  • Multi-modal AI: Combines image understanding with text generation capabilities

Workflow

  1. User draws on the canvas using customizable brush tools
  2. User provides a text prompt describing desired enhancements
  3. AI analyzes the drawing and prompt to generate an enhanced version
  4. Generated image becomes the new canvas background
  5. User can continue drawing on top of the AI-generated content
  6. Process repeats for iterative collaborative creation
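
Before step 3 can run, the drawing from steps 1–2 is exported as a base64-encoded PNG payload. A minimal sketch of that export, assuming canvasRef points at the drawing canvas and that drawingData is the name used in the request example in section 2:

Canvas Export (sketch)
// Serialize the current canvas to PNG and strip the "data:image/png;base64,"
// prefix so only the raw base64 payload is sent as inlineData.
const canvas = canvasRef.current;
const drawingData = canvas
  ? canvas.toDataURL("image/png").split(",")[1]
  : "";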

2. AI Integration

Gemini 2.0 Flash Model

The application uses Google's Gemini 2.0 Flash model with native image generation. The model understands both visual input (the current drawing) and textual instructions, and uses them to produce coherent, contextual enhancements.

Model Configuration

  • Model: gemini-2.0-flash-preview-image-generation
  • Input Modalities: Text + Image
  • Output Modalities: Text + Image
  • Max Resolution: 1024x1024 pixels

Multi-modal Processing

The AI system processes two types of input simultaneously:

AI Request Structure
let contents: Content[] = [
  {
    role: "USER",
    parts: [{ inlineData: { data: drawingData, mimeType: "image/png" } }],
  },
  {
    role: "USER",
    parts: [{ text: `${prompt}. Keep the same minimal line doodle style.` }],
  },
];

const response = await ai.models.generateContent({
  model: "gemini-2.0-flash-preview-image-generation",
  contents,
  config: {
    responseModalities: [Modality.TEXT, Modality.IMAGE],
  },
});
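
After the call returns, the generated image has to be read back out of the response. A minimal sketch, assuming the @google/genai response shape (candidates → content → parts with inlineData) and the setGeneratedImage setter introduced in section 4:

Response Handling (sketch)
// Walk the returned parts and store the first inline image as a data URL so it
// can be drawn as the new canvas background.
const parts = response.candidates?.[0]?.content?.parts ?? [];
for (const part of parts) {
  if (part.inlineData?.data) {
    const mime = part.inlineData.mimeType || "image/png";
    setGeneratedImage(`data:${mime};base64,${part.inlineData.data}`);
    break;
  }
}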

Style Preservation

To maintain visual consistency, every prompt is suffixed with the instruction to "keep the same minimal line doodle style", so enhancements feel like natural extensions of the original drawing rather than complete replacements.

3. Canvas Drawing System

Canvas Architecture

The drawing system uses a single HTML5 Canvas composited in layers: a white base fill, the most recent AI-generated image, and the user's strokes drawn on top.

Canvas Initialization
const initializeCanvas = () => {
  const canvas = canvasRef.current;
  if (!canvas) return;
  const ctx = canvas.getContext("2d");
  if (!ctx) return;

  // Set high-resolution canvas size
  canvas.width = 960;
  canvas.height = 540;

  // Fill canvas with white background
  ctx.fillStyle = "#FFFFFF";
  ctx.fillRect(0, 0, canvas.width, canvas.height);
};

Coordinate System

Pointer events report positions in CSS pixels, so the canvas maps them into its internal 960x540 coordinate space, keeping strokes accurate across different display sizes and scaling:

Coordinate Mapping
const getCoordinates = (e: React.MouseEvent | React.TouchEvent) => {
  const canvas = canvasRef.current;
  if (!canvas) return { x: 0, y: 0 };

  const rect = canvas.getBoundingClientRect();
  const scaleX = canvas.width / rect.width;
  const scaleY = canvas.height / rect.height;

  let clientX, clientY;
  if ("touches" in e) {
    clientX = e.touches[0]?.clientX || 0;
    clientY = e.touches[0]?.clientY || 0;
  } else {
    clientX = e.clientX;
    clientY = e.clientY;
  }

  return {
    x: (clientX - rect.left) * scaleX,
    y: (clientY - rect.top) * scaleY,
  };
};

Drawing Engine

The drawing engine supports smooth line rendering with configurable brush properties:

Drawing Implementation
const draw = (e: React.MouseEvent | React.TouchEvent) => {
  if (!isDrawing) return;

  const canvas = canvasRef.current;
  if (!canvas) return;
  const ctx = canvas.getContext("2d");
  if (!ctx) return;

  const { x, y } = getCoordinates(e);

  ctx.lineWidth = brushSize;
  ctx.lineCap = "round";
  ctx.strokeStyle = penColor;
  ctx.lineTo(x, y);
  ctx.stroke();
};
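
draw extends a path that is opened when the pointer first goes down. A sketch of the companion handlers (the names startDrawing and stopDrawing are assumptions, not necessarily the app's identifiers):

Stroke Lifecycle (sketch)
// Pointer down: begin a new path at the cursor so later lineTo calls extend it.
const startDrawing = (e: React.MouseEvent | React.TouchEvent) => {
  const ctx = canvasRef.current?.getContext("2d");
  if (!ctx) return;
  const { x, y } = getCoordinates(e);
  ctx.beginPath();
  ctx.moveTo(x, y);
  setIsDrawing(true);
};

// Pointer up or leaving the canvas: end the current stroke.
const stopDrawing = () => setIsDrawing(false);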

4. Implementation Details

State Management

The application manages drawing, AI generation, and error state with React hooks:

State Structure
// Drawing State
const [isDrawing, setIsDrawing] = useState(false);
const [penColor, setPenColor] = useState("#000000");
const [brushSize, setBrushSize] = useState(5);

// AI Generation State
const [prompt, setPrompt] = useState("");
const [generatedImage, setGeneratedImage] = useState<string | null>(null);
const [isLoading, setIsLoading] = useState(false);

// Error Handling
const [showErrorModal, setShowErrorModal] = useState(false);
const [errorMessage, setErrorMessage] = useState("");

Background Image System

AI-generated images are loaded as the canvas background so the user can continue drawing on top of them:

Background Integration
useEffect(() => {
  if (generatedImage && canvasRef.current) {
    const img = new window.Image();
    img.onload = () => {
      backgroundImageRef.current = img;
      drawImageToCanvas();
    };
    img.src = generatedImage;
  }
}, [generatedImage]);

const drawImageToCanvas = () => {
  if (!canvasRef.current || !backgroundImageRef.current) return;

  const canvas = canvasRef.current;
  const ctx = canvas.getContext("2d");
  if (!ctx) return;

  // Fill with white background first
  ctx.fillStyle = "#FFFFFF";
  ctx.fillRect(0, 0, canvas.width, canvas.height);

  // Draw the background image
  ctx.drawImage(backgroundImageRef.current, 0, 0, canvas.width, canvas.height);
};

Touch Support

Touch input is supported on mobile devices, with default touch gestures suppressed so the page does not scroll or zoom while drawing:

Touch Event Handling
useEffect(() => {
  const preventTouchDefault = (e: TouchEvent) => {
    if (isDrawing) {
      e.preventDefault();
    }
  };

  const canvas = canvasRef.current;
  if (canvas) {
    canvas.addEventListener("touchstart", preventTouchDefault, { passive: false });
    canvas.addEventListener("touchmove", preventTouchDefault, { passive: false });
  }

  return () => {
    if (canvas) {
      canvas.removeEventListener("touchstart", preventTouchDefault);
      canvas.removeEventListener("touchmove", preventTouchDefault);
    }
  };
}, [isDrawing]);
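
These listeners only suppress scrolling; the drawing handlers themselves are attached to the canvas element as React props. A sketch of that wiring, reusing draw from section 3 and the assumed startDrawing/stopDrawing names from the stroke lifecycle sketch:

Canvas Event Wiring (sketch)
<canvas
  ref={canvasRef}
  onMouseDown={startDrawing}
  onMouseMove={draw}
  onMouseUp={stopDrawing}
  onMouseLeave={stopDrawing}
  onTouchStart={startDrawing}
  onTouchMove={draw}
  onTouchEnd={stopDrawing}
  className="w-full h-[400px] md:h-[500px] lg:h-[600px] bg-white cursor-crosshair touch-none"
/>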

5. User Interface

Design System

The interface follows a consistent design system with a dark theme and neutral color palette:

Color Palette

  • Background: #000000
  • Cards: #171717
  • Controls: #262626
  • Primary: #FFFFFF

Component Structure

  • Canvas Area: Primary drawing surface with full touch/mouse support
  • Settings Panel: Tool controls and AI configuration options
  • Drawing Tools: Brush size, color picker, and canvas management
  • AI Enhancement: Prompt input and generation controls
  • Error Modal: User-friendly error handling and display

Responsive Design

The interface adapts to different screen sizes using Tailwind CSS breakpoints:

Responsive Layout
{/* Responsive grid layout */}
<div className="grid grid-cols-1 xl:grid-cols-[1fr_300px] gap-6">
  {/* Canvas takes full width on mobile, left side on desktop */}
  <motion.div className="bg-neutral-900 rounded-2xl border border-neutral-800 p-6">
    <canvas
      className="w-full h-[400px] md:h-[500px] lg:h-[600px] bg-white cursor-crosshair touch-none"
    />
  </motion.div>

  {/* Settings panel stacks below on mobile, right side on desktop */}
  <motion.div className="bg-neutral-900 rounded-2xl border border-neutral-800 p-6">
    {/* Settings content */}
  </motion.div>
</div>

6. API Integration

Environment Configuration

The application requires proper environment setup for API access:

.env.local
# Google Generative AI API Key
# Get your API key from: https://makersuite.google.com/app/apikey
NEXT_PUBLIC_GOOGLE_API_KEY=your_api_key_here
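
With the key in place, the client used by the request code in section 2 can be created. A minimal sketch, assuming the @google/genai SDK:

Client Initialization (sketch)
import { GoogleGenAI } from "@google/genai";

// The NEXT_PUBLIC_ prefix makes the variable available to client-side code.
const ai = new GoogleGenAI({ apiKey: process.env.NEXT_PUBLIC_GOOGLE_API_KEY });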

Error Handling

Comprehensive error handling with user-friendly feedback:

Error Management
function parseError(error: string) {
  const regex = /{"error":(.*)}/gm;
  const m = regex.exec(error);
  try {
    const e = m ? m[1] : error;
    const err = JSON.parse(e);
    return err.message || error;
  } catch (e) {
    return error;
  }
}

// In the main component
} catch (error: any) {
  console.error("Error submitting drawing:", error);
  setErrorMessage(error.message || "An unexpected error occurred.");
  setShowErrorModal(true);
} finally {
  setIsLoading(false);
}

Performance Considerations

  • Image Encoding: Canvas content is exported as PNG data for API transmission
  • Async Processing: Non-blocking AI generation with loading states
  • Memory Management: Proper cleanup of canvas contexts and event listeners
  • Error Recovery: Graceful handling of API failures with retry options
  • Client-side Rendering: All drawing operations run locally for immediate feedback

Security

API keys are supplied through environment variables. Note that the NEXT_PUBLIC_ prefix exposes the key to client-side code; this keeps the demo simple, but production deployments should route requests through a server-side proxy instead. The application also includes input validation and sanitization to guard against malicious prompts or canvas manipulations.

This documentation covers the technical implementation of the AI Co-Drawing application. For questions about usage or to contribute improvements, explore the source code or reach out directly.