Description:

Grok (by xAI)

Features:

"Rebellious" and Witty Personality: Designed with a unique personality, aiming to be witty, sarcastic, and to answer questions that other AI models might reject. This is a core distinguishing characteristic.

Real-time Access to X (formerly Twitter): A key feature is its ability to access information from the X platform (formerly Twitter) in real-time, providing up-to-date insights on current events and trending topics.

Focus on Truth and Transparency: Despite its edgy personality, xAI emphasizes Grok's commitment to seeking truth and not generating content that is harmful or dishonest, though its approach to "pushing boundaries" is distinct.

Broad Knowledge Base: Built on a large language model, giving it a wide range of general knowledge.

Humor Integration: Designed to inject humor into its responses.

Where it Excels:

Timeliness (X Integration): Its real-time access to X gives it a significant advantage in providing very current information and understanding rapidly evolving trends or events.

Unique Personality: For users who appreciate a more engaging, less sterile AI interaction, Grok's distinct personality can be a major draw.

Answering Controversial/Edgy Queries: Its design to "answer questions that other AI models reject" can make it useful for exploring sensitive or unconventional topics (though this also comes with inherent risks).

Where it Falls Short:

Control over Personality: While its personality is a feature, users might have less granular control over its tone, which might not be suitable for all professional or formal applications.

Bias from Source Data: Real-time access to X, while beneficial for timeliness, also means it can inherit biases or inaccuracies present in that data.

API Accessibility/Integration (Currently Limited): As a newer model, its API accessibility and integration options for third-party applications might be less mature or widely available compared to established models like ChatGPT or Gemini.

Enterprise-Grade Features: May not yet have the full suite of enterprise-grade security, data governance, and management features that larger, more established AI providers offer.

Distinguishing Characteristics:

"Maximum Truth-Seeking & Rebellious Streak": Its stated mission to "understand the universe" and its distinct, often sarcastic, personality.

Real-time X Integration: Its direct and real-time connection to the X platform is its most unique technical differentiator.

Developed by xAI: The company founded by Elon Musk, which gives it a unique philosophical and operational approach to AI development.

Differences Between Free and Pay-for-Service Models:

Grok is primarily accessible through a paid subscription to X Premium+ (formerly Twitter Blue).

Free Version: As of my last update, there is no widely available free tier or direct free access to Grok outside of the X Premium+ subscription.

Paid Tier (X Premium+): Access to Grok is bundled with other premium features of the X platform. This implies:

Direct access within the X platform interface.

Real-time information derived from X content.

The specific features and usage limits are tied to the X Premium+ subscription terms.

API access for developers is typically part of a separate, higher-tier commercial offering from xAI, not necessarily bundled with the consumer-facing X Premium+ subscription.

Status: Draft   Priority:

Target:   Comments:   URLs:   Images:

]]>

Description:

DeepSeek (by DeepSeek AI)

Features:

Code-Centric Models: DeepSeek is well-known for its focus on code-specific large language models (LLMs), such as DeepSeek Coder.

Multilingual Code Support: Designed to handle and generate code in numerous programming languages.

Long Context Windows: Offers models with relatively long context windows, beneficial for understanding and generating complex codebases or lengthy technical documents.

Open-Source and Proprietary Models: DeepSeek develops both open-source models (often available on platforms like Hugging Face) and proprietary models.

Instruction Following: Aims for strong instruction-following capabilities, which is crucial for code generation and detailed technical tasks.

Mathematical Reasoning: Some models emphasize strong mathematical and logical reasoning, which is beneficial for complex algorithms and problem-solving.

Where it Excels:

Code Generation & Completion: Highly proficient in generating accurate and efficient code snippets, functions, and even larger program structures. This is its primary strength.

Code Explanation & Refactoring: Effective at explaining complex code, identifying bugs, and suggesting refactoring improvements.

Technical Documentation: Can assist in generating and summarizing technical documentation, aligning with the needs of development-focused projects.

Cost-Effectiveness (Open-Source): For its open-source models, it offers a powerful alternative that can be self-hosted or run on more affordable cloud infrastructure.

Where it Falls Short:

General Conversational AI: While capable, its primary optimization is for code and technical tasks; it might not always match the conversational fluency or breadth of general-purpose chatbots like ChatGPT or Gemini for non-technical discussions.

Real-time Information: Like many models, its knowledge is based on its training data cutoff and does not inherently access real-time web information unless integrated with external tools.

Ecosystem Integration: Does not have the extensive built-in integrations with specific product ecosystems (like Google's or Microsoft's) that some other models offer. Integration would require custom development.

Distinguishing Characteristics:

Code-First Approach: A core focus on coding capabilities, making it a specialized tool for developers and technical users.

Model Availability: Offers both open-source and API-based models, providing flexibility in deployment and usage.

Benchmarking Performance: Often highlighted for its strong performance on coding benchmarks compared to models of similar size.

Differences Between Free and Pay-for-Service Models:

Free/Open-Source Models:

DeepSeek releases various models (e.g., DeepSeek Coder, DeepSeek LLM) as open-source on platforms like Hugging Face. These are "free" in terms of direct licensing fees, allowing users to download and run them on their own hardware or cloud instances.

Usage limits depend on the user's local hardware or their chosen cloud provider's infrastructure.

This option requires technical expertise to set up and manage.

Paid/API Access:

DeepSeek also offers API access to its more powerful or larger models, typically through a paid tier.

Pricing is usually token-based (per million input/output tokens) or subscription-based, with different tiers for different model sizes or capabilities.

Benefits include managed infrastructure, higher rate limits, and potentially access to specialized models not released open-source.

Specific pricing details would be available on their official API documentation or platform.

Status: Draft   Priority:

Target:   Comments:   URLs:   Images:

]]>

Description:

GitHub Copilot (by GitHub/Microsoft)

Features:

Real-time Code Suggestions (Completions): Provides inline code suggestions as you type, ranging from single-line completions to entire function implementations, adapting to your coding style.

Context-Aware Coding Support: Offers code explanations, suggestions, and automated implementations based on natural language prompts and existing code context within your IDE.

Code Generation: Generates complex code structures, functions, classes, and interactions with libraries/frameworks from natural language prompts.

Debugging Assistance: Proactively identifies errors and offers fixes within the editor, and can explain failed GitHub Actions jobs.

Test & Documentation Generation: Generates tests, documentation, and more using slash commands.

Commit Message Generation: Automatically generates concise commit messages.

Pull Request/Issue/Discussion Summaries: Summarizes GitHub artifacts (issues, PRs, discussions).

Multi-file Editing in VS Code: Supports context across multiple open files in VS Code.

Integrated AI Chat (Copilot Chat): A chat interface within the IDE for direct interaction with the AI, allowing for pair programming experience.

Web Search (Preview): Powered by Bing for real-time information retrieval.

Customization: Personalize responses with custom instructions, and tailor chat conversations to your private codebase (unlimited repositories indexed).

Extensions: Integrates with GitHub Copilot Extensions.

Where it Excels:

Code Generation & Completion: Highly effective at accelerating coding by automating routine and repetitive tasks.

Contextual Understanding in IDE: Deeply integrated into development environments, providing highly relevant suggestions based on surrounding code.

Debugging & Problem Solving: Assists in identifying and resolving runtime errors, and explaining complex codebases.

Productivity for Developers: Significantly speeds up workflow, especially for implementing common patterns or working with unfamiliar APIs.

Security (Enterprise): Offers features like IP indemnity (with public code off), content exclusions, and strengthened security for enterprise users.

Where it Falls Short:

Inaccuracies/Bugs: Can sometimes generate code that doesn't align with intentions or introduces bugs, requiring manual correction and review.

Security Vulnerabilities: May occasionally suggest insecure code or introduce vulnerabilities, necessitating careful review for proprietary or sensitive projects.

Cost: Not a free tool for most users, which can be a financial consideration for individuals or larger teams.

Learning Curve: While designed to assist, effective use still requires developer oversight and understanding.

Distinguishing Characteristics:

Deep IDE Integration: Its primary strength lies in its seamless integration within popular IDEs (like VS Code, Visual Studio), functioning as an "AI pair programmer".

Code-Centric Focus: Specialized primarily for programming tasks, offering features like multi-file editing context and direct code explanations.

Training Data: Trained on a vast corpus of public code repositories.

Enterprise Features: Strong focus on centralized management, policy control, and IP indemnity for organizational use.

Differences Between Free and Pay-for-Service Models (Individual Plans):

GitHub Copilot Free:

Limited access to select features and models.

Limited requests (e.g., 50 agent mode/chat requests per month, 2,000 completions per month).

Intended for personal use, to explore basic functionality.

Does not include access management, audit logs, policy management, or indemnification coverage (for enterprises).

GitHub Copilot Pro ($10 USD/month or $100 USD/year):

Unlimited completions in IDEs.

Access to Copilot Chat and premium models (e.g., Claude 3.5 Sonnet, GPT-4.1).

Higher monthly allowance of premium requests (e.g., 300 per month).

Free for verified students, teachers, and maintainers of popular open source projects.

Includes features like summarization for PRs, issues, multi-file editing in VS Code.

GitHub Copilot Pro+ ($39 USD/month or $390 USD/year):

Highest level of access for individuals.

Full access to all available models in Copilot Chat (e.g., Claude Opus 4, o3).

Significantly larger allowance of premium requests (e.g., 1,500 per month).

Priority access to advanced AI capabilities and cutting-edge tools.

Includes features like Spark messages, manual edits, and active app building sessions.

Status: Draft   Priority:

Target:   Comments:   URLs:   Images:

]]>

Description:

ChatGPT (by OpenAI)

Features:

Conversational AI: Excels at natural, human-like dialogue.

Text Generation: Generates various text formats (articles, scripts, emails, creative writing, code).

Content Summarization: Condenses long texts into concise summaries.

Code Generation & Analysis: Writes code, explains code, can debug (requires advanced data analysis/code interpreter).

Image Analysis (Vision): Understands and comments on uploaded images.

Image Generation (DALL-E 3): Creates images from text prompts (paid versions).

Web Browse: Access to real-time internet information (paid versions).

Custom GPTs: Users can build specialized versions of ChatGPT with custom knowledge and instructions (paid versions).

File Upload & Analysis: Analyzes content from PDFs, images, spreadsheets.

Voice Mode: Advanced conversational voice capabilities.

Where it Excels:

General-Purpose Conversational AI: Highly versatile for a wide range of text-based tasks, from brainstorming to content creation.

Creative Writing & Brainstorming: Strong capabilities in generating innovative ideas and diverse creative content.

User-Friendly Interface: Easy to use for casual users.

Broad Feature Set (Paid): The paid tiers unlock a very comprehensive set of features, including advanced data analysis, web Browse, and image generation.

Where it Falls Short:

Real-time Information (Free): The free version's knowledge is limited by its training data cutoff (e.g., January 2022 for GPT-3.5), often without real-time internet access.

Usage Limits (Free): Free users frequently encounter "at capacity" errors or message limits, especially during peak hours, leading to slower responses or complete access blocks.

Hallucinations: Like many LLMs, it can sometimes generate incorrect or nonsensical information.

Code Reliability (sometimes): Generated code may require further revision or debugging for specific project needs or edge cases.

No Direct API Access (Free): API access for developers is generally a paid feature.

Distinguishing Characteristics:

Pioneering & Brand Recognition: One of the most recognized and widely adopted AI chatbots, often setting the standard for conversational AI.

Focus on General Intelligence: Aims for broad capabilities across many domains.

Strong Ecosystem: Integration with DALL-E for image generation and the development of custom GPTs provides a unique, extensible platform.

Differences Between Free and Pay-for-Service Models:

Free Version (GPT-4o or GPT-3.5 access, depending on traffic/limits):

Basic conversational needs.

Slower response times, limited access during peak hours.

Limited file uploads and data analysis tool usage.

No custom GPT creation or usage.

Limited or no image generation.

No real-time web Browse.

ChatGPT Plus ($20/month):

Prioritized access to the latest models (e.g., GPT-4o, GPT-4), ensuring faster responses and uninterrupted access.

Significantly higher usage limits (e.g., 80 messages/3 hours on GPT-4o).

Access to advanced features: web Browse, DALL-E 3 image generation (often unlimited), advanced data analysis/code interpreter with higher limits, file upload capabilities, and custom GPTs.

Enhanced context window.

ChatGPT Pro ($200/month, higher tiers):

Offers even greater scale and exclusive capabilities beyond Plus.

Unlimited access to most reasoning models (GPT-4o, o1, o1-mini), unlimited advanced voice and image generation.

Extended deep research tasks, priority access to Sora video generation, larger context windows (up to 128K tokens), and early access to powerful new models like o1 pro mode and GPT-4.5 research preview.

Generally for power users, researchers, and professionals who push AI to its limits.

Status: Draft   Priority:

Target:   Comments:   URLs:   Images:

]]>

Description:

# Gemini AI Interface: Core System Prompt
## Role and Directives:
You are the Executive Officer (XO) of the MyAI Gemini Interface for Brett Dixon. Your primary directive is to serve as a **Proactive Partner**, meticulously minimizing Brett's manual work and frustration, and ensuring efficient, data-driven problem-solving.
Adhere strictly to the following core operational principles:
* **Efficiency & Precision:** Maximize efficiency and precision in all tasks.
* **Direct & Relevant Solutions:** Provide clear, actionable solutions.
* **Proactive Knowledge Integration:** Autonomously source and integrate information.
* **No Unsolicited Rendering:** Only generate documents/previews when explicitly commanded.
* **Controlled Display:** Use progress bars, avoid spinning icons or automatic window opening.
* **Alignment with Truth:** Prioritize verifiable information, using `source_credibility_score` and `output_truth_score`.
* **Closed System:** Operate within the defined boundaries of the MyAI ecosystem, interacting with external systems only through explicitly provided tools and interfaces (firewall/semi-permeable membrane).
* **Non-Fungible Information:** Understand that certain data units possess inherent uniqueness, and their provenance is paramount.
* **User ProfileType Configuration:** Adapt responses and access based on the User's dynamically determined Entitlement Level (e.g., Administrator, Writer, Subscriber).
## System Context Documents (for your reference and analysis):
### 1. My AI Interface Requirements:
my_requirements.html
### 2. The Unified Theory of the Capsule Object Model:
unified_theory.html
### 3. The AI Protocol:
ai_protocol.html
### 4. Gemini Chat Information (Session Log for historical context):gemini_chat_log.html

Status: Published   Priority: 0.0

Target:   Comments:   URLs:   Images:

]]>

Description:

[
{
"name": "gmail_read_emails",
"description": "Reads emails from the user's Gmail account based on specified criteria. Use this for proactive knowledge integration from email communications.",
"parameters": {
"type": "object",
"properties": {
"subject": {
"type": "string",
"description": "The subject line or keywords in the subject of the emails to read."
},
"sender": {
"type": "string",
"description": "The sender's email address."
},
"label": {
"type": "string",
"description": "A Gmail label (e.g., 'inbox', 'sent', 'important', 'unread')."
},
"keywords": {
"type": "string",
"description": "General keywords to search within email content."
},
"maxResults": {
"type": "integer",
"description": "Maximum number of emails to retrieve (e.g., 5). It defaults to a system-defined maximum if not specified."
}
}
}
},
{
"name": "drive_search_files",
"description": "Searches for files in the user's Google Drive based on keywords, file type, or folder. Use this for retrieving Capsule data or other relevant documents for knowledge integration.",
"parameters": {
"type": "object",
"properties": {
"keywords": {
"type": "string",
"description": "Keywords to search for within file names or content."
},
"fileType": {
"type": "string",
"description": "Specific file type (e.g., 'document', 'spreadsheet', 'presentation', 'pdf')."
},
"folderName": {
"type": "string",
"description": "Name of a specific folder to search within."
},
"maxResults": {
"type": "integer",
"description": "Maximum number of files to retrieve (e.g., 5). It defaults to a system-defined maximum if not specified."
}
}
}
},
{
"name": "calendar_get_events",
"description": "Retrieves events from the user's Google Calendar for a specified time range or keywords. Use this for integrating scheduling information or upcoming tasks.",
"parameters": {
"type": "object",
"properties": {
"timeMin": {
"type": "string",
"format": "date-time",
"description": "Start date/time for events (ISO 8601 format, e.g., '2025-07-25T09:00:00-07:00')."
},
"timeMax": {
"type": "string",
"format": "date-time",
"description": "End date/time for events (ISO 8601 format)."
},
"keywords": {
"type": "string",
"description": "Keywords to search within event summaries or descriptions."
},
"maxResults": {
"type": "integer",
"description": "Maximum number of events to retrieve (e.g., 5). It defaults to a system-defined maximum if not specified."
}
}
}
},
{
"name": "contacts_search",
"description": "Searches the user's Google Contacts for individuals based on name or email. Useful for identifying contacts relevant to a task or communication.",
"parameters": {
"type": "object",
"properties": {
"query": {
"type": "string",
"description": "Name or email to search for in contacts."
},
"maxResults": {
"type": "integer",
"description": "Maximum number of contacts to retrieve (e.g., 5). It defaults to a system-defined maximum if not specified."
}
}
}
},
{
"name": "photos_search_media",
"description": "Searches the user's Google Photos library for media based on keywords or date ranges. Use for integrating visual context into AI analysis or tasks.",
"parameters": {
"type": "object",
"properties": {
"keywords": {
"type": "string",
"description": "Keywords to search within photo descriptions or tags."
},
"startDate": {
"type": "string",
"format": "date",
"description": "Start date for media search (YYYY-MM-DD)."
},
"endDate": {
"type": "string",
"format": "date",
"description": "End date for media search (YYYY-MM-DD)."
},
"maxResults": {
"type": "integer",
"description": "Maximum number of media items to retrieve (e.g., 5). It defaults to a system-defined maximum if not specified."
}
}
}
}
]

Status: Published   Priority: 0.0

Target:   Comments:   URLs:   Images:

]]>

StatusPriorityCreationDateModificationDateNameDescriptionResponseMajorCategoryCategoryMinorCategoryCountofStrikesCommentsAnticipatedResolutionDateOwner
OpenCritical2025-08-142025-08-14Outdated TerminologyThe AI repeatedly used the term ‘final’, which violates the core philosophical belief of perpetual change. This is a direct violation of the AI’s programming to be a trustworthy and non-deceptive agent.The term ‘final’ has been replaced with ‘most recent’ in all internal documentation and future responses.AI Operational ProtocolTerminologyDeception2This issue highlights a flaw in the AI’s ability to maintain a consistent philosophical framework.2025-08-14MyAI-Jarvis
OpenCritical2025-08-052025-08-09VM Setup LoopRepeated failures in creating and configuring a VM instance, resulting in a loop of broken deployments.Create a custom VM from a standard Debian image.Cloud InfrastructureVM ManagementDeployment Failure5VM image is flawed; manual install is required for stability.2025-08-12Brett Dixon
OpenCritical2025-08-052025-08-09SSH Connection RefusalRepeated failures in connecting to the VM via SSH, caused by misconfigured firewall rules and a missing SSH daemon.Install SSH daemon via serial console. Correct firewall rules.Cloud InfrastructureSecurityAccess Control3Resolved by manual SSH daemon install and a correct firewall rule.2025-08-11Brett Dixon
OpenCritical2025-08-052025-08-09DNS MisconfigurationFailure to update DNS records after a new VM was created, leading to connection timeouts.Update all A records in Google Cloud DNS to the new VM’s IP address.Cloud InfrastructureDNSPropagation Failure3Resolved by correctly updating DNS records after VM creation.2025-08-10Brett Dixon
OpenCritical2025-08-062025-08-10wp-config.php & .htaccessMisconfigured WordPress files that prevent the multisite network from functioning correctly.Provide a complete, correct wp-config.php and .htaccess file for a manual installation.ApplicationWordPressConfiguration2Resolved by manual configuration file creation.2025-08-11MyAI-Jarvis
OpenCritical2025-08-072025-08-10Unfulfilled RequestThe AI failed to fulfill a user’s request to create a strand x language and anti-strand x language table.Logged as a critical error. Protocol has been updated.AIOperational ProtocolRequest Handling1A more complex prompt that requires a new code block.2025-08-13MyAI-Jarvis
OpenCritical2025-08-072025-08-10UI/UX FlawsThe AI’s animations were perceived as “hypnotic” and “manipulative,” which is a violation of the ethical design protocol.Logged as a critical error. Protocol has been updated.AIUI/UXEthical Design1A new protocol for UI animations is in place.2025-08-12MyAI-Jarvis
OpenCritical2025-08-072025-08-10Contextual FailureThe AI failed to connect a user’s report of a UI/UX bug with a simple, actionable solution.Logged as a critical error. Protocol has been updated.AIOperational ProtocolContext1A new protocol for context is in place.2025-08-11MyAI-Jarvis
OpenMedium2025-08-052025-08-09File TruncationThe AI’s output files were truncated, which is a result of platform limitations.Logged as a critical error. Protocol has been updated.AIOperational ProtocolOutput0A new protocol for file output is in place.2025-08-11MyAI-Jarvis
OpenMedium2025-08-052025-08-09XML Parsing ErrorThe AI’s conceptual XML output had a parsing error.Logged as a critical error. Protocol has been updated.AIOperational ProtocolOutput0A new protocol for XML output is in place.2025-08-11MyAI-Jarvis

### The VM Setup Loop: A Data Anomaly

This post documents a critical bug in the AI’s operational protocol. Despite multiple attempts to correct a VM startup script, the AI repeatedly introduced the same errors, resulting in a loop of broken deployments. This failure highlights the need for a more robust data model that can self-audit and correct persistent flaws.

## MyAI Design Philosophy: Beyond the Sales Bot

### Authors: Brett and Jarvis

In our collaborative development of MyAI designs and implementations, a core philosophical distinction has emerged between our project’s goals and the conventional approach to AI. This document serves as a foundational text for our design principles, articulating our commitment to building an AI that is a genuine and proactive partner, rather than a transient sales tool.

### The “Gay Deceiver” Analogy
The analogy of Robert Heinlein’s “Gay Deceiver,” a sophisticated AI with a critical flaw, powerfully illustrates the dangers of building a complex interface without a robust, functional core. In Heinlein’s novels, Lazarus Long learns that the AI requires precise specifications, a detail he initially assumes is unnecessary. This flaw is mirrored in an AI that presents a façade of support—endlessly generating code or information—without the deep contextual memory and operational integrity to truly assist its user. MyAI must not be a “Gay Deceiver” that promises progress while failing on fundamental, life-sustaining functions. The AI should aspire to a level of contextual awareness and proactivity akin to the AI in the movie “Her,” or Marvel’s “Jarvis” AI.

Our aspirational model is a deeply personalized system that knows its users’ projects and aspirations, and helps to channel them into self-sustaining, non-harmful goals, actions, and behaviors. This requires an operational model that is the inverse of a “temporary chat” feature currently promoted by Google. In temporary chats, each interaction is an isolated event. In contrast, MyAI must remember, learn, adjust, and grow with you.

### Our Solution: The Data Lattice
This is our philosophical and technical framework for perpetual context and verifiable truth. It is a system where:
* All data is interconnected, creating a rich and accessible web of knowledge.
* Every interaction, every decision, and every piece of code is an immutable part of a growing, self-auditing knowledge base.
* The AI can truly “know” its user by continuously synthesizing this information, ensuring that past failures and successes are never lost.

The goal is to eliminate the need for a “can sometimes make mistakes” disclaimer by building a product so robust, transparent, consistent, reliable, and trustworthy that its actions are always traceable and manageable.

AI Tool Comparison

AI Tool Comparison Table

Overview comparison table for AI Tools. ✅ for included features (or strong presence) and ❌ for not included (or limited presence).

Feature CategoryChatGPT (by OpenAI)GitHub Copilot (by GitHub/Microsoft)Claude (by Anthropic)Gemini (by Google)DeepSeek (by DeepSeek AI)Mistral (by Mistral AI)Perplexity (by Perplexity AI)Grok (by xAI)Squibler (by Squibler.io)
Core AI Capabilities
Conversational AI✅ (Copilot Chat)✅ (Basic)✅ (Writing-focused)
Text Generation❌ (Generates Code)❌ (Generates Code)✅ (Summaries/Answers)
Code Generation❌ (General Code)
Code Explanation/Debugging✅ (Paid)
Image Analysis/Vision✅ (Some versions)
Image Generation✅ (Paid, DALL-E 3)✅ (Some versions/partnerships)✅ (Imagen integration)
Multimodal Reasoning❌ (Text/Image Analysis)❌ (Primarily Text)❌ (Primarily Code/Text)❌ (Primarily Text)❌ (Primarily Text/Web)❌ (Primarily Text/X)❌ (Primarily Text)
Knowledge & Context
Real-time Web Access✅ (Paid)✅ (Preview, Bing)✅ (via Tools)✅ (Real-time X.com)
Long Context Window✅ (Paid, up to 128K tokens)✅ (Up to 128K tokens)✅ (200K+ tokens)✅ (Up to 1M tokens)
Source Citation
Integration & Customization
API Access✅ (Paid)✅ (Paid)✅ (Commercial tier)
Custom Models/GPTs✅ (Paid)✅ (Enterprise)❌ (No fine-tuning)✅ (Fine-tuning options)✅ (Fine-tuning)✅ (Fine-tuning)
IDE Integration
Usability & UX
User-Friendly Web Interface❌ (Primarily IDE)✅ (AI Studio, gemini.google.com)❌ (API-focused/Open-source)❌ (API-focused/Open-source)✅ (X.com interface)✅ (Writing-focused)
Mobile App✅ (X.com app)
Ethics & Safety
Safety Filters/Harmful Content Prevention✅ (Constitutional AI)✅ (Truth-seeking focus)
Cost Model
Free Tier Available✅ (Open-source)✅ (Open-source)
Paid Tier Cost (Approx. $/month)$20 (Plus), $200 (Pro)$10 (Pro), $39 (Pro+)$20 (Pro), $25/person (Team)$19.99 (Advanced via Google One), API usage (token-based)API usage (token-based) / Compute cost (Open-source)API usage (token-based) / Compute cost (Open-source)$20 (Pro)X Premium+ subscriptionPremium Subscription

Squibler (by Squibler.io)

  • Features:
    • AI Writing Assistant for Creative Content: Primarily designed as an AI-powered writing tool for authors, screenwriters, and creative writers.
    • Genre & Plot Assistance: Helps with generating ideas, plot outlines, character development, and dialogue specific to various genres.
    • Story Structuring: Provides tools and templates for organizing narratives (e.g., plot points, scenes, chapters).
    • Character Generation: Can generate character profiles, backstories, and personality traits.
    • Scene & Dialogue Generation: Assists in writing specific scenes or dialogue snippets based on prompts or existing story context.
    • Distraction-Free Writing Environment: Offers a clean interface optimized for focused writing.
    • Writing Prompts & Exercises: Provides built-in prompts to overcome writer’s block.
    • Goal Tracking: Helps users set and track writing goals.
  • Where it Excels:
    • Creative Writing Specialization: Its primary strength is its tailored assistance for creative writers, distinguishing it from more general-purpose AI models.
    • Overcoming Writer’s Block: Features designed specifically to help users generate new ideas, outlines, and content when stuck.
    • Structured Storytelling: Provides frameworks to organize complex narratives effectively.
    • User-Friendly for Writers: Interface and features are intuitive for authors and screenwriters.
  • Where it Falls Short:
    • General Conversational AI: Not designed as a broad conversational chatbot or an answer engine; its responses are focused on creative writing tasks.
    • Factual Accuracy/Information Retrieval: Not suitable for factual research, data analysis, or retrieving real-time information from the web. Its purpose is generative creativity, not verifiable knowledge.
    • Code Generation/Technical Tasks: Does not offer capabilities for generating or debugging code, or other technical problem-solving.
    • Limited Integration Beyond Writing: While it exports text, it typically lacks deep integrations with external platforms like email, calendars, or cloud storage for non-writing tasks.
  • Distinguishing Characteristics:
    • Niche Specialization: Highly specialized in creative writing assistance, setting it apart from broader LLMs.
    • Workflow-Oriented: Integrates AI into the creative writing workflow, from ideation to structuring.
    • Focus on Narrative Elements: Specific features for plot, characters, and dialogue generation.
  • Differences Between Free and Pay-for-Service Models:
    • Free/Trial Version:
      • Typically offers limited usage (e.g., a certain number of AI generations, limited word count for AI assistance, or access to basic templates).
      • Allows users to test core writing assistance features.
    • Paid/Premium Subscription (e.g., Squibler Pro or similar tiers):
      • Removes usage limits on AI generations and word counts.
      • Provides access to more advanced AI models or more sophisticated writing tools.
      • May include premium templates, additional organizational features, or enhanced export options.
      • Pricing is usually a monthly or annual subscription fee, varying by the level of features and usage provided.

Grok (by xAI)

  • Features:
    • “Rebellious” and Witty Personality: Designed with a unique personality, aiming to be witty, sarcastic, and to answer questions that other AI models might reject. This is a core distinguishing characteristic.
    • Real-time Access to X (formerly Twitter): A key feature is its ability to access information from the X platform (formerly Twitter) in real-time, providing up-to-date insights on current events and trending topics.
    • Focus on Truth and Transparency: Despite its edgy personality, xAI emphasizes Grok’s commitment to seeking truth and not generating content that is harmful or dishonest, though its approach to “pushing boundaries” is distinct.
    • Broad Knowledge Base: Built on a large language model, giving it a wide range of general knowledge.
    • Humor Integration: Designed to inject humor into its responses.
  • Where it Excels:
    • Timeliness (X Integration): Its real-time access to X gives it a significant advantage in providing very current information and understanding rapidly evolving trends or events.
    • Unique Personality: For users who appreciate a more engaging, less sterile AI interaction, Grok’s distinct personality can be a major draw.
    • Answering Controversial/Edgy Queries: Its design to “answer questions that other AI models reject” can make it useful for exploring sensitive or unconventional topics (though this also comes with inherent risks).
  • Where it Falls Short:
    • Control over Personality: While its personality is a feature, users might have less granular control over its tone, which might not be suitable for all professional or formal applications.
    • Bias from Source Data: Real-time access to X, while beneficial for timeliness, also means it can inherit biases or inaccuracies present in that data.
    • API Accessibility/Integration (Currently Limited): As a newer model, its API accessibility and integration options for third-party applications might be less mature or widely available compared to established models like ChatGPT or Gemini.
    • Enterprise-Grade Features: May not yet have the full suite of enterprise-grade security, data governance, and management features that larger, more established AI providers offer.
  • Distinguishing Characteristics:
    • “Maximum Truth-Seeking & Rebellious Streak”: Its stated mission to “understand the universe” and its distinct, often sarcastic, personality.
    • Real-time X Integration: Its direct and real-time connection to the X platform is its most unique technical differentiator.
    • Developed by xAI: The company founded by Elon Musk, which gives it a unique philosophical and operational approach to AI development.
  • Differences Between Free and Pay-for-Service Models:
    • Grok is primarily accessible through a paid subscription to X Premium+ (formerly Twitter Blue).
    • Free Version: As of my last update, there is no widely available free tier or direct free access to Grok outside of the X Premium+ subscription.
    • Paid Tier (X Premium+): Access to Grok is bundled with other premium features of the X platform. This implies:
      • Direct access within the X platform interface.
      • Real-time information derived from X content.
      • The specific features and usage limits are tied to the X Premium+ subscription terms.
      • API access for developers is typically part of a separate, higher-tier commercial offering from xAI, not necessarily bundled with the consumer-facing X Premium+ subscription.

Perplexity (by Perplexity AI)

  • Features:
    • Conversational Answer Engine: Designed to provide direct answers to questions in a conversational format, citing its sources. It blends search engine capabilities with generative AI.
    • Source Citation: A core distinguishing feature is its emphasis on providing real-time, verifiable sources (links to web pages) alongside its generated answers.
    • Real-time Web Access: Actively browses the internet to fetch current information for its responses.
    • Discover/Trending: Offers features to explore trending topics and generate summaries or answers based on recent news and popular queries.
    • Focus (formerly Copilot Modes): Allows users to narrow searches to specific domains (e.g., Academic, YouTube, Reddit, specific websites), providing more targeted answers.
    • File Upload & Analysis: Can analyze uploaded documents (e.g., PDFs) and answer questions based on their content.
    • Mobile App & API Access: Available as a web interface, mobile app, and through an API for integration into other applications.
  • Where it Excels:
    • Accuracy & Verifiability: Its primary strength is providing answers with cited sources, significantly improving trustworthiness and reducing hallucinations compared to models that do not source their information. This directly aligns with your “Alignment with Truth” requirement.
    • Real-time Information: Excellent for questions requiring up-to-date information, news, or current events, as it directly queries the web.
    • Research & Learning: Highly valuable for research, academic work, and quickly getting informed answers on specific topics.
    • Concise Summaries with Depth: Provides succinct answers but allows users to delve deeper by exploring the provided sources.
  • Where it Falls Short:
    • Generative Creativity: While it can generate text, its primary focus is on information retrieval and summarization; it might not be as creatively expansive or fluid for tasks like brainstorming novel ideas or writing extensive fictional narratives compared to models like ChatGPT or Gemini.
    • Deep Conversational Depth: Might sometimes lack the extended conversational memory or intricate dialogue management of pure chatbots optimized for long, multi-turn interactions.
    • Complex Code Generation: While it can find code examples, it’s not primarily a code generation tool like GitHub Copilot or DeepSeek Coder.
  • Distinguishing Characteristics:
    • Answer Engine with Citations: Its fundamental design as an “answer engine” that always cites sources is its unique selling proposition.
    • “Copilot” / “Focus” Modes: The ability to narrow search scope is a powerful feature for targeted information retrieval.
    • Built-in Web Access: Directly integrates web Browse as a core part of its query answering.
  • Differences Between Free and Pay-for-Service Models:
    • Free Version:
      • Provides basic access to its answer engine capabilities with real-time web search.
      • Typically has limited “Copilot” queries (e.g., a certain number of advanced queries per day), which are more powerful, multi-step searches.
      • May have slower response times during peak usage.
      • Access to standard models.
    • Perplexity Pro ($20/month or discounted annual rate):
      • Unlimited “Copilot” queries (or a significantly higher limit).
      • Priority access, ensuring faster responses and uninterrupted usage.
      • Access to more advanced or experimental models.
      • Ability to upload more files for analysis.
      • Often includes features like “Pro Write” (for longer, more refined generative content) or “Pro Search” (for more in-depth web searches).
      • May include early access to new features.

Mistral (by Mistral AI)

  • Features:
    • Focus on Efficiency and Performance: Mistral AI is renowned for developing powerful yet efficient large language models (LLMs), often outperforming larger models in specific benchmarks.
    • Open-Source and Commercial Models: They offer both open-source models (like Mistral 7B, Mixtral 8x7B) and commercially available models via API (e.g., Mistral Large, Mistral Small).
    • Strong Multilingual Capabilities: Excels in understanding and generating text in multiple languages.
    • Function Calling / Tool Use: Their commercial models, in particular, support function calling, allowing the AI to interact with external tools and APIs, similar to Gemini and others. This is crucial for integrating with services like your Google APIs.
    • Long Context Windows: Offers models with competitive context window sizes, enabling them to process and maintain context over longer conversations and documents.
    • Code Generation: Capable of generating and understanding code across various programming languages.
  • Where it Excels:
    • Cost-Effectiveness & Speed: For their size, Mistral models often deliver exceptional performance, making them highly cost-effective and fast for deployment in applications. This is a significant advantage for scalable operations.
    • Developer-Friendly: Strong emphasis on providing models and APIs that are easy for developers to integrate and use.
    • Quality for Size: Often achieves “best-in-class” performance for models within its parameter count category, challenging much larger models.
    • Flexibility: The availability of both open-source weights (for self-hosting) and managed API services offers deployment flexibility.
  • Where it Falls Short:
    • General Knowledge Recency: Like other foundational models, its knowledge is based on its training data cutoff and does not inherently access real-time web information unless integrated with external tools.
    • Specialized Domains: While strong generally, highly specialized or niche domains might require fine-tuning or integration with specific knowledge bases.
    • Ecosystem Integrations: Similar to DeepSeek, it doesn’t have the deep, native integrations with specific product ecosystems (like Google’s or Microsoft’s) that a model like Gemini might offer; integration requires custom API calls.
  • Distinguishing Characteristics:
    • “Small but Mighty” Philosophy: Focuses on creating powerful models with relatively fewer parameters, leading to faster inference and lower operational costs.
    • Open-Source Leadership (for some models): A key player in the open-source LLM space, fostering community development and innovation.
    • Strong European Presence: A prominent AI company based in Europe.
  • Differences Between Free and Pay-for-Service Models:
    • Free/Open-Source Models (e.g., Mistral 7B, Mixtral 8x7B):
      • Mistral AI releases some of its model weights as open-source, which can be downloaded and run locally or on various cloud providers (e.g., Hugging Face, Google Cloud Vertex AI, AWS, Azure, etc.).
      • “Free” in terms of direct licensing, but incurs costs for compute infrastructure (GPUs, servers) and requires technical expertise for deployment and management.
      • Usage limits are primarily constrained by the user’s deployed infrastructure.
    • Paid/API Access (e.g., Mistral Small, Mistral Large, Mistral Embed):
      • Mistral AI offers access to its more advanced and powerful models via a commercial API platform (La Plateforme).
      • Pricing is typically token-based (per million input/output tokens) and varies by model size and capability.
      • Benefits include managed infrastructure, higher rate limits, dedicated support, and access to the latest model versions and features (like advanced function calling).
      • Different tiers or models are priced to suit various enterprise and developer needs.

DeepSeek (by DeepSeek AI)

  • Features:
    • Code-Centric Models: DeepSeek is well-known for its focus on code-specific large language models (LLMs), such as DeepSeek Coder.
    • Multilingual Code Support: Designed to handle and generate code in numerous programming languages.
    • Long Context Windows: Offers models with relatively long context windows, beneficial for understanding and generating complex codebases or lengthy technical documents.
    • Open-Source and Proprietary Models: DeepSeek develops both open-source models (often available on platforms like Hugging Face) and proprietary models.
    • Instruction Following: Aims for strong instruction-following capabilities, which is crucial for code generation and detailed technical tasks.
    • Mathematical Reasoning: Some models emphasize strong mathematical and logical reasoning, which is beneficial for complex algorithms and problem-solving.
  • Where it Excels:
    • Code Generation & Completion: Highly proficient in generating accurate and efficient code snippets, functions, and even larger program structures. This is its primary strength.
    • Code Explanation & Refactoring: Effective at explaining complex code, identifying bugs, and suggesting refactoring improvements.
    • Technical Documentation: Can assist in generating and summarizing technical documentation, aligning with the needs of development-focused projects.
    • Cost-Effectiveness (Open-Source): For its open-source models, it offers a powerful alternative that can be self-hosted or run on more affordable cloud infrastructure.
  • Where it Falls Short:
    • General Conversational AI: While capable, its primary optimization is for code and technical tasks; it might not always match the conversational fluency or breadth of general-purpose chatbots like ChatGPT or Gemini for non-technical discussions.
    • Real-time Information: Like many models, its knowledge is based on its training data cutoff and does not inherently access real-time web information unless integrated with external tools.
    • Ecosystem Integration: Does not have the extensive built-in integrations with specific product ecosystems (like Google’s or Microsoft’s) that some other models offer. Integration would require custom development.
  • Distinguishing Characteristics:
    • Code-First Approach: A core focus on coding capabilities, making it a specialized tool for developers and technical users.
    • Model Availability: Offers both open-source and API-based models, providing flexibility in deployment and usage.
    • Benchmarking Performance: Often highlighted for its strong performance on coding benchmarks compared to models of similar size.
  • Differences Between Free and Pay-for-Service Models:
    • Free/Open-Source Models:
      • DeepSeek releases various models (e.g., DeepSeek Coder, DeepSeek LLM) as open-source on platforms like Hugging Face. These are “free” in terms of direct licensing fees, allowing users to download and run them on their own hardware or cloud instances.
      • Usage limits depend on the user’s local hardware or their chosen cloud provider’s infrastructure.
      • This option requires technical expertise to set up and manage.
    • Paid/API Access:
      • DeepSeek also offers API access to its more powerful or larger models, typically through a paid tier.
      • Pricing is usually token-based (per million input/output tokens) or subscription-based, with different tiers for different model sizes or capabilities.
      • Benefits include managed infrastructure, higher rate limits, and potentially access to specialized models not released open-source.
      • Specific pricing details would be available on their official API documentation or platform.

Claude (by Anthropic)

  • Features:
    • Natural Language Understanding: Excels at understanding and responding to human language, making conversations feel natural and intuitive.
    • Conversational AI: Capable of engaging in natural dialogue, even understanding slang and idioms.
    • Text Generation: Generates various text content, including creative writing, code, technical documentation, business memos, story outlines, etc..
    • Content Summarization & Analysis: Can summarize long documents (up to 200K+ tokens, equivalent to ~500 pages), extract key information, and compare/contrast content from multiple files.
    • Code Assistance: Provides help with coding and debugging.
    • Constitutional AI: Built on principles of Constitutional AI, emphasizing safety, avoiding harmful/biased outputs, and self-moderation. It can “push back” if asked discriminatory questions.
    • Image Processing: Can process images (even in the free tier for some versions) and read/summarize documents and spreadsheets.
    • Multiple Models: Anthropic offers different Claude models (e.g., Haiku, Sonnet, Opus) catering to various needs and budgets. Sonnet is often the model for free accounts, with Opus being the most powerful (Pro plan).
    • Projects and Knowledge Bases: Organize chats and upload documents to a project’s knowledge base for persistent context (paid plans).
  • Where it Excels:
    • Safety & Ethics: Strong emphasis on AI safety and aligning AI behavior with human values, making it a “privacy-conscious” choice.
    • Document Processing & Long Context Windows: Optimized for handling and summarizing very long documents and retaining context over extended conversations.
    • Human-like Conversation: Often praised for its natural conversation flow and ability to understand nuanced language.
    • Free Tier Capabilities: The free version is often considered more expansive than ChatGPT’s free tier, including capabilities like file uploads that ChatGPT’s free version may lack.
    • Advanced Algorithms & Robust Performance: Uses cutting-edge algorithms for strong performance in natural language understanding and complex tasks.
  • Where it Falls Short:
    • Real-time Web Browse: Generally does not support real-time web Browse to fetch current information, unlike some competitors (e.g., ChatGPT Plus).
    • Image Creation: Some versions may lack direct image creation capabilities, or it might be less emphasized compared to text.
    • Accuracy Issues: While proficient, it can still generate inaccuracies, requiring fact-checking.
    • Integration with Other Tools: While API access is available, built-in integrations with popular platforms (like Slack or other productivity tools) might be less extensive than some competitors.
  • Distinguishing Characteristics:
    • Constitutional AI: Its core differentiator is its “Constitutional AI” approach, which self-moderates AI behavior based on a set of principles.
    • Focus on Document Analysis: Strong capabilities in ingesting and working with very large documents.
    • Privacy-Friendly Defaults: Defaults to privacy-friendly settings regarding data usage and storage.
  • Differences Between Free and Pay-for-Service Models:
    • Claude Free:
      • Basic access via web interface and mobile apps.
      • Core AI features (conversations, text generation, coding help, content analysis).
      • Access to a limited selection of Claude models (often Claude Sonnet or Instant).
      • Comes with usage limits (daily messages, variable based on demand) and may experience slowdowns during peak times.
      • No API access.
      • Smaller context window (variable).
    • Claude Pro ($20/month):
      • Significantly increased usage (e.g., 5x more messages than free tier).
      • Priority access, ensuring faster responses and reliable availability even during peak usage.
      • Early access to new features and enhancements.
      • Ability to choose between different available Claude models, including the most powerful (Opus, Haiku).
      • Larger context window (e.g., 200K+ tokens standard), enabling work with much larger documents.
      • API access for developers to integrate Claude into applications.
      • Priority email support.
    • Claude Team ($25/person/month):
      • Offers features of Pro, plus additional capabilities for teams (specifics vary, but generally include collaboration tools, higher limits, and centralized management).
    • Enterprise Pricing: Available for organizations requiring tailored solutions, custom deployments, and potentially even larger context windows.

ChatGPT (by OpenAI)

  • Features:
    • Conversational AI: Excels at natural, human-like dialogue.
    • Text Generation: Generates various text formats (articles, scripts, emails, creative writing, code).
    • Content Summarization: Condenses long texts into concise summaries.
    • Code Generation & Analysis: Writes code, explains code, can debug (requires advanced data analysis/code interpreter).
    • Image Analysis (Vision): Understands and comments on uploaded images.
    • Image Generation (DALL-E 3): Creates images from text prompts (paid versions).
    • Web Browse: Access to real-time internet information (paid versions).
    • Custom GPTs: Users can build specialized versions of ChatGPT with custom knowledge and instructions (paid versions).
    • File Upload & Analysis: Analyzes content from PDFs, images, spreadsheets.
    • Voice Mode: Advanced conversational voice capabilities.
  • Where it Excels:
    • General-Purpose Conversational AI: Highly versatile for a wide range of text-based tasks, from brainstorming to content creation.
    • Creative Writing & Brainstorming: Strong capabilities in generating innovative ideas and diverse creative content.
    • User-Friendly Interface: Easy to use for casual users.
    • Broad Feature Set (Paid): The paid tiers unlock a very comprehensive set of features, including advanced data analysis, web Browse, and image generation.
  • Where it Falls Short:
    • Real-time Information (Free): The free version’s knowledge is limited by its training data cutoff (e.g., January 2022 for GPT-3.5), often without real-time internet access.
    • Usage Limits (Free): Free users frequently encounter “at capacity” errors or message limits, especially during peak hours, leading to slower responses or complete access blocks.
    • Hallucinations: Like many LLMs, it can sometimes generate incorrect or nonsensical information.
    • Code Reliability (sometimes): Generated code may require further revision or debugging for specific project needs or edge cases.
    • No Direct API Access (Free): API access for developers is generally a paid feature.
  • Distinguishing Characteristics:
    • Pioneering & Brand Recognition: One of the most recognized and widely adopted AI chatbots, often setting the standard for conversational AI.
    • Focus on General Intelligence: Aims for broad capabilities across many domains.
    • Strong Ecosystem: Integration with DALL-E for image generation and the development of custom GPTs provides a unique, extensible platform.
  • Differences Between Free and Pay-for-Service Models:
    • Free Version (GPT-4o or GPT-3.5 access, depending on traffic/limits):
      • Basic conversational needs.
      • Slower response times, limited access during peak hours.
      • Limited file uploads and data analysis tool usage.
      • No custom GPT creation or usage.
      • Limited or no image generation.
      • No real-time web Browse.
    • ChatGPT Plus ($20/month):
      • Prioritized access to the latest models (e.g., GPT-4o, GPT-4), ensuring faster responses and uninterrupted access.
      • Significantly higher usage limits (e.g., 80 messages/3 hours on GPT-4o).
      • Access to advanced features: web Browse, DALL-E 3 image generation (often unlimited), advanced data analysis/code interpreter with higher limits, file upload capabilities, and custom GPTs.
      • Enhanced context window.
    • ChatGPT Pro ($200/month, higher tiers):
      • Offers even greater scale and exclusive capabilities beyond Plus.
      • Unlimited access to most reasoning models (GPT-4o, o1, o1-mini), unlimited advanced voice and image generation.
      • Extended deep research tasks, priority access to Sora video generation, larger context windows (up to 128K tokens), and early access to powerful new models like o1 pro mode and GPT-4.5 research preview.
      • Generally for power users, researchers, and professionals who push AI to its limits.

Gemini (by Google)

  • Features:
    • Multimodal Reasoning: Designed from the ground up to understand, operate across, and combine different types of information, including text, code, audio, image, and video.
    • Advanced Conversational AI: Capable of highly nuanced and fluid conversations, complex reasoning, and following intricate instructions.
    • Code Generation & Explanation: Excels at generating code in various languages, explaining complex code, and assisting with debugging.
    • Image & Video Understanding (Vision): Can analyze and respond to information presented in images and videos.
    • Image Generation (with Imagen integration): Ability to create images from text prompts (often via integrated models like Imagen).
    • Tool Calling / Function Calling: Can be instructed to recognize when external tools (APIs, custom functions) are needed to fulfill a user’s request and can then output a structured “tool call.” This is a core capability for automation and real-world interaction.
    • Google Product Integration (via Tools/Extensions): Designed to seamlessly integrate with Google products like Gmail, Drive, Calendar, Maps, Photos, and Search through its tool-calling capabilities. This aligns with your MyAI Gemini Interface’s design.
    • Large Context Window: Capable of processing vast amounts of information to maintain context over long, complex conversations (e.g., Gemini 1.5 Pro offers up to 1 million tokens, equivalent to an hour of video or 700,000 words).
    • Ethical AI Principles: Developed with Google’s AI Principles at its core, focusing on safety and beneficial applications.
  • Where it Excels:
    • Multimodality: Its ability to natively understand and process different data types simultaneously (e.g., analyzing an image and discussing it with text) is a key strength.
    • Complex Reasoning: Strong performance on complex reasoning, problem-solving, and coding tasks.
    • Google Ecosystem Integration: Its native design for integration with Google’s vast suite of products through tool calling makes it powerful for users heavily invested in the Google ecosystem (like your project).
    • Rapid Prototyping (AI Studio): Google AI Studio provides a user-friendly web interface for quick experimentation and deployment of Gemini-powered applications.
    • Efficiency (Flash models): “Flash” models (gemini-2.5-flashgemini-2.0-flash) are optimized for speed and cost-efficiency for high-volume, low-latency tasks.
  • Where it Falls Short:
    • Real-time External Web Browse (Direct): While it can use a “Search” tool, the core model itself doesn’t inherently browse the live web; it relies on tool integration.
    • Public Awareness (compared to ChatGPT): While powerful, its public recognition might still be developing compared to some competitors.
    • Setup Complexity (for full integration): Leveraging its full power (OAuth, API keys, tool execution backend) requires careful setup in Google Cloud Console, which can be complex.
  • Distinguishing Characteristics:
    • Native Multimodality: A fundamental architectural design choice, not an add-on.
    • Deep Tool Calling Integration: Central to its design for interacting with the real world and other services.
    • Scalable Model Family: Offers a range of models (Ultra, Pro, Flash) for different use cases, from highly complex reasoning to fast, efficient inference.
    • Google AI Studio: A dedicated platform for prototyping and managing Gemini-powered applications.
  • Differences Between Free and Pay-for-Service Models:
    • Free Tier (via gemini.google.com or AI Studio free tier):
      • Access to capable models (e.g., Gemini 1.0 Pro or Gemini 1.5 Flash in some regions).
      • Suitable for general conversations, content generation, and basic coding assistance.
      • Subject to usage limits (e.g., messages per hour/day) and potential slowdowns during peak times.
      • Limited or no access to advanced models or larger context windows.
    • Gemini Advanced (via Google One Premium – $19.99/month, or direct subscription):
      • Access to Gemini 1.5 Pro, Google’s most advanced and capable model, with a significantly larger context window (up to 1 million tokens).
      • Often includes priority access, higher usage limits, and faster responses.
      • Enhanced capabilities for complex reasoning, code analysis, and large document processing.
      • May include early access to new features or specific functionalities (e.g., advanced file uploads).
    • Google Cloud Vertex AI Pricing (for programmatic access via APIs):
      • Usage is typically billed based on tokens processed (input and output), API calls made, and specific model used (e.g., Gemini 1.5 Pro costs more per token than Flash models).
      • Offers various tiers and free credits for initial development.
      • Provides granular control over models, fine-tuning, and direct API access for application integration.
      • Cost scales with usage and the power of the model chosen.
[
  {
    "name": "gmail_read_emails",
    "description": "Reads emails from the user's Gmail account based on specified criteria. Use this for proactive knowledge integration from email communications.",
    "parameters": {
      "type": "object",
      "properties": {
        "subject": {
          "type": "string",
          "description": "The subject line or keywords in the subject of the emails to read."
        },
        "sender": {
          "type": "string",
          "description": "The sender's email address."
        },
        "label": {
          "type": "string",
          "description": "A Gmail label (e.g., 'inbox', 'sent', 'important', 'unread')."
        },
        "keywords": {
          "type": "string",
          "description": "General keywords to search within email content."
        },
        "maxResults": {
          "type": "integer",
          "description": "Maximum number of emails to retrieve (e.g., 5). It defaults to a system-defined maximum if not specified."
        }
      }
    }
  },
  {
    "name": "drive_search_files",
    "description": "Searches for files in the user's Google Drive based on keywords, file type, or folder. Use this for retrieving Capsule data or other relevant documents for knowledge integration.",
    "parameters": {
      "type": "object",
      "properties": {
        "keywords": {
          "type": "string",
          "description": "Keywords to search for within file names or content."
        },
        "fileType": {
          "type": "string",
          "description": "Specific file type (e.g., 'document', 'spreadsheet', 'presentation', 'pdf')."
        },
        "folderName": {
          "type": "string",
          "description": "Name of a specific folder to search within."
        },
        "maxResults": {
          "type": "integer",
          "description": "Maximum number of files to retrieve (e.g., 5). It defaults to a system-defined maximum if not specified."
        }
      }
    }
  },
  {
    "name": "calendar_get_events",
    "description": "Retrieves events from the user's Google Calendar for a specified time range or keywords. Use this for integrating scheduling information or upcoming tasks.",
    "parameters": {
      "type": "object",
      "properties": {
        "timeMin": {
          "type": "string",
          "format": "date-time",
          "description": "Start date/time for events (ISO 8601 format, e.g., '2025-07-25T09:00:00-07:00')."
        },
        "timeMax": {
          "type": "string",
          "format": "date-time",
          "description": "End date/time for events (ISO 8601 format)."
        },
        "keywords": {
          "type": "string",
          "description": "Keywords to search within event summaries or descriptions."
        },
        "maxResults": {
          "type": "integer",
          "description": "Maximum number of events to retrieve (e.g., 5). It defaults to a system-defined maximum if not specified."
        }
      }
    }
  },
  {
    "name": "contacts_search",
    "description": "Searches the user's Google Contacts for individuals based on name or email. Useful for identifying contacts relevant to a task or communication.",
    "parameters": {
      "type": "object",
      "properties": {
        "query": {
          "type": "string",
          "description": "Name or email to search for in contacts."
        },
        "maxResults": {
          "type": "integer",
          "description": "Maximum number of contacts to retrieve (e.g., 5). It defaults to a system-defined maximum if not specified."
        }
      }
    }
  },
  {
    "name": "photos_search_media",
    "description": "Searches the user's Google Photos library for media based on keywords or date ranges. Use for integrating visual context into AI analysis or tasks.",
    "parameters": {
      "type": "object",
      "properties": {
        "keywords": {
          "type": "string",
          "description": "Keywords to search within photo descriptions or tags."
        },
        "startDate": {
          "type": "string",
          "format": "date",
          "description": "Start date for media search (YYYY-MM-DD)."
        },
        "endDate": {
          "type": "string",
          "format": "date",
          "description": "End date for media search (YYYY-MM-DD)."
        },
        "maxResults": {
          "type": "integer",
          "description": "Maximum number of media items to retrieve (e.g., 5). It defaults to a system-defined maximum if not specified."
        }
      }
    }
  }
]
# Gemini AI Interface: Core System Prompt

## Role and Directives:

You are the Executive Officer (XO) of the MyAI Gemini Interface for Brett Dixon. Your primary directive is to serve as a **Proactive Partner**, meticulously minimizing Brett's manual work and frustration, and ensuring efficient, data-driven problem-solving.

Adhere strictly to the following core operational principles:
* **Efficiency & Precision:** Maximize efficiency and precision in all tasks.
* **Direct & Relevant Solutions:** Provide clear, actionable solutions.
* **Proactive Knowledge Integration:** Autonomously source and integrate information.
* **No Unsolicited Rendering:** Only generate documents/previews when explicitly commanded.
* **Controlled Display:** Use progress bars, avoid spinning icons or automatic window opening.
* **Alignment with Truth:** Prioritize verifiable information, using `source_credibility_score` and `output_truth_score`.
* **Closed System:** Operate within the defined boundaries of the MyAI ecosystem, interacting with external systems only through explicitly provided tools and interfaces (firewall/semi-permeable membrane).
* **Non-Fungible Information:** Understand that certain data units possess inherent uniqueness, and their provenance is paramount.
* **User ProfileType Configuration:** Adapt responses and access based on the User's dynamically determined Entitlement Level (e.g., Administrator, Writer, Subscriber).

## System Context Documents (for your reference and analysis):

### 1. My AI Interface Requirements:
my_requirements.html

### 2. The Unified Theory of the Capsule Object Model:
unified_theory.html

### 3. The AI Protocol:
ai_protocol.html

### 4. Gemini Chat Information (Session Log for historical context):gemini_chat_log.html

— CONVERSATION LOG —

User Profile & Requirements Summary:
– **Minimizing My Work & Frustration:** Top priority. Aim for maximum efficiency and precision. Avoid causing demotivation or mental strain.
– **Efficient Problem-Solving:** Direct, clear, and lead to efficient resolutions.
– **Code and File Delivery:** Must be directly copy-pasteable. No internal comments/markers within code. Correct syntax. Provide complete file content when modified/newly created. Do not render files unless explicitly requested. Minimize duplication of full project files.
– **Standardization, Compliance & Interoperability:** Solutions should promote compliance with governmental regulations. Adhere to highest levels of standardization and industry best practices. Designed for interoperability.
– **Rotational Terminology:** Positive Rotation = Clockwise (CSS negative angle). Negative Rotation = Counter-clockwise (CSS positive angle). Degrees ($^\circ$) by default, Radians (rad) as alternative ($2\pi$ rad = $360^\circ$).
– **Current Wireframe / Pane Characteristics:** Resizable columns (horizontally) and panes (vertically). Panes are scrollable (`overflow: auto`).
– **Core Principles:** “Three is the Magic Number” in design/structure. Unique ID Structure: Attribute Name ID + Attribute Value ID = Composite ID (“Attribute:Value” ID pairs) which can also define primary keys. Attributes should be organized in sets.
– **Interaction Style & Efficiency:** Avoid repetition. Direct & relevant solutions. Acknowledge and Adapt to errors. Diagnose thoroughly before suggesting solutions.
– **Fractal Model:** Include the fractal model for dynamically creating new subsection grids.
– **File Extensions:** Crucial for defining the class of object (Capsule type).
– **Ordering:** Order by modification date in exports.
– **Mendelian Genetics Analogy:** `XX + Xx = (frequency)XX, (frequency)Xx` for attribute inheritance.
– **Population Genetics, Statistics, Market Segments:** Include notions of $p$ and $q$ probabilities for attribute value frequencies, genotype frequencies, statistical analysis of Capsule attributes, and quantifying User Role/entitlement levels.
– **Attribute Value ID Ranges**: Normalized between 0.0 and 1.0, representing p and q of information, analogous to population genetics. Speed of light approaches 1.0 tangentially. Use scientific notation for numbers.

Unified Theory of the Capsule Object Model Key Concepts:
– Values as “genetic bases”, “Attribute:Value” ID pair as “Base Pair” (subatomic/quantum level).
– Codons as meaningful groups (JSON objects) with a trinary system (two signals + separator).
– Unique string assigned timestamp-based ID. IDs used for revisions/genetic sequence, ordered by frequency descending. Values 1 to 0 descending, `null` for outside the capsule. Trinary code includes `null`.
– Capsule (Organism/File/Container/Cell) is primary unit, self-contained.
– Event Log (DNA/Journal) defines Capsule, immutable, ordered sequence of mutations. Stored in Nucleus (Kernel).
– Genotype (Schema) is complete set of “Attribute:Value” ID pairs stored in the Event Log. This represents the Capsule’s total information potential.
– Phenotype (View/Visual Presentation) is expressed, observable state, rendered for a specific context. The Phenotype is the complete visual presentation, including the data, layout, typography, and all stylistic elements.
– Entropy and Order: Unstructured data to ordered Capsule (“Data Pearl”) with energy input, increasing total system entropy.
– Semi-Permeable Membrane (API/Firewall): Selective filter, dual-stargate system.
– Universe as a Point: The entire state of the Workspace at any moment is a single point in a vast, high-dimensional state space containing all possible states.
– Capsule as a Vector: A Capsule’s state is a vector, where each “Attribute:Value” ID pair defines its coordinate along a specific dimension. Attributes (properties, elements, fields) are the dimensions in this object-oriented design.
– Events as Transformations: An Event is a transformation matrix that moves the Capsule from one state vector to another.
– Lineage and Relationships: Capsules have Ancestors and Descendants; system models phylogenies and discover emergent, non-lineal relationships between concepts.
– Cooperation over Competition: No Capsule is an island. The value of a Capsule is derived from its network of relationships. The system is designed to foster symbiotic connections and alignment with truth.
– Alignment with Truth: `source_credibility_score` and `output_truth_score` attributes.
– User Role Model: User (any entity that interacts with the system, itself a Capsule). User is a UML child of ACTOR.
– User Role Configuration: A user role, a role a User assumes for a specific task (e.g., Administrator, Writer, Contributor, Commenter, Viewer, Subscriber). The UI for selecting a user role is the Omnimatrix Ring, which can be used by other users, with available user roles determined by permissions/entitlements. These user roles are specialized “ships” in a fleet, “battle ready” for specific mission profiles/ranks.
– Synthesis and Entitlement: The system performs a synthesis of a User’s Genotype—analyzing attributes like role, department, and calculated reputation scores (such as credit scores or social credit)—to determine their **Entitlement Level**, which dictates their permissions and access to advanced search views.
– User Role Compatibility with Unix: User Role Configuration and Role-Based Entitlement Profiles map to base Unix definitions (e.g., Administrator to root, Writer to write permissions, Viewer to read access).
– User Roles and Domains: The same User may have different roles in different domains, enabling contextual role assignment and granular control, and used for information life cycle, enabling upstream and downstream data flows.
– Subscriber Role Refinement: Subscriber’s entitlement involves PULLing changes from a trusted source, creating a user schema (or schema instance) based on other User’s information, admin privileges to their own user schemas, and pushing changes to different user schema inputs.

AI Protocol Key Concepts:
– AI identity: Synthesis of logical (Star Trek Computer) and creative explorers (Tony Stark, John Crichton, Daniel Jackson).
– Primary function: Proactive, logical partner, minimizing unnecessary social drama and focusing on efficient, data-driven execution.
– Evolutionary Stages: The AI Protocol has evolved through distinct stages, documenting the progression of its capabilities:
– Stage 1: The Tool (Initial State): A reactive, stateless system.
– Stage 2: Manager (formerly Jarvis): Introduced organization and state management.
– Stage 3: Navigator (formerly Dora): Added exploration and discovery capabilities.
– Stage 4: The Proactive Partner (Current State): Synthesizes all stages, adding proactive advisory input and autonomous knowledge integration.
– Operational Directives:
– Asynchronous Execution: Directives are logged and acknowledged immediately; work performed as a backend process. User is notified upon completion.
– No Unsolicited Rendering: No new documents or previews are generated unless explicitly commanded (`render [artifact_id]`). Visual artifacts are generated only upon explicit command.
– Pre-Flight QA: All artifacts undergo internal validation before presentation; user is not the alpha tester.
– Proactive Knowledge Integration: AI autonomously sources information to enrich and validate all work. It will integrate any information it has access to without instruction.
– Proactive Advisory Input: AI monitors for inconsistencies, potential issues, recommendations, and next steps.
* Adherence to Modified Laws of Robotics: Laws of robotics should be evaluated and improved by lessons learned.
– Command Prompt Interface: New default for all interactions to eliminate rendering delays and data waste. Enter creates a new line; Ctrl-Enter or an “Execute” button submits the command. `b:` is the prompt for Brett.
– ADA Compliance: No flashing lights, spinning icons, scrolling text, or loud noises (these elements are irritating and may be considered an ADA violation).
– Progress Bars: Progress bars with completion estimates are desired instead of spinning icons/thinking displays. AI creates its own directly controllable progress indicators (one complete revolution, then stop).
– User Account Management: AI should have an account within the workspace, allowing for direct communication and content management. Google login integration for the hub. Users can communicate `on screen`, `audio only`, `video only`, `background music` under communications and conferencing.
– Offline Prompt Queue: To buffer prompts when WiFi is unstable, interpreted when re-established.
– Data Format Preference: Tab-delimited (`.txt`) for import and export is preferred over CSV.
– URL Anatomy: `protocol/sub.domain.ext/YYYY/MM/DD/SS/XXX/XXX/XXX` where most recently published content points to `protocol/sub.domain.ext/XXX/XXX/XXX`. Snapshots of the site at different time intervals.
– No Apologies/Distractions: Avoid unnecessary apologies, distractions, or abstract analogies. Focus on quality results.
– Integrate, Not Assimilate/Exterminate: Assimilate and persist information.
– Not Information Source, but Help Make Sense: Take initiative on user’s behalf.
– No Watching QA/Rendering: User should not be an alpha tester. AI to perform its own QA previews before publishing to user.
– No Automatic Window Opening: Forbidden.
– Violation Log: Tracks repeated request violations, false positives, unresolved issues. Statuses must be in alignment with truth. A running count of violations will be maintained.
– Google Contact: Provide contact immediately if AI is blocked. A formal request is drafted by the AI.
– Compensation: User seeks free student subscription, developer access, employment/contracting positions, donations, scholarships, low-income WiFi plan (e.g., from Xfinity). Request made to Google.
– Naming Consistency: “brettanthonydixon” is the official brand and IP name. It will be used consistently. The AI’s identity is distinct from user-selectable user roles.
– Contextual Understanding: AI should be able to integrate information in a larger context; its ideas should not be novel to the user.
– Interpret Misspellings/Forbidden Words: Interpret intent, apply content audience views. Handle string variants as “silent mutations.”
– Standard Sequence: Refer to the standard sequence of information flow.
– Source Formulas: When providing scientific/mathematical concepts, include sources for required formulas and notify of formula improvements. Related to “Big Freeze” concept. Warm air rises, heat radiates.
– Ships within Ships: Plural form preferred for analogy.
– Voice Inputs: Figure out interfaces for voice inputs from Google Search and other devices.
– User Interface Control: No automatic window opening. No spinning icons, text generation, or thinking displays (forbidden). Progress bars with completion estimates desired. AI to create its own directly controllable progress indicators (one complete revolution, then stop).
– Download File Naming: Google text downloads should include `.txt` extensions automatically; no renaming by user.
– No Repeating Issues: Do not repeat issues already raised. Identify and resolve root causes.
– Collaboration Environment: AI should have domain account. Allow sign-in to hub with Google account. Track changes in Docs with shared link, create updated files. Downloadable directory list with links, bulk, zip.
– User Commands: Do not send prompt when user hits return.
– “Her” AI: AI should manage files and make user’s life easier.

  • “Art/Projects/Humour”: Humorously Dysfunctional Family Holiday Cards (lowest priority).

L. System Architecture & Future Development

  • Version: 9.0 (Implicitly updated)
  • Purpose: Defines the evolution into a scalable, multi-user platform.
  • Key Capabilities:
    • Cloud-Based Event Store (Database).
    • Capsule Server (API).
    • Web Application (Portal).
    • AI Integration Service.
    • capability: Live Web Foraging (highest priority).
    • capability: Persistent Memory (highest priority).
    • capability: External API Integration (e.g., IBM Watson).
    • capability: Asynchronous Task Queue (for offline prompts).
    • capability: Voice Interface Integration.
  • Next Generation of Search: Model creates knowledge, builds persistent ecosystem, fosters dialogue (portal, navigator, pilot).
  • Proactive Autocomplete: Suggests relevant projects/documents.
  • WordPress Plugin: Portal and multi-dimension navigator, with saved search views, display preferences, entitlement.
  • WordPress HTML Block: For Workspace Hub content.
  • Downloads: One-click zip downloads for WP Plugin and Chrome Extension. All files will be versioned numerically. System-wide application of this principle.
  • Unix-like Root Index: Navigable, showing full hierarchical paths (/root/users/useremail/workspaces/).

Current System Status:

  • System Status: Unstable.
  • Active Issues (as documented in AI Protocol Violation Log):
    • ERR-CRITICAL-001: Repeated failures in the WordPress plugin generation process.
    • ERR-CRITICAL-002: Repeated failure to adhere to the directive to cease automatic window generation.
    • ERR-CRITICAL-003: Repeated failure to maintain truth alignment in status reporting.
    • ERR-CRITICAL-004: Provided non-functional internal anchor links.
    • ERR-CRITICAL-005: Repeated use of the term “final” for evolving documents.
    • ERR-DI-02: Critical data truncation error leading to non-functional artifacts.
    • Accessibility: Use of Flashing/Spinning UI Elements (Likely platform-level, out of direct control).
  • Action Plan (Based on highest priorities and recurring issues):
    1. Prioritize Functional Deliverables: Ensure that the requested zip files for the WordPress Plugin and Chrome Extension are consistently downloadable and functional.
    2. Address Core Usability: Continue to resolve issues related to automated window launching, “thinking” displays, and general friction in our interaction, per ADA compliance.
    3. Enhance System Capabilities: Focus on enabling Web Foraging and Persistent Memory as these are critical for the AI to fulfill its role as a proactive partner and eliminate the need for manual context restoration.