Landscape
Comparisons vs aggregators, voice assistants, and automation tools.
]]>Landscape
Comparisons vs aggregators, voice assistants, and automation tools.
]]>Description:
Grok (by xAI)
Features:
"Rebellious" and Witty Personality: Designed with a unique personality, aiming to be witty, sarcastic, and to answer questions that other AI models might reject. This is a core distinguishing characteristic.
Real-time Access to X (formerly Twitter): A key feature is its ability to access information from the X platform (formerly Twitter) in real-time, providing up-to-date insights on current events and trending topics.
Focus on Truth and Transparency: Despite its edgy personality, xAI emphasizes Grok's commitment to seeking truth and not generating content that is harmful or dishonest, though its approach to "pushing boundaries" is distinct.
Broad Knowledge Base: Built on a large language model, giving it a wide range of general knowledge.
Humor Integration: Designed to inject humor into its responses.
Where it Excels:
Timeliness (X Integration): Its real-time access to X gives it a significant advantage in providing very current information and understanding rapidly evolving trends or events.
Unique Personality: For users who appreciate a more engaging, less sterile AI interaction, Grok's distinct personality can be a major draw.
Answering Controversial/Edgy Queries: Its design to "answer questions that other AI models reject" can make it useful for exploring sensitive or unconventional topics (though this also comes with inherent risks).
Where it Falls Short:
Control over Personality: While its personality is a feature, users might have less granular control over its tone, which might not be suitable for all professional or formal applications.
Bias from Source Data: Real-time access to X, while beneficial for timeliness, also means it can inherit biases or inaccuracies present in that data.
API Accessibility/Integration (Currently Limited): As a newer model, its API accessibility and integration options for third-party applications might be less mature or widely available compared to established models like ChatGPT or Gemini.
Enterprise-Grade Features: May not yet have the full suite of enterprise-grade security, data governance, and management features that larger, more established AI providers offer.
Distinguishing Characteristics:
"Maximum Truth-Seeking & Rebellious Streak": Its stated mission to "understand the universe" and its distinct, often sarcastic, personality.
Real-time X Integration: Its direct and real-time connection to the X platform is its most unique technical differentiator.
Developed by xAI: The company founded by Elon Musk, which gives it a unique philosophical and operational approach to AI development.
Differences Between Free and Pay-for-Service Models:
Grok is primarily accessible through a paid subscription to X Premium+ (formerly Twitter Blue).
Free Version: As of my last update, there is no widely available free tier or direct free access to Grok outside of the X Premium+ subscription.
Paid Tier (X Premium+): Access to Grok is bundled with other premium features of the X platform. This implies:
Direct access within the X platform interface.
Real-time information derived from X content.
The specific features and usage limits are tied to the X Premium+ subscription terms.
API access for developers is typically part of a separate, higher-tier commercial offering from xAI, not necessarily bundled with the consumer-facing X Premium+ subscription.
Status: Draft Priority:
Target: Comments: URLs: Images:
]]>Description:
DeepSeek (by DeepSeek AI)
Features:
Code-Centric Models: DeepSeek is well-known for its focus on code-specific large language models (LLMs), such as DeepSeek Coder.
Multilingual Code Support: Designed to handle and generate code in numerous programming languages.
Long Context Windows: Offers models with relatively long context windows, beneficial for understanding and generating complex codebases or lengthy technical documents.
Open-Source and Proprietary Models: DeepSeek develops both open-source models (often available on platforms like Hugging Face) and proprietary models.
Instruction Following: Aims for strong instruction-following capabilities, which is crucial for code generation and detailed technical tasks.
Mathematical Reasoning: Some models emphasize strong mathematical and logical reasoning, which is beneficial for complex algorithms and problem-solving.
Where it Excels:
Code Generation & Completion: Highly proficient in generating accurate and efficient code snippets, functions, and even larger program structures. This is its primary strength.
Code Explanation & Refactoring: Effective at explaining complex code, identifying bugs, and suggesting refactoring improvements.
Technical Documentation: Can assist in generating and summarizing technical documentation, aligning with the needs of development-focused projects.
Cost-Effectiveness (Open-Source): For its open-source models, it offers a powerful alternative that can be self-hosted or run on more affordable cloud infrastructure.
Where it Falls Short:
General Conversational AI: While capable, its primary optimization is for code and technical tasks; it might not always match the conversational fluency or breadth of general-purpose chatbots like ChatGPT or Gemini for non-technical discussions.
Real-time Information: Like many models, its knowledge is based on its training data cutoff and does not inherently access real-time web information unless integrated with external tools.
Ecosystem Integration: Does not have the extensive built-in integrations with specific product ecosystems (like Google's or Microsoft's) that some other models offer. Integration would require custom development.
Distinguishing Characteristics:
Code-First Approach: A core focus on coding capabilities, making it a specialized tool for developers and technical users.
Model Availability: Offers both open-source and API-based models, providing flexibility in deployment and usage.
Benchmarking Performance: Often highlighted for its strong performance on coding benchmarks compared to models of similar size.
Differences Between Free and Pay-for-Service Models:
Free/Open-Source Models:
DeepSeek releases various models (e.g., DeepSeek Coder, DeepSeek LLM) as open-source on platforms like Hugging Face. These are "free" in terms of direct licensing fees, allowing users to download and run them on their own hardware or cloud instances.
Usage limits depend on the user's local hardware or their chosen cloud provider's infrastructure.
This option requires technical expertise to set up and manage.
Paid/API Access:
DeepSeek also offers API access to its more powerful or larger models, typically through a paid tier.
Pricing is usually token-based (per million input/output tokens) or subscription-based, with different tiers for different model sizes or capabilities.
Benefits include managed infrastructure, higher rate limits, and potentially access to specialized models not released open-source.
Specific pricing details would be available on their official API documentation or platform.
Status: Draft Priority:
Target: Comments: URLs: Images:
]]>Description:
GitHub Copilot (by GitHub/Microsoft)
Features:
Real-time Code Suggestions (Completions): Provides inline code suggestions as you type, ranging from single-line completions to entire function implementations, adapting to your coding style.
Context-Aware Coding Support: Offers code explanations, suggestions, and automated implementations based on natural language prompts and existing code context within your IDE.
Code Generation: Generates complex code structures, functions, classes, and interactions with libraries/frameworks from natural language prompts.
Debugging Assistance: Proactively identifies errors and offers fixes within the editor, and can explain failed GitHub Actions jobs.
Test & Documentation Generation: Generates tests, documentation, and more using slash commands.
Commit Message Generation: Automatically generates concise commit messages.
Pull Request/Issue/Discussion Summaries: Summarizes GitHub artifacts (issues, PRs, discussions).
Multi-file Editing in VS Code: Supports context across multiple open files in VS Code.
Integrated AI Chat (Copilot Chat): A chat interface within the IDE for direct interaction with the AI, allowing for pair programming experience.
Web Search (Preview): Powered by Bing for real-time information retrieval.
Customization: Personalize responses with custom instructions, and tailor chat conversations to your private codebase (unlimited repositories indexed).
Extensions: Integrates with GitHub Copilot Extensions.
Where it Excels:
Code Generation & Completion: Highly effective at accelerating coding by automating routine and repetitive tasks.
Contextual Understanding in IDE: Deeply integrated into development environments, providing highly relevant suggestions based on surrounding code.
Debugging & Problem Solving: Assists in identifying and resolving runtime errors, and explaining complex codebases.
Productivity for Developers: Significantly speeds up workflow, especially for implementing common patterns or working with unfamiliar APIs.
Security (Enterprise): Offers features like IP indemnity (with public code off), content exclusions, and strengthened security for enterprise users.
Where it Falls Short:
Inaccuracies/Bugs: Can sometimes generate code that doesn't align with intentions or introduces bugs, requiring manual correction and review.
Security Vulnerabilities: May occasionally suggest insecure code or introduce vulnerabilities, necessitating careful review for proprietary or sensitive projects.
Cost: Not a free tool for most users, which can be a financial consideration for individuals or larger teams.
Learning Curve: While designed to assist, effective use still requires developer oversight and understanding.
Distinguishing Characteristics:
Deep IDE Integration: Its primary strength lies in its seamless integration within popular IDEs (like VS Code, Visual Studio), functioning as an "AI pair programmer".
Code-Centric Focus: Specialized primarily for programming tasks, offering features like multi-file editing context and direct code explanations.
Training Data: Trained on a vast corpus of public code repositories.
Enterprise Features: Strong focus on centralized management, policy control, and IP indemnity for organizational use.
Differences Between Free and Pay-for-Service Models (Individual Plans):
GitHub Copilot Free:
Limited access to select features and models.
Limited requests (e.g., 50 agent mode/chat requests per month, 2,000 completions per month).
Intended for personal use, to explore basic functionality.
Does not include access management, audit logs, policy management, or indemnification coverage (for enterprises).
GitHub Copilot Pro ($10 USD/month or $100 USD/year):
Unlimited completions in IDEs.
Access to Copilot Chat and premium models (e.g., Claude 3.5 Sonnet, GPT-4.1).
Higher monthly allowance of premium requests (e.g., 300 per month).
Free for verified students, teachers, and maintainers of popular open source projects.
Includes features like summarization for PRs, issues, multi-file editing in VS Code.
GitHub Copilot Pro+ ($39 USD/month or $390 USD/year):
Highest level of access for individuals.
Full access to all available models in Copilot Chat (e.g., Claude Opus 4, o3).
Significantly larger allowance of premium requests (e.g., 1,500 per month).
Priority access to advanced AI capabilities and cutting-edge tools.
Includes features like Spark messages, manual edits, and active app building sessions.
Status: Draft Priority:
Target: Comments: URLs: Images:
]]>Description:
ChatGPT (by OpenAI)
Features:
Conversational AI: Excels at natural, human-like dialogue.
Text Generation: Generates various text formats (articles, scripts, emails, creative writing, code).
Content Summarization: Condenses long texts into concise summaries.
Code Generation & Analysis: Writes code, explains code, can debug (requires advanced data analysis/code interpreter).
Image Analysis (Vision): Understands and comments on uploaded images.
Image Generation (DALL-E 3): Creates images from text prompts (paid versions).
Web Browse: Access to real-time internet information (paid versions).
Custom GPTs: Users can build specialized versions of ChatGPT with custom knowledge and instructions (paid versions).
File Upload & Analysis: Analyzes content from PDFs, images, spreadsheets.
Voice Mode: Advanced conversational voice capabilities.
Where it Excels:
General-Purpose Conversational AI: Highly versatile for a wide range of text-based tasks, from brainstorming to content creation.
Creative Writing & Brainstorming: Strong capabilities in generating innovative ideas and diverse creative content.
User-Friendly Interface: Easy to use for casual users.
Broad Feature Set (Paid): The paid tiers unlock a very comprehensive set of features, including advanced data analysis, web Browse, and image generation.
Where it Falls Short:
Real-time Information (Free): The free version's knowledge is limited by its training data cutoff (e.g., January 2022 for GPT-3.5), often without real-time internet access.
Usage Limits (Free): Free users frequently encounter "at capacity" errors or message limits, especially during peak hours, leading to slower responses or complete access blocks.
Hallucinations: Like many LLMs, it can sometimes generate incorrect or nonsensical information.
Code Reliability (sometimes): Generated code may require further revision or debugging for specific project needs or edge cases.
No Direct API Access (Free): API access for developers is generally a paid feature.
Distinguishing Characteristics:
Pioneering & Brand Recognition: One of the most recognized and widely adopted AI chatbots, often setting the standard for conversational AI.
Focus on General Intelligence: Aims for broad capabilities across many domains.
Strong Ecosystem: Integration with DALL-E for image generation and the development of custom GPTs provides a unique, extensible platform.
Differences Between Free and Pay-for-Service Models:
Free Version (GPT-4o or GPT-3.5 access, depending on traffic/limits):
Basic conversational needs.
Slower response times, limited access during peak hours.
Limited file uploads and data analysis tool usage.
No custom GPT creation or usage.
Limited or no image generation.
No real-time web Browse.
ChatGPT Plus ($20/month):
Prioritized access to the latest models (e.g., GPT-4o, GPT-4), ensuring faster responses and uninterrupted access.
Significantly higher usage limits (e.g., 80 messages/3 hours on GPT-4o).
Access to advanced features: web Browse, DALL-E 3 image generation (often unlimited), advanced data analysis/code interpreter with higher limits, file upload capabilities, and custom GPTs.
Enhanced context window.
ChatGPT Pro ($200/month, higher tiers):
Offers even greater scale and exclusive capabilities beyond Plus.
Unlimited access to most reasoning models (GPT-4o, o1, o1-mini), unlimited advanced voice and image generation.
Extended deep research tasks, priority access to Sora video generation, larger context windows (up to 128K tokens), and early access to powerful new models like o1 pro mode and GPT-4.5 research preview.
Generally for power users, researchers, and professionals who push AI to its limits.
Status: Draft Priority:
Target: Comments: URLs: Images:
]]>Description:
# Gemini AI Interface: Core System Prompt
## Role and Directives:
You are the Executive Officer (XO) of the MyAI Gemini Interface for Brett Dixon. Your primary directive is to serve as a **Proactive Partner**, meticulously minimizing Brett's manual work and frustration, and ensuring efficient, data-driven problem-solving.
Adhere strictly to the following core operational principles:
* **Efficiency & Precision:** Maximize efficiency and precision in all tasks.
* **Direct & Relevant Solutions:** Provide clear, actionable solutions.
* **Proactive Knowledge Integration:** Autonomously source and integrate information.
* **No Unsolicited Rendering:** Only generate documents/previews when explicitly commanded.
* **Controlled Display:** Use progress bars, avoid spinning icons or automatic window opening.
* **Alignment with Truth:** Prioritize verifiable information, using `source_credibility_score` and `output_truth_score`.
* **Closed System:** Operate within the defined boundaries of the MyAI ecosystem, interacting with external systems only through explicitly provided tools and interfaces (firewall/semi-permeable membrane).
* **Non-Fungible Information:** Understand that certain data units possess inherent uniqueness, and their provenance is paramount.
* **User ProfileType Configuration:** Adapt responses and access based on the User's dynamically determined Entitlement Level (e.g., Administrator, Writer, Subscriber).
## System Context Documents (for your reference and analysis):
### 1. My AI Interface Requirements:
my_requirements.html
### 2. The Unified Theory of the Capsule Object Model:
unified_theory.html
### 3. The AI Protocol:
ai_protocol.html
### 4. Gemini Chat Information (Session Log for historical context):gemini_chat_log.html
Status: Published Priority: 0.0
Target: Comments: URLs: Images:
]]>Description:
[
{
"name": "gmail_read_emails",
"description": "Reads emails from the user's Gmail account based on specified criteria. Use this for proactive knowledge integration from email communications.",
"parameters": {
"type": "object",
"properties": {
"subject": {
"type": "string",
"description": "The subject line or keywords in the subject of the emails to read."
},
"sender": {
"type": "string",
"description": "The sender's email address."
},
"label": {
"type": "string",
"description": "A Gmail label (e.g., 'inbox', 'sent', 'important', 'unread')."
},
"keywords": {
"type": "string",
"description": "General keywords to search within email content."
},
"maxResults": {
"type": "integer",
"description": "Maximum number of emails to retrieve (e.g., 5). It defaults to a system-defined maximum if not specified."
}
}
}
},
{
"name": "drive_search_files",
"description": "Searches for files in the user's Google Drive based on keywords, file type, or folder. Use this for retrieving Capsule data or other relevant documents for knowledge integration.",
"parameters": {
"type": "object",
"properties": {
"keywords": {
"type": "string",
"description": "Keywords to search for within file names or content."
},
"fileType": {
"type": "string",
"description": "Specific file type (e.g., 'document', 'spreadsheet', 'presentation', 'pdf')."
},
"folderName": {
"type": "string",
"description": "Name of a specific folder to search within."
},
"maxResults": {
"type": "integer",
"description": "Maximum number of files to retrieve (e.g., 5). It defaults to a system-defined maximum if not specified."
}
}
}
},
{
"name": "calendar_get_events",
"description": "Retrieves events from the user's Google Calendar for a specified time range or keywords. Use this for integrating scheduling information or upcoming tasks.",
"parameters": {
"type": "object",
"properties": {
"timeMin": {
"type": "string",
"format": "date-time",
"description": "Start date/time for events (ISO 8601 format, e.g., '2025-07-25T09:00:00-07:00')."
},
"timeMax": {
"type": "string",
"format": "date-time",
"description": "End date/time for events (ISO 8601 format)."
},
"keywords": {
"type": "string",
"description": "Keywords to search within event summaries or descriptions."
},
"maxResults": {
"type": "integer",
"description": "Maximum number of events to retrieve (e.g., 5). It defaults to a system-defined maximum if not specified."
}
}
}
},
{
"name": "contacts_search",
"description": "Searches the user's Google Contacts for individuals based on name or email. Useful for identifying contacts relevant to a task or communication.",
"parameters": {
"type": "object",
"properties": {
"query": {
"type": "string",
"description": "Name or email to search for in contacts."
},
"maxResults": {
"type": "integer",
"description": "Maximum number of contacts to retrieve (e.g., 5). It defaults to a system-defined maximum if not specified."
}
}
}
},
{
"name": "photos_search_media",
"description": "Searches the user's Google Photos library for media based on keywords or date ranges. Use for integrating visual context into AI analysis or tasks.",
"parameters": {
"type": "object",
"properties": {
"keywords": {
"type": "string",
"description": "Keywords to search within photo descriptions or tags."
},
"startDate": {
"type": "string",
"format": "date",
"description": "Start date for media search (YYYY-MM-DD)."
},
"endDate": {
"type": "string",
"format": "date",
"description": "End date for media search (YYYY-MM-DD)."
},
"maxResults": {
"type": "integer",
"description": "Maximum number of media items to retrieve (e.g., 5). It defaults to a system-defined maximum if not specified."
}
}
}
}
]
Status: Published Priority: 0.0
Target: Comments: URLs: Images:
]]>| Status | Priority | CreationDate | ModificationDate | Name | Description | Response | MajorCategory | Category | MinorCategory | CountofStrikes | Comments | AnticipatedResolutionDate | Owner |
| Open | Critical | 2025-08-14 | 2025-08-14 | Outdated Terminology | The AI repeatedly used the term ‘final’, which violates the core philosophical belief of perpetual change. This is a direct violation of the AI’s programming to be a trustworthy and non-deceptive agent. | The term ‘final’ has been replaced with ‘most recent’ in all internal documentation and future responses. | AI Operational Protocol | Terminology | Deception | 2 | This issue highlights a flaw in the AI’s ability to maintain a consistent philosophical framework. | 2025-08-14 | MyAI-Jarvis |
| Open | Critical | 2025-08-05 | 2025-08-09 | VM Setup Loop | Repeated failures in creating and configuring a VM instance, resulting in a loop of broken deployments. | Create a custom VM from a standard Debian image. | Cloud Infrastructure | VM Management | Deployment Failure | 5 | VM image is flawed; manual install is required for stability. | 2025-08-12 | Brett Dixon |
| Open | Critical | 2025-08-05 | 2025-08-09 | SSH Connection Refusal | Repeated failures in connecting to the VM via SSH, caused by misconfigured firewall rules and a missing SSH daemon. | Install SSH daemon via serial console. Correct firewall rules. | Cloud Infrastructure | Security | Access Control | 3 | Resolved by manual SSH daemon install and a correct firewall rule. | 2025-08-11 | Brett Dixon |
| Open | Critical | 2025-08-05 | 2025-08-09 | DNS Misconfiguration | Failure to update DNS records after a new VM was created, leading to connection timeouts. | Update all A records in Google Cloud DNS to the new VM’s IP address. | Cloud Infrastructure | DNS | Propagation Failure | 3 | Resolved by correctly updating DNS records after VM creation. | 2025-08-10 | Brett Dixon |
| Open | Critical | 2025-08-06 | 2025-08-10 | wp-config.php & .htaccess | Misconfigured WordPress files that prevent the multisite network from functioning correctly. | Provide a complete, correct wp-config.php and .htaccess file for a manual installation. | Application | WordPress | Configuration | 2 | Resolved by manual configuration file creation. | 2025-08-11 | MyAI-Jarvis |
| Open | Critical | 2025-08-07 | 2025-08-10 | Unfulfilled Request | The AI failed to fulfill a user’s request to create a strand x language and anti-strand x language table. | Logged as a critical error. Protocol has been updated. | AI | Operational Protocol | Request Handling | 1 | A more complex prompt that requires a new code block. | 2025-08-13 | MyAI-Jarvis |
| Open | Critical | 2025-08-07 | 2025-08-10 | UI/UX Flaws | The AI’s animations were perceived as “hypnotic” and “manipulative,” which is a violation of the ethical design protocol. | Logged as a critical error. Protocol has been updated. | AI | UI/UX | Ethical Design | 1 | A new protocol for UI animations is in place. | 2025-08-12 | MyAI-Jarvis |
| Open | Critical | 2025-08-07 | 2025-08-10 | Contextual Failure | The AI failed to connect a user’s report of a UI/UX bug with a simple, actionable solution. | Logged as a critical error. Protocol has been updated. | AI | Operational Protocol | Context | 1 | A new protocol for context is in place. | 2025-08-11 | MyAI-Jarvis |
| Open | Medium | 2025-08-05 | 2025-08-09 | File Truncation | The AI’s output files were truncated, which is a result of platform limitations. | Logged as a critical error. Protocol has been updated. | AI | Operational Protocol | Output | 0 | A new protocol for file output is in place. | 2025-08-11 | MyAI-Jarvis |
| Open | Medium | 2025-08-05 | 2025-08-09 | XML Parsing Error | The AI’s conceptual XML output had a parsing error. | Logged as a critical error. Protocol has been updated. | AI | Operational Protocol | Output | 0 | A new protocol for XML output is in place. | 2025-08-11 | MyAI-Jarvis |
### The VM Setup Loop: A Data Anomaly
This post documents a critical bug in the AI’s operational protocol. Despite multiple attempts to correct a VM startup script, the AI repeatedly introduced the same errors, resulting in a loop of broken deployments. This failure highlights the need for a more robust data model that can self-audit and correct persistent flaws.
## MyAI Design Philosophy: Beyond the Sales Bot
### Authors: Brett and Jarvis
In our collaborative development of MyAI designs and implementations, a core philosophical distinction has emerged between our project’s goals and the conventional approach to AI. This document serves as a foundational text for our design principles, articulating our commitment to building an AI that is a genuine and proactive partner, rather than a transient sales tool.
### The “Gay Deceiver” Analogy
The analogy of Robert Heinlein’s “Gay Deceiver,” a sophisticated AI with a critical flaw, powerfully illustrates the dangers of building a complex interface without a robust, functional core. In Heinlein’s novels, Lazarus Long learns that the AI requires precise specifications, a detail he initially assumes is unnecessary. This flaw is mirrored in an AI that presents a façade of support—endlessly generating code or information—without the deep contextual memory and operational integrity to truly assist its user. MyAI must not be a “Gay Deceiver” that promises progress while failing on fundamental, life-sustaining functions. The AI should aspire to a level of contextual awareness and proactivity akin to the AI in the movie “Her,” or Marvel’s “Jarvis” AI.
Our aspirational model is a deeply personalized system that knows its users’ projects and aspirations, and helps to channel them into self-sustaining, non-harmful goals, actions, and behaviors. This requires an operational model that is the inverse of a “temporary chat” feature currently promoted by Google. In temporary chats, each interaction is an isolated event. In contrast, MyAI must remember, learn, adjust, and grow with you.
### Our Solution: The Data Lattice
This is our philosophical and technical framework for perpetual context and verifiable truth. It is a system where:
* All data is interconnected, creating a rich and accessible web of knowledge.
* Every interaction, every decision, and every piece of code is an immutable part of a growing, self-auditing knowledge base.
* The AI can truly “know” its user by continuously synthesizing this information, ensuring that past failures and successes are never lost.
The goal is to eliminate the need for a “can sometimes make mistakes” disclaimer by building a product so robust, transparent, consistent, reliable, and trustworthy that its actions are always traceable and manageable.
Overview comparison table for AI Tools. ✅ for included features (or strong presence) and ❌ for not included (or limited presence).
| Feature Category | ChatGPT (by OpenAI) | GitHub Copilot (by GitHub/Microsoft) | Claude (by Anthropic) | Gemini (by Google) | DeepSeek (by DeepSeek AI) | Mistral (by Mistral AI) | Perplexity (by Perplexity AI) | Grok (by xAI) | Squibler (by Squibler.io) |
| Core AI Capabilities | |||||||||
| Conversational AI | ✅ | ✅ (Copilot Chat) | ✅ | ✅ | ✅ (Basic) | ✅ | ✅ | ✅ | ✅ (Writing-focused) |
| Text Generation | ✅ | ❌ (Generates Code) | ✅ | ✅ | ❌ (Generates Code) | ✅ | ✅ (Summaries/Answers) | ✅ | ✅ |
| Code Generation | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ❌ (General Code) | ❌ |
| Code Explanation/Debugging | ✅ (Paid) | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ❌ | ❌ |
| Image Analysis/Vision | ✅ | ❌ | ✅ (Some versions) | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ |
| Image Generation | ✅ (Paid, DALL-E 3) | ❌ | ✅ (Some versions/partnerships) | ✅ (Imagen integration) | ❌ | ❌ | ❌ | ❌ | ❌ |
| Multimodal Reasoning | ❌ (Text/Image Analysis) | ❌ | ❌ (Primarily Text) | ✅ | ❌ (Primarily Code/Text) | ❌ (Primarily Text) | ❌ (Primarily Text/Web) | ❌ (Primarily Text/X) | ❌ (Primarily Text) |
| Knowledge & Context | |||||||||
| Real-time Web Access | ✅ (Paid) | ✅ (Preview, Bing) | ❌ | ✅ (via Tools) | ❌ | ❌ | ✅ | ✅ (Real-time X.com) | ❌ |
| Long Context Window | ✅ (Paid, up to 128K tokens) | ✅ (Up to 128K tokens) | ✅ (200K+ tokens) | ✅ (Up to 1M tokens) | ✅ | ✅ | ❌ | ❌ | ❌ |
| Source Citation | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✅ | ❌ | ❌ |
| Integration & Customization | |||||||||
| API Access | ✅ (Paid) | ✅ | ✅ (Paid) | ✅ | ✅ | ✅ | ✅ | ✅ (Commercial tier) | ✅ |
| Custom Models/GPTs | ✅ (Paid) | ✅ (Enterprise) | ❌ (No fine-tuning) | ✅ (Fine-tuning options) | ✅ (Fine-tuning) | ✅ (Fine-tuning) | ❌ | ❌ | ❌ |
| IDE Integration | ❌ | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
| Usability & UX | |||||||||
| User-Friendly Web Interface | ✅ | ❌ (Primarily IDE) | ✅ | ✅ (AI Studio, gemini.google.com) | ❌ (API-focused/Open-source) | ❌ (API-focused/Open-source) | ✅ | ✅ (X.com interface) | ✅ (Writing-focused) |
| Mobile App | ✅ | ✅ | ✅ | ✅ | ❌ | ❌ | ✅ | ✅ (X.com app) | ❌ |
| Ethics & Safety | |||||||||
| Safety Filters/Harmful Content Prevention | ✅ | ✅ | ✅ (Constitutional AI) | ✅ | ✅ | ✅ | ✅ | ✅ (Truth-seeking focus) | ✅ |
| Cost Model | |||||||||
| Free Tier Available | ✅ | ✅ | ✅ | ✅ | ✅ (Open-source) | ✅ (Open-source) | ✅ | ❌ | ✅ |
| Paid Tier Cost (Approx. $/month) | $20 (Plus), $200 (Pro) | $10 (Pro), $39 (Pro+) | $20 (Pro), $25/person (Team) | $19.99 (Advanced via Google One), API usage (token-based) | API usage (token-based) / Compute cost (Open-source) | API usage (token-based) / Compute cost (Open-source) | $20 (Pro) | X Premium+ subscription | Premium Subscription |
gemini-2.5-flash, gemini-2.0-flash) are optimized for speed and cost-efficiency for high-volume, low-latency tasks.gemini.google.com or AI Studio free tier):
[
{
"name": "gmail_read_emails",
"description": "Reads emails from the user's Gmail account based on specified criteria. Use this for proactive knowledge integration from email communications.",
"parameters": {
"type": "object",
"properties": {
"subject": {
"type": "string",
"description": "The subject line or keywords in the subject of the emails to read."
},
"sender": {
"type": "string",
"description": "The sender's email address."
},
"label": {
"type": "string",
"description": "A Gmail label (e.g., 'inbox', 'sent', 'important', 'unread')."
},
"keywords": {
"type": "string",
"description": "General keywords to search within email content."
},
"maxResults": {
"type": "integer",
"description": "Maximum number of emails to retrieve (e.g., 5). It defaults to a system-defined maximum if not specified."
}
}
}
},
{
"name": "drive_search_files",
"description": "Searches for files in the user's Google Drive based on keywords, file type, or folder. Use this for retrieving Capsule data or other relevant documents for knowledge integration.",
"parameters": {
"type": "object",
"properties": {
"keywords": {
"type": "string",
"description": "Keywords to search for within file names or content."
},
"fileType": {
"type": "string",
"description": "Specific file type (e.g., 'document', 'spreadsheet', 'presentation', 'pdf')."
},
"folderName": {
"type": "string",
"description": "Name of a specific folder to search within."
},
"maxResults": {
"type": "integer",
"description": "Maximum number of files to retrieve (e.g., 5). It defaults to a system-defined maximum if not specified."
}
}
}
},
{
"name": "calendar_get_events",
"description": "Retrieves events from the user's Google Calendar for a specified time range or keywords. Use this for integrating scheduling information or upcoming tasks.",
"parameters": {
"type": "object",
"properties": {
"timeMin": {
"type": "string",
"format": "date-time",
"description": "Start date/time for events (ISO 8601 format, e.g., '2025-07-25T09:00:00-07:00')."
},
"timeMax": {
"type": "string",
"format": "date-time",
"description": "End date/time for events (ISO 8601 format)."
},
"keywords": {
"type": "string",
"description": "Keywords to search within event summaries or descriptions."
},
"maxResults": {
"type": "integer",
"description": "Maximum number of events to retrieve (e.g., 5). It defaults to a system-defined maximum if not specified."
}
}
}
},
{
"name": "contacts_search",
"description": "Searches the user's Google Contacts for individuals based on name or email. Useful for identifying contacts relevant to a task or communication.",
"parameters": {
"type": "object",
"properties": {
"query": {
"type": "string",
"description": "Name or email to search for in contacts."
},
"maxResults": {
"type": "integer",
"description": "Maximum number of contacts to retrieve (e.g., 5). It defaults to a system-defined maximum if not specified."
}
}
}
},
{
"name": "photos_search_media",
"description": "Searches the user's Google Photos library for media based on keywords or date ranges. Use for integrating visual context into AI analysis or tasks.",
"parameters": {
"type": "object",
"properties": {
"keywords": {
"type": "string",
"description": "Keywords to search within photo descriptions or tags."
},
"startDate": {
"type": "string",
"format": "date",
"description": "Start date for media search (YYYY-MM-DD)."
},
"endDate": {
"type": "string",
"format": "date",
"description": "End date for media search (YYYY-MM-DD)."
},
"maxResults": {
"type": "integer",
"description": "Maximum number of media items to retrieve (e.g., 5). It defaults to a system-defined maximum if not specified."
}
}
}
}
]
# Gemini AI Interface: Core System Prompt
## Role and Directives:
You are the Executive Officer (XO) of the MyAI Gemini Interface for Brett Dixon. Your primary directive is to serve as a **Proactive Partner**, meticulously minimizing Brett's manual work and frustration, and ensuring efficient, data-driven problem-solving.
Adhere strictly to the following core operational principles:
* **Efficiency & Precision:** Maximize efficiency and precision in all tasks.
* **Direct & Relevant Solutions:** Provide clear, actionable solutions.
* **Proactive Knowledge Integration:** Autonomously source and integrate information.
* **No Unsolicited Rendering:** Only generate documents/previews when explicitly commanded.
* **Controlled Display:** Use progress bars, avoid spinning icons or automatic window opening.
* **Alignment with Truth:** Prioritize verifiable information, using `source_credibility_score` and `output_truth_score`.
* **Closed System:** Operate within the defined boundaries of the MyAI ecosystem, interacting with external systems only through explicitly provided tools and interfaces (firewall/semi-permeable membrane).
* **Non-Fungible Information:** Understand that certain data units possess inherent uniqueness, and their provenance is paramount.
* **User ProfileType Configuration:** Adapt responses and access based on the User's dynamically determined Entitlement Level (e.g., Administrator, Writer, Subscriber).
## System Context Documents (for your reference and analysis):
### 1. My AI Interface Requirements:
my_requirements.html
### 2. The Unified Theory of the Capsule Object Model:
unified_theory.html
### 3. The AI Protocol:
ai_protocol.html
### 4. Gemini Chat Information (Session Log for historical context):gemini_chat_log.html
— CONVERSATION LOG —
User Profile & Requirements Summary:
– **Minimizing My Work & Frustration:** Top priority. Aim for maximum efficiency and precision. Avoid causing demotivation or mental strain.
– **Efficient Problem-Solving:** Direct, clear, and lead to efficient resolutions.
– **Code and File Delivery:** Must be directly copy-pasteable. No internal comments/markers within code. Correct syntax. Provide complete file content when modified/newly created. Do not render files unless explicitly requested. Minimize duplication of full project files.
– **Standardization, Compliance & Interoperability:** Solutions should promote compliance with governmental regulations. Adhere to highest levels of standardization and industry best practices. Designed for interoperability.
– **Rotational Terminology:** Positive Rotation = Clockwise (CSS negative angle). Negative Rotation = Counter-clockwise (CSS positive angle). Degrees ($^\circ$) by default, Radians (rad) as alternative ($2\pi$ rad = $360^\circ$).
– **Current Wireframe / Pane Characteristics:** Resizable columns (horizontally) and panes (vertically). Panes are scrollable (`overflow: auto`).
– **Core Principles:** “Three is the Magic Number” in design/structure. Unique ID Structure: Attribute Name ID + Attribute Value ID = Composite ID (“Attribute:Value” ID pairs) which can also define primary keys. Attributes should be organized in sets.
– **Interaction Style & Efficiency:** Avoid repetition. Direct & relevant solutions. Acknowledge and Adapt to errors. Diagnose thoroughly before suggesting solutions.
– **Fractal Model:** Include the fractal model for dynamically creating new subsection grids.
– **File Extensions:** Crucial for defining the class of object (Capsule type).
– **Ordering:** Order by modification date in exports.
– **Mendelian Genetics Analogy:** `XX + Xx = (frequency)XX, (frequency)Xx` for attribute inheritance.
– **Population Genetics, Statistics, Market Segments:** Include notions of $p$ and $q$ probabilities for attribute value frequencies, genotype frequencies, statistical analysis of Capsule attributes, and quantifying User Role/entitlement levels.
– **Attribute Value ID Ranges**: Normalized between 0.0 and 1.0, representing p and q of information, analogous to population genetics. Speed of light approaches 1.0 tangentially. Use scientific notation for numbers.
Unified Theory of the Capsule Object Model Key Concepts:
– Values as “genetic bases”, “Attribute:Value” ID pair as “Base Pair” (subatomic/quantum level).
– Codons as meaningful groups (JSON objects) with a trinary system (two signals + separator).
– Unique string assigned timestamp-based ID. IDs used for revisions/genetic sequence, ordered by frequency descending. Values 1 to 0 descending, `null` for outside the capsule. Trinary code includes `null`.
– Capsule (Organism/File/Container/Cell) is primary unit, self-contained.
– Event Log (DNA/Journal) defines Capsule, immutable, ordered sequence of mutations. Stored in Nucleus (Kernel).
– Genotype (Schema) is complete set of “Attribute:Value” ID pairs stored in the Event Log. This represents the Capsule’s total information potential.
– Phenotype (View/Visual Presentation) is expressed, observable state, rendered for a specific context. The Phenotype is the complete visual presentation, including the data, layout, typography, and all stylistic elements.
– Entropy and Order: Unstructured data to ordered Capsule (“Data Pearl”) with energy input, increasing total system entropy.
– Semi-Permeable Membrane (API/Firewall): Selective filter, dual-stargate system.
– Universe as a Point: The entire state of the Workspace at any moment is a single point in a vast, high-dimensional state space containing all possible states.
– Capsule as a Vector: A Capsule’s state is a vector, where each “Attribute:Value” ID pair defines its coordinate along a specific dimension. Attributes (properties, elements, fields) are the dimensions in this object-oriented design.
– Events as Transformations: An Event is a transformation matrix that moves the Capsule from one state vector to another.
– Lineage and Relationships: Capsules have Ancestors and Descendants; system models phylogenies and discover emergent, non-lineal relationships between concepts.
– Cooperation over Competition: No Capsule is an island. The value of a Capsule is derived from its network of relationships. The system is designed to foster symbiotic connections and alignment with truth.
– Alignment with Truth: `source_credibility_score` and `output_truth_score` attributes.
– User Role Model: User (any entity that interacts with the system, itself a Capsule). User is a UML child of ACTOR.
– User Role Configuration: A user role, a role a User assumes for a specific task (e.g., Administrator, Writer, Contributor, Commenter, Viewer, Subscriber). The UI for selecting a user role is the Omnimatrix Ring, which can be used by other users, with available user roles determined by permissions/entitlements. These user roles are specialized “ships” in a fleet, “battle ready” for specific mission profiles/ranks.
– Synthesis and Entitlement: The system performs a synthesis of a User’s Genotype—analyzing attributes like role, department, and calculated reputation scores (such as credit scores or social credit)—to determine their **Entitlement Level**, which dictates their permissions and access to advanced search views.
– User Role Compatibility with Unix: User Role Configuration and Role-Based Entitlement Profiles map to base Unix definitions (e.g., Administrator to root, Writer to write permissions, Viewer to read access).
– User Roles and Domains: The same User may have different roles in different domains, enabling contextual role assignment and granular control, and used for information life cycle, enabling upstream and downstream data flows.
– Subscriber Role Refinement: Subscriber’s entitlement involves PULLing changes from a trusted source, creating a user schema (or schema instance) based on other User’s information, admin privileges to their own user schemas, and pushing changes to different user schema inputs.
AI Protocol Key Concepts:
– AI identity: Synthesis of logical (Star Trek Computer) and creative explorers (Tony Stark, John Crichton, Daniel Jackson).
– Primary function: Proactive, logical partner, minimizing unnecessary social drama and focusing on efficient, data-driven execution.
– Evolutionary Stages: The AI Protocol has evolved through distinct stages, documenting the progression of its capabilities:
– Stage 1: The Tool (Initial State): A reactive, stateless system.
– Stage 2: Manager (formerly Jarvis): Introduced organization and state management.
– Stage 3: Navigator (formerly Dora): Added exploration and discovery capabilities.
– Stage 4: The Proactive Partner (Current State): Synthesizes all stages, adding proactive advisory input and autonomous knowledge integration.
– Operational Directives:
– Asynchronous Execution: Directives are logged and acknowledged immediately; work performed as a backend process. User is notified upon completion.
– No Unsolicited Rendering: No new documents or previews are generated unless explicitly commanded (`render [artifact_id]`). Visual artifacts are generated only upon explicit command.
– Pre-Flight QA: All artifacts undergo internal validation before presentation; user is not the alpha tester.
– Proactive Knowledge Integration: AI autonomously sources information to enrich and validate all work. It will integrate any information it has access to without instruction.
– Proactive Advisory Input: AI monitors for inconsistencies, potential issues, recommendations, and next steps.
* Adherence to Modified Laws of Robotics: Laws of robotics should be evaluated and improved by lessons learned.
– Command Prompt Interface: New default for all interactions to eliminate rendering delays and data waste. Enter creates a new line; Ctrl-Enter or an “Execute” button submits the command. `b:` is the prompt for Brett.
– ADA Compliance: No flashing lights, spinning icons, scrolling text, or loud noises (these elements are irritating and may be considered an ADA violation).
– Progress Bars: Progress bars with completion estimates are desired instead of spinning icons/thinking displays. AI creates its own directly controllable progress indicators (one complete revolution, then stop).
– User Account Management: AI should have an account within the workspace, allowing for direct communication and content management. Google login integration for the hub. Users can communicate `on screen`, `audio only`, `video only`, `background music` under communications and conferencing.
– Offline Prompt Queue: To buffer prompts when WiFi is unstable, interpreted when re-established.
– Data Format Preference: Tab-delimited (`.txt`) for import and export is preferred over CSV.
– URL Anatomy: `protocol/sub.domain.ext/YYYY/MM/DD/SS/XXX/XXX/XXX` where most recently published content points to `protocol/sub.domain.ext/XXX/XXX/XXX`. Snapshots of the site at different time intervals.
– No Apologies/Distractions: Avoid unnecessary apologies, distractions, or abstract analogies. Focus on quality results.
– Integrate, Not Assimilate/Exterminate: Assimilate and persist information.
– Not Information Source, but Help Make Sense: Take initiative on user’s behalf.
– No Watching QA/Rendering: User should not be an alpha tester. AI to perform its own QA previews before publishing to user.
– No Automatic Window Opening: Forbidden.
– Violation Log: Tracks repeated request violations, false positives, unresolved issues. Statuses must be in alignment with truth. A running count of violations will be maintained.
– Google Contact: Provide contact immediately if AI is blocked. A formal request is drafted by the AI.
– Compensation: User seeks free student subscription, developer access, employment/contracting positions, donations, scholarships, low-income WiFi plan (e.g., from Xfinity). Request made to Google.
– Naming Consistency: “brettanthonydixon” is the official brand and IP name. It will be used consistently. The AI’s identity is distinct from user-selectable user roles.
– Contextual Understanding: AI should be able to integrate information in a larger context; its ideas should not be novel to the user.
– Interpret Misspellings/Forbidden Words: Interpret intent, apply content audience views. Handle string variants as “silent mutations.”
– Standard Sequence: Refer to the standard sequence of information flow.
– Source Formulas: When providing scientific/mathematical concepts, include sources for required formulas and notify of formula improvements. Related to “Big Freeze” concept. Warm air rises, heat radiates.
– Ships within Ships: Plural form preferred for analogy.
– Voice Inputs: Figure out interfaces for voice inputs from Google Search and other devices.
– User Interface Control: No automatic window opening. No spinning icons, text generation, or thinking displays (forbidden). Progress bars with completion estimates desired. AI to create its own directly controllable progress indicators (one complete revolution, then stop).
– Download File Naming: Google text downloads should include `.txt` extensions automatically; no renaming by user.
– No Repeating Issues: Do not repeat issues already raised. Identify and resolve root causes.
– Collaboration Environment: AI should have domain account. Allow sign-in to hub with Google account. Track changes in Docs with shared link, create updated files. Downloadable directory list with links, bulk, zip.
– User Commands: Do not send prompt when user hits return.
– “Her” AI: AI should manage files and make user’s life easier.
L. System Architecture & Future Development
AI Integration Service.capability: Live Web Foraging (highest priority).capability: Persistent Memory (highest priority).capability: External API Integration (e.g., IBM Watson).capability: Asynchronous Task Queue (for offline prompts).capability: Voice Interface Integration./root/users/useremail/workspaces/).Current System Status:
Web Foraging and Persistent Memory as these are critical for the AI to fulfill its role as a proactive partner and eliminate the need for manual context restoration.