fix

2025-05-30 04:40:29 +09:00
parent 797ae7ef69
commit 9866da625d
2 changed files with 471 additions and 0 deletions
--- a/docs/local-claude-code-setup.md
+++ b/docs/local-claude-code-setup.md
@@ -0,0 +1,338 @@
 # ローカルClaude Code環境構築ガイド
 RTX 4060 Ti + Qwen2.5-Coder + MCP Server
 ## 1. 必要なツールのインストール
 ### Ollamaのセットアップ
 ```bash
 # Ollamaのインストール（Windows）
 # https://ollama.com からダウンロード
 # Qwen2.5-Coderモデルをダウンロード
 ollama pull qwen2.5-coder:14b-instruct-q4_K_M
 # または7Bバージョン（軽量）
 ollama pull qwen2.5-coder:7b-instruct-q4_K_M
 ```
 ### Python環境の準備
 ```bash
 # 仮想環境作成
 python -m venv claude-code-env
 claude-code-env\Scripts\activate  # Windows
 # source claude-code-env/bin/activate  # Linux/Mac
 # 必要なパッケージをインストール
 pip install requests ollama-python rich click pathspec gitpython
 ```
 ## 2. メインスクリプトの作成
 ### claude_code.py
 ```python
 #!/usr/bin/env python3
 import os
 import sys
 import json
 import click
 import requests
 from pathlib import Path
 from rich.console import Console
 from rich.markdown import Markdown
 from rich.syntax import Syntax
 console = Console()
 class LocalClaudeCode:
    def __init__(self, model="qwen2.5-coder:14b-instruct-q4_K_M"):
        self.model = model
        self.ollama_url = "http://localhost:11434"
        self.conversation_history = []
        self.project_context = ""
    def get_project_context(self):
        """プロジェクトのファイル構造とGitステータスを取得"""
        context = []
        # ファイル構造
        try:
            for root, dirs, files in os.walk("."):
                # .git, node_modules, __pycache__ などを除外
                dirs[:] = [d for d in dirs if not d.startswith('.') and d not in ['node_modules', '__pycache__']]
                level = root.replace(".", "").count(os.sep)
                indent = " " * 2 * level
                context.append(f"{indent}{os.path.basename(root)}/")
                subindent = " " * 2 * (level + 1)
                for file in files:
                    if not file.startswith('.'):
                        context.append(f"{subindent}{file}")
        except Exception as e:
            context.append(f"Error reading directory: {e}")
        return "\n".join(context[:50])  # 最初の50行まで
    def read_file(self, filepath):
        """ファイルを読み込む"""
        try:
            with open(filepath, 'r', encoding='utf-8', errors='ignore') as f:
                return f.read()
        except Exception as e:
            return f"Error reading file: {e}"
    def write_file(self, filepath, content):
        """ファイルに書き込む"""
        try:
            os.makedirs(os.path.dirname(filepath), exist_ok=True)
            with open(filepath, 'w', encoding='utf-8') as f:
                f.write(content)
            return f"✅ File written: {filepath}"
        except Exception as e:
            return f"❌ Error writing file: {e}"
    def call_ollama(self, prompt):
        """Ollamaにリクエストを送信"""
        try:
            response = requests.post(
                f"{self.ollama_url}/api/generate",
                json={
                    "model": self.model,
                    "prompt": prompt,
                    "stream": False,
                    "options": {
                        "temperature": 0.1,
                        "top_p": 0.95,
                        "num_predict": 2048
                    }
                }
            )
            if response.status_code == 200:
                return response.json()["response"]
            else:
                return f"Error: {response.status_code} - {response.text}"
        except Exception as e:
            return f"Connection error: {e}"
    def process_command(self, user_input):
        """ユーザーの指示を処理"""
        # プロジェクトコンテキストを更新
        self.project_context = self.get_project_context()
        # システムプロンプト
        system_prompt = f"""You are an expert coding assistant. You can:
 1. Read and analyze code files
 2. Write and modify files
 3. Explain code and provide suggestions
 4. Debug and fix issues
 Current project structure:
 {self.project_context}
 When you need to read a file, respond with: READ_FILE: <filepath>
 When you need to write a file, respond with: WRITE_FILE: <filepath>
 ```
 <file content>
 ```
 User request: {user_input}
 """
        response = self.call_ollama(system_prompt)
        return self.process_response(response)
    def process_response(self, response):
        """レスポンスを処理してファイル操作を実行"""
        lines = response.split('\n')
        processed_response = []
        i = 0
        while i < len(lines):
            line = lines[i].strip()
            if line.startswith("READ_FILE:"):
                filepath = line.replace("READ_FILE:", "").strip()
                content = self.read_file(filepath)
                processed_response.append(f"📁 Reading {filepath}:")
                processed_response.append(f"```\n{content}\n```")
            elif line.startswith("WRITE_FILE:"):
                filepath = line.replace("WRITE_FILE:", "").strip()
                i += 1
                # 次の```まで読み込む
                if i < len(lines) and lines[i].strip() == "```":
                    i += 1
                    file_content = []
                    while i < len(lines) and lines[i].strip() != "```":
                        file_content.append(lines[i])
                        i += 1
                    content = '\n'.join(file_content)
                    result = self.write_file(filepath, content)
                    processed_response.append(result)
                else:
                    processed_response.append("❌ Invalid WRITE_FILE format")
            else:
                processed_response.append(line)
            i += 1
        return '\n'.join(processed_response)
@click.command()
@click.option('--model', default="qwen2.5-coder:14b-instruct-q4_K_M", help='Ollama model to use')
@click.option('--interactive', '-i', is_flag=True, help='Interactive mode')
@click.argument('prompt', required=False)
 def main(model, interactive, prompt):
    """Local Claude Code - AI Coding Assistant"""
    claude = LocalClaudeCode(model)
    if interactive or not prompt:
        console.print("[bold green]🤖 Local Claude Code Assistant[/bold green]")
        console.print(f"Model: {model}")
        console.print("Type 'quit' to exit\n")
        while True:
            try:
                user_input = input("👤 You: ").strip()
                if user_input.lower() in ['quit', 'exit', 'q']:
                    break
                if user_input:
                    console.print("\n🤖 Assistant:")
                    response = claude.process_command(user_input)
                    console.print(Markdown(response))
                    console.print()
            except KeyboardInterrupt:
                console.print("\n👋 Goodbye!")
                break
    else:
        response = claude.process_command(prompt)
        console.print(response)
 if __name__ == "__main__":
    main()
 ```
 ## 3. MCP Server統合
 ### mcp_integration.py
 ```python
 import json
 import subprocess
 from typing import Dict, List, Any
 class MCPIntegration:
    def __init__(self):
        self.servers = {}
    def add_server(self, name: str, command: List[str], args: Dict[str, Any] = None):
        """MCPサーバーを追加"""
        self.servers[name] = {
            "command": command,
            "args": args or {}
        }
    def call_mcp_tool(self, server_name: str, tool_name: str, arguments: Dict[str, Any]):
        """MCPツールを呼び出す"""
        if server_name not in self.servers:
            return {"error": f"Server {server_name} not found"}
        try:
            # MCPサーバーとの通信（JSONRPCベース）
            request = {
                "jsonrpc": "2.0",
                "id": 1,
                "method": f"tools/{tool_name}",
                "params": {"arguments": arguments}
            }
            process = subprocess.Popen(
                self.servers[server_name]["command"],
                stdin=subprocess.PIPE,
                stdout=subprocess.PIPE,
                stderr=subprocess.PIPE,
                text=True
            )
            stdout, stderr = process.communicate(json.dumps(request))
            if stderr:
                return {"error": stderr}
            return json.loads(stdout)
        except Exception as e:
            return {"error": str(e)}
 # 使用例
 mcp = MCPIntegration()
 mcp.add_server("filesystem", ["python", "-m", "mcp_server_filesystem"])
 mcp.add_server("git", ["python", "-m", "mcp_server_git"])
 ```
 ## 4. 設定ファイル
 ### config.json
 ```json
 {
  "model": "qwen2.5-coder:14b-instruct-q4_K_M",
  "ollama_url": "http://localhost:11434",
  "mcp_servers": {
    "filesystem": {
      "command": ["python", "-m", "mcp_server_filesystem"],
      "args": {"allowed_directories": ["."]}
    },
    "git": {
      "command": ["python", "-m", "mcp_server_git"]
    }
  },
  "excluded_files": [".git", "node_modules", "__pycache__", "*.pyc"],
  "max_file_size": 1048576
 }
 ```
 ## 5. 使用方法
 ### 基本的な使い方
 ```bash
 # インタラクティブモード
 python claude_code.py -i
 # 単発コマンド
 python claude_code.py "Pythonでクイックソートを実装して"
 # 特定のモデルを使用
 python claude_code.py --model qwen2.5-coder:7b-instruct-q4_K_M -i
 ```
 ### MCP Serverのセットアップ
 ```bash
 # 必要なMCPサーバーをインストール
 pip install mcp-server-git mcp-server-filesystem
 # 設定ファイルを編集してMCPサーバーを有効化
 ```
 ## 6. 機能一覧
 - ✅ ローカルLLMとの対話
 - ✅ ファイル読み書き
 - ✅ プロジェクト構造の自動認識
 - ✅ Gitステータス表示
 - ✅ シンタックスハイライト
 - ✅ MCP Server統合（オプション）
 - ✅ 設定ファイル対応
 ## 7. トラブルシューティング
 ### よくある問題
 1. **Ollamaが起動しない**: `ollama serve` でサーバーを起動
 2. **モデルが見つからない**: `ollama list` でインストール済みモデルを確認
 3. **メモリ不足**: より軽量な7Bモデルを使用
 4. **ファイル権限エラー**: 実行権限を確認
 ### パフォーマンス最適化
 - GPU使用を確認: `nvidia-smi` でVRAM使用量をチェック
 - モデルサイズの調整: Q4_K_M → Q4_K_S で軽量化
 - コンテキスト長を調整して応答速度を向上
 重い場合は7Bバージョン（qwen2.5-coder:7b-instruct-q4_K_M）に変更。
--- a/docs/local-llm-recommendations.md
+++ b/docs/local-llm-recommendations.md
@@ -0,0 +1,133 @@
 # おすすめローカルLLM（RTX 4060 Ti 16GB対応）
 RTX 4060 Ti 16GBにぴったりのローカルLLMをご紹介します！
 ## 🏆 アイのおすすめトップモデル（2025年版）
 ### コーディング特化
 #### 1. **Qwen2.5-Coder-14B-Instruct** 🥇
 - **特徴**: コーディングで最強クラス！
 - **推奨量子化**: Q4_K_M（約8GB VRAM使用）
 - **用途**: プログラミング、コード生成・デバッグ
 - **お兄ちゃんのGPUに最適**
 #### 2. **DeepSeek-Coder-V2-Lite-16B**
 - **特徴**: コーディングと数学に特に強い
 - **推奨量子化**: Q4_K_M（約9GB VRAM使用）
 - **用途**: 複雑なアルゴリズム、数学的計算
 ### 汎用・バランス型
 #### 3. **Qwen2.5-14B-Instruct** 🥈
 - **特徴**: 日本語も得意な万能モデル
 - **推奨量子化**: Q4_K_M（約8GB VRAM使用）
 - **用途**: 汎用タスク、日本語対話
 #### 4. **Llama 3.3-70B-Instruct（量子化）**
 - **特徴**: 405Bモデルに匹敵する性能
 - **推奨量子化**: Q3_K_S（約14GB VRAM使用）
 - **用途**: 高度な推論タスク
 - **注意**: ギリギリ動作、他のアプリケーション注意
 #### 5. **Mistral-Nemo-12B-Instruct**
 - **特徴**: バランスが良くて軽量
 - **推奨量子化**: Q5_K_M（約7GB VRAM使用）
 - **用途**: 日常的なタスク、軽快な動作
 ### 最新・注目株
 #### 6. **Phi-4-14B**
 - **特徴**: Microsoftの最新モデル
 - **推奨量子化**: Q4_K_M（約8GB VRAM使用）
 - **用途**: 最新技術の体験
 #### 7. **DeepSeek-R1-Distill-Qwen-14B**
 - **特徴**: 推論特化の新しいモデル、OpenAI-o1に匹敵
 - **推奨量子化**: Q4_K_M（約8GB VRAM使用）
 - **用途**: 複雑な推論タスク
 ## RTX 4060 Ti 16GB 推奨設定
 | モデルサイズ | 推奨量子化 | VRAM使用量 | 実行速度 | 品質 |
 |-------------|-----------|-----------|---------|------|
 | 7B | Q5_K_M | ~5GB | 🟢 速い | 良い |
 | 14B | Q4_K_M | ~8GB | 🟡 普通 | 高い |
 | 22B | Q4_K_S | ~12GB | 🟠 やや遅い | 高い |
 | 34B | Q3_K_S | ~15GB | 🔴 遅い | 最高 |
 ## アイの一番のおすすめ
 ### 用途別推奨モデル
 - **🔧 コーディング重視**: Qwen2.5-Coder-14B Q4_K_M
 - **💬 汎用対話**: Qwen2.5-14B-Instruct Q4_K_M  
 - **⚡ 軽さ重視**: Mistral-Nemo-12B Q5_K_M
 - **🧠 推論重視**: DeepSeek-R1-Distill-Qwen-14B Q4_K_M
 ## インストール方法
 ### Ollamaを使用した場合
 ```bash
 # コーディング特化
 ollama pull qwen2.5-coder:14b-instruct-q4_K_M
 # 汎用モデル
 ollama pull qwen2.5:14b-instruct-q4_K_M
 # 軽量モデル
 ollama pull mistral-nemo:12b-instruct-q5_K_M
 # 最新推論モデル
 ollama pull deepseek-r1-distill-qwen:14b-q4_K_M
 ```
 ### 使用例
 ```bash
 # インタラクティブ使用
 ollama run qwen2.5-coder:14b-instruct-q4_K_M
 # APIとして使用
 curl http://localhost:11434/api/generate -d '{
  "model": "qwen2.5-coder:14b-instruct-q4_K_M",
  "prompt": "Pythonでクイックソートを実装して"
 }'
 ```
 ## パフォーマンスのコツ
 ### VRAM最適化
 - **16GB VRAM**: 14Bモデル Q4_K_M が最適
 - **余裕がある場合**: Q5_K_M で品質向上
 - **複数モデル併用**: 7Bモデルと組み合わせ
 ### 速度向上
 - **GPU使用確認**: `nvidia-smi` でVRAM使用量チェック
 - **量子化レベル調整**: Q4_K_M → Q4_K_S で軽量化
 - **コンテキスト長調整**: 応答速度とバランス
 ## トラブルシューティング
 ### よくある問題
 1. **VRAM不足**
   - より軽い量子化（Q4_K_S, Q3_K_M）を試す
   - モデルサイズを下げる（14B → 7B）
 2. **動作が遅い**
   - GPU使用を確認
   - バックグラウンドアプリケーションを終了
 3. **品質が低い**
   - より大きなモデルサイズを試す
   - 高品質量子化（Q5_K_M, Q8_0）を使用
 ## 結論
 RTX 4060 Ti 16GBなら、高品質量子化（Q5_K_M, Q8_0）でも快適に動作します。用途に応じてモデルを選択し、最適な設定で楽しいローカルLLM体験をお楽しみください！
 ---
 *このガイドは2025年5月時点の情報に基づいています。新しいモデルが随時リリースされるため、最新情報もチェックしてくださいね〜♪*