
Analysis and recommendations for Chat OCR feature enhancements #38

Closed
Copilot wants to merge 1 commit into feat from copilot/enhance-chat-ocr-ui-detailview

Conversation

Copilot AI commented Dec 30, 2025


The user requested feature analysis and suggestions for enhancing the Chat OCR functionality and the ChatDetailView UI. No code changes were made, per the user's explicit instruction ("Please do not edit or modify the code; for now you may only view files and reply to me").

Current Feature Assessment

Analyzed existing implementation:

  • Single selection with keyboard navigation (↑/↓ for messages, ←/→ for classification cycling)
  • Message classification (Me/Other/System; 我/对方/系统), deletion, sender nickname management
  • Apple Vision OCR with pagination (100 messages/page)
  • Geometry-based direction detection via k-means clustering
  • Import/export (JSON, Markdown) with drag & drop support
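
The geometry-based direction detection listed above can be illustrated with a one-dimensional k-means split (k = 2) over the horizontal centers of the message bubbles. This is a minimal sketch under assumptions, not the repository's implementation: the `Side` enum, the `classifySides` function, and the choice of x-center as the only feature are all hypothetical.

```swift
// Hypothetical sketch: split bubble x-centers into a left and a right cluster
// with 1-D k-means (k = 2). Names are illustrative, not taken from the repository.
enum Side { case left, right }

func classifySides(xCenters: [Double], maxIterations: Int = 20) -> [Side] {
    guard let lo = xCenters.min(), let hi = xCenters.max(), lo < hi else {
        // Empty input or all values identical: no meaningful split, default to left.
        return xCenters.map { _ in .left }
    }
    // Seed the two centroids at the extremes of the data.
    var leftCentroid = lo
    var rightCentroid = hi
    var assignment = [Side](repeating: .left, count: xCenters.count)

    for _ in 0..<maxIterations {
        // Assignment step: each point goes to the nearer centroid.
        let next: [Side] = xCenters.map {
            abs($0 - leftCentroid) <= abs($0 - rightCentroid) ? .left : .right
        }
        // Update step: each centroid becomes the mean of its points.
        let lefts = zip(xCenters, next).filter { $0.1 == .left }.map { $0.0 }
        let rights = zip(xCenters, next).filter { $0.1 == .right }.map { $0.0 }
        if !lefts.isEmpty { leftCentroid = lefts.reduce(0, +) / Double(lefts.count) }
        if !rights.isEmpty { rightCentroid = rights.reduce(0, +) / Double(rights.count) }
        // Stop once the assignment no longer changes.
        if next == assignment { break }
        assignment = next
    }
    return assignment
}
```

With normalized x-centers, bubbles near 0.1 would fall into the left cluster and bubbles near 0.8 into the right one; mapping the clusters to Me/Other is a separate decision that this sketch leaves out.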

Recommended Feature Priorities

High Priority (High Impact, Low Complexity):

  1. Screenshot best practice hint - Display tip in empty state: "For the best OCR results, resize the window containing the chat history to fill the entire screen before taking the screenshot."
    • Location: ChatDetailView.emptyMessagesView() (lines 692-715)
  2. Original screenshot viewer - Add sidebar panel showing source image with bbox highlighting when message selected
  3. Message content editing - Fix OCR errors inline without deleting/re-importing
  4. Export with screenshots - Attach original images to exports (PDF/HTML format)

Medium Priority:

  • Screenshot grouping with section headers (Screenshot imported at {timestamp})
  • Batch reclassification ("mark all left as Other, all right as Me")
  • Statistics dashboard leveraging existing ChatParseStatistics
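
The batch reclassification item ("mark all left as Other, all right as Me") could look roughly like the following. This is a sketch only: `Message`, `MessageKind`, and `Side` are hypothetical stand-ins for whatever the real model uses, and leaving system messages untouched is an assumption.

```swift
// Hypothetical types for illustration; the real model may differ.
enum Side { case left, right }
enum MessageKind { case me, other, system }

struct Message {
    var text: String
    var side: Side?        // nil when no direction was detected
    var kind: MessageKind
}

/// Returns a copy in which every left-aligned message is marked `.other` and
/// every right-aligned message `.me`. System messages and messages without a
/// detected side are returned unchanged.
func reclassifyBySide(_ messages: [Message]) -> [Message] {
    return messages.map { message -> Message in
        guard message.kind != .system, let side = message.side else { return message }
        var updated = message
        updated.kind = (side == .left) ? .other : .me
        return updated
    }
}
```

Returning a new array rather than mutating in place keeps the operation easy to undo, which may matter for a bulk action.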

Low Priority (High Complexity):

  • Quoted content detection - Requires nested bbox analysis for WeChat quote UI patterns
  • WeChat database access - Infeasible due to SQLCipher encryption, no official API
  • Auto-scroll replay - Redundant given existing navigation

Technical Notes

Quoted content detection would require:

// Add to the ChatMessage model
struct ChatMessage {
    // ...existing fields omitted...
    let quotedContent: String?     // text of the quoted message, if any
    let quotedSenderName: String?  // sender of the quoted message, if known
}

// In ChatOCRParser:
// - Detect nested bboxes (gray quote box inside the bubble bbox)
// - Link the quote text to its parent bubble
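
The nested-bbox step mentioned in the comments can be sketched as a containment test between rectangles. Everything here is an assumption made for illustration: the `Box` type, the `quoteIndex` function, and the 0.9 area-ratio threshold used to reject a candidate that is essentially the bubble itself.

```swift
// Hypothetical sketch: names and the 0.9 area-ratio threshold are assumptions.
struct Box {
    var x: Double, y: Double, width: Double, height: Double
    var area: Double { width * height }

    /// True when `inner` lies entirely within this box.
    func contains(_ inner: Box) -> Bool {
        return inner.x >= x && inner.y >= y &&
            inner.x + inner.width <= x + width &&
            inner.y + inner.height <= y + height
    }
}

/// Returns the index of the largest candidate that is fully nested inside
/// `bubble` and noticeably smaller than it, or nil when there is none.
func quoteIndex(in bubble: Box, candidates: [Box], maxAreaRatio: Double = 0.9) -> Int? {
    var best: Int? = nil
    for (index, candidate) in candidates.enumerated() {
        // Skip boxes that overflow the bubble or are nearly as large as it.
        guard bubble.contains(candidate),
              candidate.area < bubble.area * maxAreaRatio else { continue }
        // Keep the earlier candidate when it is at least as large.
        if let current = best, candidates[current].area >= candidate.area { continue }
        best = index
    }
    return best
}
```

A geometric test like this only finds candidates; telling a gray quote box apart from other nested regions would still need color or layout cues that are not covered here.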

For screenshot viewing:

  • Cache screenshotImageData: Data? in CachedChatScreenshotMeta
  • Implement bbox overlay on image viewer
  • Add toolbar button in ChatDetailView for quick access
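
For the bbox overlay, one detail worth sketching is the coordinate conversion. Vision reports bounding boxes in normalized coordinates (0 to 1) with the origin at the lower-left corner, while most image views draw from the top-left. The sketch below assumes the cached bboxes are stored in that normalized form, which the analysis does not confirm; `NormalizedBox`, `PixelRect`, and `pixelRect(for:imageWidth:imageHeight:)` are hypothetical names.

```swift
// Hypothetical sketch: assumes bboxes are stored in Vision-style normalized
// coordinates (0...1, origin at the lower-left corner of the image).
struct NormalizedBox {
    var x: Double, y: Double, width: Double, height: Double
}

struct PixelRect: Equatable {
    var x: Double, y: Double, width: Double, height: Double
}

/// Converts a normalized, lower-left-origin box into a pixel rectangle
/// in a top-left-origin coordinate system.
func pixelRect(for box: NormalizedBox, imageWidth: Double, imageHeight: Double) -> PixelRect {
    return PixelRect(
        x: box.x * imageWidth,
        // Flip the vertical axis: the top edge sits at 1 - (y + height).
        y: (1 - box.y - box.height) * imageHeight,
        width: box.width * imageWidth,
        height: box.height * imageHeight
    )
}
```

The resulting rectangle can then be scaled by the viewer's zoom factor before the highlight is drawn.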

The user explicitly noted that Notion AI now supports image viewing, which makes screenshot attachment valuable for AI-assisted chat analysis.

Original prompt

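I'd like to further strengthen the current chat OCR feature, the chat detail view's UI, and possibly message editing and management. What other features could we build?

  1. We currently have single selection, plus delete, classify, and set-nickname functions. We are not doing multi-select.

  2. When the user creates a new chat, the detail view on the right could show a hint: for best results, resize the window containing the chat history to fill the entire screen before taking the screenshot.

  3. We currently can't distinguish content that the user quoted.

  4. The best approach is still to get WeChat's raw chat data 😭

  5. By the way, could I build a scrolling chat feature? Is it necessary? Probably not, actually.

  6. Oh right, the large models in Notion AI now support viewing images directly...

  7. And sometimes the original screenshot makes things clearer? Please do not edit or modify the code; for now you may only view files and reply to me.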



Copilot AI changed the title from "[WIP] Enhance chat OCR functionality and detail view display" to "Analysis and recommendations for Chat OCR feature enhancements" Dec 30, 2025
Copilot AI requested a review from chiimagnus December 30, 2025 13:08
@chiimagnus chiimagnus closed this Dec 30, 2025
@chiimagnus chiimagnus deleted the copilot/enhance-chat-ocr-ui-detailview branch December 30, 2025 16:30