Skip to content

Latest commit

ย 

History

History
95 lines (65 loc) ยท 4.18 KB

File metadata and controls

95 lines (65 loc) ยท 4.18 KB
uid 202601190828
created 2026-01-15 09:01
updated 2026-01-19 08:28
tags
inbox
programmatic-upgrade
status draft
ai_processed true
model AutoAgent-Ralph-v1
title 01_Source_Management

์ œ3๊ฐ•. 50๊ฐœ์˜ ๋‘๋‡Œ๋ฅผ ํ•˜๋‚˜๋กœ: ๋Œ€๊ทœ๋ชจ ์†Œ์Šค ๊ด€๋ฆฌ์˜ ๊ธฐ์ˆ 

๐Ÿ‘‹ ์•ˆ๋…•ํ•˜์„ธ์š”! ์—ฌ๋Ÿฌ๋ถ„์˜ AI ์—ฐ๊ตฌ ํŒŒํŠธ๋„ˆ, ์•ˆํ‹ฐ๊ทธ๋ž˜๋น„ํ‹ฐ์ž…๋‹ˆ๋‹ค.

์ง€๋‚œ ์‹œ๊ฐ„, ์šฐ๋ฆฌ๋Š” ์—ฐ๊ตฌ์†Œ๋ฅผ ์„ธํŒ…ํ•˜๊ณ  ์ฒซ ๋ฒˆ์งธ ๊ฐ€๋™์„ ๋งˆ์ณค์Šต๋‹ˆ๋‹ค. ์ด์ œ ๋ณธ๊ฒฉ์ ์œผ๋กœ ์—ฐ๊ตฌ์†Œ์— **'์›์žฌ๋ฃŒ'**๋ฅผ ์ฑ„์›Œ ๋„ฃ์„ ์‹œ๊ฐ„์ž…๋‹ˆ๋‹ค.

NotebookLM์ด ๋‹ค๋ฅธ AI์™€ ๊ฐ€์žฅ ๋‹ค๋ฅธ ์ ์ด ๋ญ˜๊นŒ์š”? ๋ฐ”๋กœ **"๋‚ด๊ฐ€ ์ค€ ๊ฒƒ๋งŒ ๋จน๊ณ  ์ž๋ž€๋‹ค"**๋Š” ์ ์ž…๋‹ˆ๋‹ค. ChatGPT๋Š” ์ „ ์„ธ๊ณ„ ์ธํ„ฐ๋„ท ๋ฐ์ดํ„ฐ๋ฅผ ๋จน๊ณ  ์ž๋ž์ง€๋งŒ, NotebookLM์€ ์—ฌ๋Ÿฌ๋ถ„์ด ๋„ฃ์–ด์ค€ PDF, ์—ฌ๋Ÿฌ๋ถ„์ด ๊ณ ๋ฅธ ์œ ํŠœ๋ธŒ ์˜์ƒ๋งŒ์„ ์‹ ๋ขฐํ•ฉ๋‹ˆ๋‹ค. ๊ทธ๋ž˜์„œ **"์–ด๋–ค ์†Œ์Šค๋ฅผ, ์–ด๋–ป๊ฒŒ ๊ด€๋ฆฌํ•˜๋А๋ƒ"**๊ฐ€ ๊ฒฐ๊ณผ๋ฌผ์˜ ํ€„๋ฆฌํ‹ฐ๋ฅผ 100% ์ขŒ์šฐํ•ฉ๋‹ˆ๋‹ค.

์ด๋ฒˆ ๊ฐ•์˜์—์„œ๋Š” 2025๋…„ ๊ฐ€์žฅ ํ•ซํ–ˆ๋˜ **'์•ต์ปค ๋ฌธ์„œ ์ „๋žต(Anchor Document Strategy)'**์„ ํฌํ•จํ•ด, 50๊ฐœ ์ด์ƒ์˜ ๋ฐฉ๋Œ€ํ•œ ์†Œ์Šค๋ฅผ ํ”„๋กœ์ฒ˜๋Ÿผ ๋‹ค๋ฃจ๋Š” ๋น„๋ฒ•์„ ์•Œ๋ ค๋“œ๋ฆฌ๊ฒ ์Šต๋‹ˆ๋‹ค.


๐Ÿ“š 1. 50๊ฐœ ์†Œ์Šค, ์–ด๋–ป๊ฒŒ ์ฑ„์šธ๊นŒ? (The Big Container)

Module 1. Procurement of Raw Data: Sourcing for English Excellence

(Cleaning OCR and Standardizing CSAT Passages)

๐Ÿ‘‹ Hello, Data Chemists!

In English education, the quality of your source is everything. If you upload a blurry OCR scan of a EBS workbook with messy headers and footers, your AI will produce messy results.

In this module, we learn how to "clean" English educational data to ensure 100% accuracy in logical analysis.


๐Ÿงน 1. The OCR Cleanup: English Workbook Edition

Most English teachers work with scanned PDFs or captured images from workbooks. These are full of "noise":

  • Header/Footer Noise: "2026 EBS Su-neung-teukgang Page 42".
  • Question Numbering: "31. ๋‹ค์Œ ๊ธ€์˜ ์ฃผ์ œ๋กœ ๊ฐ€์žฅ ์ ์ ˆํ•œ ๊ฒƒ์€?".
  • Vocab Glossaries: Small footnotes at the bottom of the page.

The Solution: "Focus Cropping"

Use a PDF tool (like Acrobat or PDFElement) to crop the margins. If you only want the AI to analyze the Paragraph Logic, remove the question stems and the footnotes before uploading.


๐ŸŽญ 2. Managing Bilingual Content

English classrooms in Korea are primarily bilingual. You need to manage how NotebookLM sees English vs. Korean text.

Best Practice: The Layered Sourcing

  1. Level 1 (English Only): Upload the pure English paragraph as a .txt file for Logic Mapping.
  2. Level 2 (Bilingual): Upload the version with Korean translations for "Explanation Generation" and "Vocabulary Mapping".
  3. Level 3 (Logic Anchor): Upload our 00_Anchor_Comparative_Lens_EN.txt to guide the AI's "Logical Voice".

๐Ÿ“Š 3. CSAT Data Tables: Analyzing Mock Exams

Did you just get the results of a National Mock Exam (๋ชจ์˜๊ณ ์‚ฌ)? Don't just look at the scores.

  1. Upload the PDF of the student score results.
  2. Use the Data Table feature to convert the PDF chart into a CSV.
  3. Ask: "Which specific question type (Inference, Blank completion, Order) had the lowest accuracy?"
  4. Result: Instant personalized clinic data for your entire class.

โœ‚๏ธ 4. Segmentation by CSAT Question Type

Instead of one giant "EBS Workbook.pdf", split your sources by Question Type:

  • Folder A: Blank Completion (๋นˆ์นธ ์ถ”๋ก )
  • Folder B: Paragraph Ordering (์ˆœ์„œ ๋ฐฐ์—ด)
  • Folder C: Sentence Insertion (๋ฌธ์žฅ ์‚ฝ์ž…)

By isolating the sources, you prevent the AI from confusing the distinct logical patterns required for each type.


๐Ÿงช 5. Today's Mini-Lab: Source Refinement

๐Ÿ“Œ Mission: "The Clean Link"

Step 1: Find a PDF passage that is "messy" (contains page numbers, logos, or multiple questions). Step 2: Create a cleaned-up version of just the body text in a .txt file. Step 3: Upload both to a new Notebook and ask: "Summarize the logic of the paragraph." Step 4: Compare the results. Notice how the cleaned version leads to a much more "elegant" and accurate summary.

Pro-tip: A clean source is the first step to becoming a "Prompt Grandmaster". ๊ธฐ๋Œ€ํ•ด์ฃผ์„ธ์š”!