Praxis 5362-IV評量與評估核心問答 (Assessment & Evaluation) Discussion

Mu Mei Hsueh
Jun 15
20 min read

Updated: Jun 20

Summary: This study guide covers the critical principles of assessing English Learners (ELs). A core theme in ESOL testing is ensuring that assessments truly measure a student's content knowledge or specific language skills without being unfairly hindered by their overall language proficiency or cultural background. Understanding how to appropriately accommodate ELs and utilize alternative assessments like portfolios are key to success in this domain.

Key Concepts:

Formative vs. Summative Assessment:
- Formative (形成性): "Assessment FOR learning." Ongoing, informal checks used to guide and adjust future instruction (e.g., exit tickets, observation, thumbs up/down).
- Summative (總結性): "Assessment OF learning." Final evaluations at the end of a unit or year to measure total mastery (e.g., final exams, state standardized tests).
Norm-Referenced vs. Criterion-Referenced:
- Norm-Referenced (常模參照): Compares a student's performance against the performance of a peer group (the "norm"). Results are often in percentiles. (e.g., SAT, GRE).將學生的成績與「其他學生（常模）」進行比較。常見關鍵字：Percentile (百分位數), Bell curve (鐘型曲線)。例如：SAT, IQ test。
- Potential negative consequence: The scores may lead to inaccurate teacher perceptions of the student's level of intelligence and result in unfair placement in remedial classes.
- Criterion-Referenced (標準參照): Measures a student's performance against a specific, predetermined standard or criteria, regardless of how other students do. (e.g., a driving test, a class spelling test). 評估學生是否達到某個特定的「標準或技能」，不跟別人比。常見關鍵字：Mastery, Standards, Cut score。例如：考駕照、學校的期末考。
Test Validity and Reliability (效度與信度):
- Validity (效度): Accuracy. Does the test measure exactly what it claims to measure? (e.g., A math word problem that is too linguistically complex for an EL lacks validity because it tests reading, not math).測驗是否真的測量了它「宣稱要測量的東西」。在這個例子中，考卷宣稱要考「數學」，但因為英文太難，它實際上測量了學生的「英文閱讀能力」。這對 EL 來說是無效的 (Invalid)。
- Reliability (信度): Consistency. If the student takes the test again tomorrow, will they get a similar score? 測驗結果的「穩定性與一致性」。(例如：同一個學生今天考和明天考，分數應該差不多)。
Cultural Bias (文化偏見):
- When a test question assumes background knowledge specific to a certain culture. (e.g., asking a student from a desert climate to read a passage about snow shoveling or baseball rules). This disadvantages ELs.
Testing Accommodations for ELs (考試融通/調整):
- Changes made to the testing environment or format to level the playing field without changing what the test measures.
- Acceptable: Extended time, word-to-word bilingual dictionaries (no definitions), reading directions aloud, testing in a small, quiet room.
- Unacceptable: Simplifying the content/concepts, translating the reading passages.
Authentic / Alternative Assessment (真實/替代性評量):
- Assessments that require students to perform real-world tasks rather than just recalling facts.
- Portfolios (學習檔案): A collection of student work over time. Excellent for ELs because it highlights growth and progress, reduces anxiety, and captures multiple modes of expression.
- Rubrics (評分量規): Provide clear expectations and grading criteria before the task begins.

Key Questions:

Why does a math word problem potentially lack "validity" when administered to a beginner English learner?
How does an "accommodation" differ from a "modification" during testing?
Why is a portfolio assessment often considered fairer for ELs than a standardized multiple-choice exam?

1. Primary uses of individual and group literacy assessments?

個人與團體讀寫評量的主要用途是什麼？

👩‍🏫 老師詳解：
- 團體評量 (Group Assessments) 的主要用途：
  - 普查與篩選 (Universal Screening)： 開學初對全校或全年級進行快速測試，挑出「可能」需要 ESOL 服務的學生。
  - 州立問責 (State Accountability)： 每年配合聯邦法規進行的年度英語能力檢測（如 WIDA ACCESS），用來向政府證明學校的教學成效。
  - 課程評鑑 (Program Evaluation)： 評估學校目前使用的 ESOL 課程或教材是否對「大多數學生」有效。
- 個人評量 (Individual Assessments) 的主要用途：
  - 精確診斷 (Diagnostic)： 當團體測驗發現某個學生落後時，老師會透過一對一測驗，精確找出他的「病因」（例如：是不會自然發音？還是看不懂長句？）。
  - 進度監控 (Progress Monitoring)： 透過一對一的「閱讀追蹤紀錄 (Running Records)」，仔細聽學生朗讀，並記錄他常犯的錯誤類型。

2. Advantages and disadvantages of each?

這兩者（個人與團體）的優缺點分別是什麼？

👩‍🏫 老師詳解：
- 團體評量 (Group Assessments)
  - ✅ 優點 (Advantages)： 效率極高 (Time-efficient)。可以一次測量大量學生，並且能提供標準化的數據，方便進行校際或全州比較。
  - ❌ 缺點 (Disadvantages)： 缺乏細節 (Lacks detail)。它只能告訴你學生考了 60 分，卻無法告訴你他「為什麼」答錯。此外，這類考試容易引發學生的高焦慮 (High affective filter)，導致表現失常。
- 個人評量 (Individual Assessments)
  - ✅ 優點 (Advantages)： 精準且真實 (Accurate and Authentic)。老師可以觀察學生的「思考過程」，測驗壓力較小，且能針對學生的語言程度即時給予引導或調整。
  - ❌ 缺點 (Disadvantages)： 極度耗時 (Time-consuming)。需要老師一對一進行，在學生人數眾多的公立學校實施起來非常困難。

3.National requirements for exit from a language-support program ?

(退出語言支持計畫的國家標準是什麼？)

👩‍🏫 老師詳解：
- 核心考點： 美國聯邦法規（如 ESSA）嚴格規定，學校**絕對不能只憑「單一指標（單一考試）」**就讓學生退出 ESOL 計畫 (Exit/Reclassification)。
- 必須具備的「多重標準 (Multiple Criteria)」：
  1. 客觀的英語能力測驗達標 (Proficiency Test Scores)： 學生必須在州立的年度英語能力測驗中達到規定的精熟門檻（例如：聽說讀寫總分達到某個等級）。
  2. 學科學業表現 (Academic Achievement)： 學生在主流課程（如數學、科學、社會）的成績，或在州立學科標準測驗中，展現出「沒有語言支持也能跟上同儕」的能力。
  3. 教師推薦與觀察 (Teacher Recommendation)： 必須由 ESOL 老師與一般科 (General Education) 老師共同開會評估，確認學生在日常課堂中的語言表現已準備好獨立上課。

4. When and how is a home-language survey used?

(何時以及如何使用母語調查表？)

👩‍🏫 老師詳解：
- 何時 (When)： 學生**「初次註冊入學 (Initial enrollment)」**時使用。這是一項聯邦政府規定的法定程序，每一位剛踏入美國公立學校的學生家長都必須填寫。
- 如何 (How)： 它是學校的第一道**「初步篩選工具 (Initial Screening Tool)」**。如果家長在調查表上指出孩子在家會說英語以外的語言，學校就會依法在規定期限內（通常是開學 30 天內），安排學生進行正式的「英語能力安置測驗 (Placement test)」，來判定他是否具備 ESOL 服務的資格。

5. What kinds of assessments best focus on ESOL students' comprehension skills in all four domains of language acquisition?

(哪種評量最能針對 ESOL 學生在四大語言習得領域的理解/技能？)

👩‍🏫 老師詳解：
- 破題關鍵： 四大領域 (Four domains) 分別是 聽 (Listening)、說 (Speaking)、讀 (Reading)、寫 (Writing)。
- 最佳評量工具： 州立標準化英語能力測驗 (State-mandated English Language Proficiency Assessments)，例如美國非常具代表性的 WIDA ACCESS 或 ELPAC。
- 為什麼最佳？ 傳統的學科考試（如數學考卷）通常把聽說讀寫混在一起，老師看不出學生到底卡在哪。但這類專業的 ELP 測驗會將四大領域「完全獨立」出來評分。這樣老師就能準確知道：「喔！這個學生的『聽力理解』已經是高級，但『寫作』還停留在初級」，進而對症下藥。

6. What types of formative and summative assessments are effective for measuring ELs' knowledge and/or skills?

(哪些類型的形成性與總結性評量能有效測量 EL 的知識與技能？)

👩‍🏫 老師詳解： 對 EL 來說，最有效的評量必須能「降低語言障礙造成的干擾」。
- 有效的形成性評量 (Formative - 過程檢核)： * 視覺化與肢體回應： 手勢確認 (Thumbs up/down)、使用圖表組織圖 (Graphic organizers)。
  - 低焦慮測試： 觀察 (Observation)、出門條 (Exit tickets)、白板快速作答 (Whiteboards)。
- 有效的總結性評量 (Summative - 期末結算)：
  - 表現型任務 (Performance-based tasks)： 讓學生透過做海報、口頭報告或科學實驗來展現知識，並搭配明確的 評分量規 (Rubrics)，而不是只給他們一張充滿密密麻麻英文字的選擇題考卷。

7. What is one test task that could be used to assess productive language skills?

(請舉一個可用來評量「產出性語言技能」的測驗任務？)

👩‍🏫 老師詳解：
- 觀念釐清： 語言技能分為「接收性 (Receptive：聽、讀)」與**「產出性 (Productive：說、寫)」**。這題要求我們設計「說」或「寫」的任務。
- 測驗任務舉例：
  - 評估「說 (Speaking)」： 給學生一組有順序的圖片（Sequence pictures），請他們**「口頭描述 (Orally retell)」**圖片中的故事發展。
  - 評估「寫 (Writing)」： 給予一個寫作提示 (Writing prompt)，請學生寫一篇簡短的日記，或是完成一篇「比較與對比 (Compare and contrast)」的短文。

8. How can a portfolio assessment be an effective tool to evaluate ELs' progress?

(為什麼學習檔案評量是評估 EL 進步的有效工具？)

👩‍🏫 老師詳解：

看見成長軌跡 (Shows growth over time)： 學習檔案 (Portfolio) 是長期收集學生作品（如草稿、錄音檔、畫作、最終版文章）的資料夾。它不是「一試定生死」，所以能真實反映學生這一學期以來的進步。
降低情感過濾 (Lowers affective filter)： 傳統考試容易讓 EL 極度焦慮。學習檔案允許學生有時間修改、反思自己的作品，大大降低了考試焦慮。
多元展現 (Multiple modalities)： 學生可以放入不同形式的作品（圖文並茂的報告、音檔），幫助那些「口語還不行，但很會寫」或「閱讀很弱，但很會畫圖表達」的學生展現真實能力。

9. What criteria should be taken into account when selecting the appropriate assessment instrument for EL skills?

(在選擇適合 EL 學生技能的評量工具時，應考慮哪些標準？)

👩‍🏫 老師詳解：

觀念釐清： 選擇評量工具絕對不能只看「方便性」或「現成可用的考卷」。身為 ESOL 教師，我們必須確保該測驗對語言學習者是「公平的 (Fair)」且「準確的 (Accurate)」。如果選錯工具，考出來的分數就毫無參考價值。
必考的四大篩選標準 (Evaluation Criteria)：
- 效度與信度 (Validity and Reliability)： 這是最根本的標準。測驗是否真的測量了我們想測的技能（效度）？測驗結果是否穩定且沒有被瞎猜干擾（信度）？
- 缺乏文化與語言偏見 (Absence of cultural and linguistic bias)： 老師必須嚴格審視題目中是否隱含了 EL 學生不可能懂的「美國本土文化背景」（例如：題目設定在棒球場或萬聖節情境）或「艱澀的慣用語」。如果有，這個測驗就不適合。
- 學生的當前語言熟練度 (Current language proficiency level)： 評量形式必須符合學生的程度。如果是針對「初級 (Beginning)」學生考歷史，就不該選擇「申論題 (Essay)」，而應該選擇「圖片配對」或「時間軸填空」，以免語言障礙干擾對歷史知識的評估。
- 對齊課堂目標與標準 (Alignment with objectives and standards)： 評量工具必須與你教過的內容，以及州政府規定的課綱標準（如 WIDA 或 TESOL 標準）緊密扣合。

10. What is the difference between a needs assessment and a diagnostic assessment?

(需求評估和診斷性評量有什麼不同？)

👩‍🏫 老師詳解：

觀念釐清： 這兩者通常都發生在「教學初期」，但它們的視野完全不同！「需求評估 (Needs Assessment)」看的是巨觀的「大環境與資源配置 (Big Picture)」；而「診斷性評量 (Diagnostic Assessment)」看的是微觀的「學術知識漏洞 (Academic Details)」。
詳細比較與舉例：
- 需求評估 (Needs Assessment)：
  - 目的： 了解學生為了成功學習，整體上「需要哪些外部資源或計畫支持」。
  - 評估內容： 學生的母語背景、先前的教育經歷（例如是否為中斷正規教育的 SIFE 學生）、學習動機，甚至家庭是否需要額外的社群資源協助。
  - 實際任務舉例： 開學初讓家長填寫的母語調查問卷 (Home-language survey)、入學面談，或是對學生發放「學習風格與興趣問卷」。
- 診斷性評量 (Diagnostic Assessment)：
  - 目的： 在「特定單元或教學開始前」實施，用來像醫生把脈一樣，精準找出學生的「起點行為」與「先備知識 (Prior knowledge)」。
  - 評估內容： 具體的學術強項與弱項 (Academic strengths and weaknesses)。
  - 實際任務舉例： 在開始教「過去簡單式」這個文法單元之前，老師先發布一篇極短的閱讀測驗（不計分），請學生圈出所有表示過去時間的動詞，藉此「診斷」他們是否已經具備時態轉換的先備概念。

11. What different means of evaluation can teachers use to measure their students' progress toward meeting state and national standards?

(教師可以使用哪些不同的評估方式來測量學生達成州立和國家標準的進度？)

👩‍🏫 老師詳解：
- 核心概念：三角交叉驗證 (Triangulation)。在教育評量中，我們絕對不能只用一種方法打分數。要確認學生是否達到國家標準，老師必須混合使用以下三種手段：
  1. 正式/總結性評量 (Formal/Summative)： 州立標準化測驗 (State-mandated tests)、期末考（用來客觀對齊國家標準）。效標參照測驗 (Criterion-referenced assessments) 與對齊標準的評分量表 (Rubrics)。
  2. 表現型/替代性評量 (Performance-based/Alternative)： 專題報告 (Projects)、學習檔案 (Portfolios)，並搭配明確的評分量規 (Rubrics) 來衡量學生應用知識的能力。
  3. 非正式/形成性評量 (Informal/Formative)： 日常的課堂觀察、隨堂測驗（用來即時監控日常進度）。

12. How do state and national requirements affect the reporting of ESOL students' scores on standardized tests?

(州和國家要求如何影響 ESOL 學生在標準化測驗分數上的匯報？)

👩‍🏫 老師詳解：
- 關鍵字：分類匯報 (Disaggregation of Data) 與 問責制 (Accountability)。
- 美國聯邦法（如 ESSA - Every Student Succeeds Act）嚴格規定，學校在呈報標準化測驗成績時，必須將 EL 學生的分數作為一個「獨立的次群體 (Separate subgroup)」特別列出來。
- 為什麼？ 為了防止學校用一般生 (Native speakers) 的高分來「掩蓋」EL 學生的落後。這項法規強迫學校必須對 EL 的學習成效負責，證明他們每年都有達到「適當的年度進步 (Adequate Yearly Progress)」。

13. What are some formal and informal techniques that could be used to assess how well students are progressing in content-area learning?

(可以使用哪些正式和非正式的技巧來評估學生在「內容學科」學習上的進度？)

👩‍🏫 老師詳解： 這題考驗的是您在教自然、社會、數學等「學科 (Content-area)」時的武器庫。
- 正式評量 (Formal Techniques)： 這些是有固定格式且會打分數的。例如：提供適當考試融通（如延長時間、雙語字典）的單元章節測驗 (Chapter tests)、或是要求學生做一份有評分量規的科學海報報告 (Science poster project)。
- 非正式評量 (Informal Techniques)： 隨機、不打分數的過程檢核。例如：課堂上的軼事紀錄 (Anecdotal records/Observations)、讓學生填寫圖表組織圖 (Graphic organizers) 整理重點、或是下課前的出門條 (Exit tickets)。

14. What is one assessment on the Industrial Revolution that is appropriate for an intermediate-level EL?

(對於一個「中級」EL 來說，關於「工業革命」的適當評量是什麼？)

👩‍🏫 老師詳解：
- 破題： 中級 (Intermediate) 學生具備基礎的日常對話能力，能寫出簡單的句子，但面對充滿專有名詞的長篇申論題仍會感到極大挫折。
- 適當的評量設計 (Accommodated Assessments)：
  1. 時間軸 (Timeline)： 請學生畫出工業革命的時間軸，並在每個重要發明旁寫下 1-2 句簡單的英文描述。
  2. 原因與結果圖表 (Cause-and-Effect Graphic Organizer)： 給學生一個表格，讓他們填入關鍵字或短句來解釋工廠制度的影響。
  3. 圖文配對 (Matching)： 將簡化的英文定義與歷史圖片（如蒸汽機、紡織廠）進行配對。

15. Why is it important for teachers to model techniques for self-assessment?

(為什麼教師「示範」自我評量的技巧很重要？)

👩‍🏫 老師詳解：
- 關鍵字：自主學習 (Autonomous learning), 後設認知 (Metacognition) 與獨立學習者 (Independent Learners)。
- 許多 EL 學生不知道如何監控自己的學習狀況。老師必須親自「示範 (Model)」——例如使用 放聲思考法 (Think-aloud) 說出：「我寫完這段後，我會拿著 Rubric 檢查我有沒有寫出三個支持性細節...」。
- 透過示範，學生能學會如何使用檢核表 (Checklists) 或量規 (Rubrics) 來發現自己的錯誤，減少對老師的依賴，最終成為能對自己學習負責的獨立學習者。

16. What is the value of peer assessment?

(同儕評量的價值是什麼？)

👩‍🏫 老師詳解：
- 降低焦慮 (Lowers affective filter)： 對 EL 來說，被同學給予回饋，通常比直接面對老師的「紅筆批改」來得不那麼可怕。
- 促進意義協商 (Negotiation of Meaning)：同儕在互相評分的過程中，必須用英文討論和解釋「為什麼這裡要這樣改」，這創造了極佳的真實溝通機會 (Meaningful interaction)。
- 內化評分標準 (Internalizing Rubrics)：當學生必須拿著評分量規去檢視別人的作品時，他們會更深刻地理解「好作品的標準是什麼」，這能直接提升他們未來的自我監控能力。

17. How can language-proficiency skills affect the outcome of an assessment of cognitive achievement?

(語言熟練度會如何影響「認知成就測驗」的結果？)

👩‍🏫 老師詳解：
- 核心觀念：效度威脅 (Threat to validity)or 破壞效度 (Compromises Validity)
- 認知成就測驗（例如智力測驗、邏輯推理）的目的是測量學生的「聰明才智」。但如果這個測驗是用「高難度的英文」寫成的，一個極度聰明但剛來美國的 EL 就會考得很低分。
- 結果： 學生的「低語言能力」會掩蓋 (Mask) 他們真實的「高認知成就」。這會導致老師或學校誤判這個學生「不聰明」或「有學習障礙」，這是一個非常嚴重且常見的誤區。

18. What accommodations can be given to ESOL students to accurately measure their linguistic and academic proficiencies?

(可以提供 ESOL 學生哪些「考試融通」，以準確測量他們的語言和學術能力？)

👩‍🏫 老師詳解：
- 核心原則： 融通 (Accommodations) 的目的是「讓比賽場地公平 (Leveling the playing field)」，消除語言障礙，但絕對不降低學科難度。
- 常見且合法的融通：
  1. 延長考試時間 (Extended time)。
  2. 提供雙語詞彙表 (Bilingual word-to-word dictionary)： 注意，只能是「單字對單字」的純翻譯字典，不能包含單字定義或解釋。
  3. 大聲朗讀指示 (Read-aloud directions)： 老師可以把考試的「規則」大聲念出來，但不能翻譯「考題內容」。
  4. 獨立或小組考場 (Small group administration)： 安排安靜、低干擾的空間以降低學生的焦慮。

19. How do special education needs factor into decisions about ESOL student placement?

(特殊教育需求如何影響 ESOL 學生的安置決定？)

👩‍🏫 老師詳解：
- 雙重身分 (Dual Identification)：一個學生完全可以同時是 EL 也是特教生 (SPED)。
- 法律規定： 學生絕對不能因為有特教需求（如學習障礙）就被剝奪接受 ESOL 語言支持的權利；反之亦然。不能說「他已經去上資源班了，所以不用上 ESOL」。
- 實務作法： 安置與課程規劃必須由一個「跨領域團隊 (Multidisciplinary Team/IEP Team)」共同決定，這個團隊必須包含 ESOL 老師、特教老師、一般科老師與家長。ESOL 老師要在其中確保語言目標被納入 IEP (個別化教育計畫) 中。

20. What kind of evidence can indicate that an EL might be a candidate for a gifted program?

(哪種證據可以表明 EL 可能是「資優計畫」的候選人？)

👩‍🏫 老師詳解：
- 考場陷阱： EL 學生在美國的資優班 (Gifted & Talented Programs) 中極度缺乏代表性，因為傳統的資優測驗太過依賴「英文閱讀能力」。
- EL 資優的具體證據：
  1. 非語文智力測驗表現優異 (High performance on non-verbal assessments)： 例如圖形邏輯推理、空間測驗。
  2. 母語能力卓越 (Exceptional L1 ability)： 在自己的母語中展現出遠超同齡人的詞彙量、講故事能力或抽象思維。
  3. 快速的學習率 (Rapid learning rate)：雖然剛開始學英文，但吸收與應用新概念的速度驚人。
  4. 解決問題的創意： 在面對數學難題或日常問題時，能跳脫框架思考。

21. What are examples of concrete evidence that indicate that an EL has cognitive difficulties in addition to language-learning difficulties?

(有哪些具體證據可以表明，EL 除了語言學習困難外，還有「認知困難 / 學習障礙」？)

👩‍🏫 老師詳解：
- 黃金判斷標準：障礙必須存在於「兩種語言」中！
- 具體證據：
  1. 跨語言的共同困難： 學生在母語 (L1) 和英文 (L2) 中都表現出相同的邏輯或閱讀困難(Difficulties present in L1 and L2)。例如：用母語也無法理解字母拼讀的邏輯，或無法重述一個簡單的故事。
  2. 對介入教學無效 (Lack of progress with RTI)： 經過長時間、高質量的 ESOL 教學與介入輔導 (Response to Intervention) 後，學生的學習進度仍然遠遠落後於其他具有相似背景的 EL 同儕。
  3. 非語言智力測驗（如圖形推理）表現持續低落。

22. How might vastly different scores achieved by the same ESOL student on the same test material be explained?

(同一個 ESOL 學生在相同的測驗材料上，取得截然不同的分數，該如何解釋？)

👩‍🏫 老師詳解：
破題關鍵： 信度受損 (Compromised reliability)。
- 如果一個學生昨天考 90 分，今天考類似的內容卻只考 40 分，通常不是因為他突然變笨，而是受到以下變數干擾：
  1. 測驗形式 (Testing Format) 的差異： 90 分那次可能是「口頭報告」或「連連看」，而 40 分那次是「全英文選擇題」。需要大量閱讀的格式會嚴重拉低 EL 的分數。
  2. 情感過濾 (Affective Filter) 的波動： 學生當天可能特別焦慮、疲累、或生病，導致大腦啟動了防禦機制，無法提取學過的語言知識。

23. How can cultural bias affect the scores of ESOL students on standardized tests?

(文化偏見會如何影響 ESOL 學生在標準化測驗中的分數？)

👩‍🏫 老師詳解：
- 破壞效度 (Compromises Validity)：標準化測驗常常預設學生具備「美國主流文化」的背景知識 (Background knowledge)。
- 具體影響： 考題如果出現關於「萬聖節 Trick-or-Treat」、「美式足球規則」或「鏟雪」的閱讀測驗，剛來美國的 EL 學生可能因為「文化不熟」而答錯，而不是因為「英文不好」。這種文化偏見會人為地壓低 EL 的分數，造成不公平的評估。

📊 核心概念比較表格 (Comparison Table)

核心概念 (Concept)	定義 (Definition)	測驗被母語人士「試測」造成的影響 (Impact of Field-Testing on Proficient Speakers)	具體例子 (Concrete Example)
效度 (Validity)	測驗是否真正測量了它想測量的東西？ (Does it measure what it is supposed to measure?)	如果考題是根據精通英語的學生所設計，它對 EL 學生測量的往往不是「學科知識」，而是「英文閱讀能力」，這樣效度就被破壞了。	一份數學測驗使用了極度艱澀的英文單字。EL 學生其實懂數學公式，但因為看不懂題目而考低分。這份測驗對 EL 學生缺乏「效度」。
信度 (Reliability)	測驗結果是否穩定且一致？ (Are the results consistent?)	因為考題的語境或文化只有母語人士懂，EL 學生在作答時往往只能靠「猜測」。靠運氣作答會導致分數忽高忽低，破壞了信度。	同一位 EL 學生在星期一和星期三做同一份難度相同的測驗，但因為星期三的題目剛好都沒有用到他不懂的文化俚語，分數突然暴增。這表示測驗缺乏「信度」。
偏見 (Bias)	測驗內容是否對特定群體不公平（如文化或語言上的預設）？ (Does the test unfairly penalize certain groups?)	測驗在開發時（field-tested）往往只找母語人士測試，導致題目預設了所有考生都具備美國本土的文化背景知識。	題目問及「棒球比賽中的全壘打 (Home run)」。從未接觸過棒球文化的 EL 學生會因為缺乏文化背景而失分，這就是「文化偏見 (Cultural Bias)」。

24-25. What are the characteristics of a criterion-referenced assessment? For what purposes are norm-referenced assessments used?

(標準參照評量的特徵是什麼？常模參照評量的用途是什麼？)

👩‍🏫 老師詳解：
- 標準參照 (Criterion-Referenced)：
  - 特徵： 評估學生是否達到特定的「標準或精熟度 (Mastery)」，絕對不跟別人比。
  - 例子： 學校的單元期末考、考駕照（及格就是及格，不管別人考幾分）。
- 常模參照 (Norm-Referenced)：
  - 用途： 用來將學生與「全國同年級的常模群體」進行比較 (Compare)，通常以「百分位數 (Percentile)」呈現，用來進行大規模的排名或篩選。
  - 例子： SAT, GRE 考試。

26. How can assessment results be used to modify classroom instruction to meet students' needs?

(如何利用評量結果來修改課堂教學以滿足學生的需求？)

👩‍🏫 老師詳解：
- 核心精神：以評量引導教學 (Assessment informs instruction)。
- 破題關鍵：數據驅動教學 (Data-driven instruction)。
- 具體作法： 老師在分析完形成性評量（如隨堂小考、出門條）的數據後，如果發現全班有 80% 的 EL 都把過去式動詞寫錯，老師明天就不應該繼續教未來式。老師必須「暫停進度」，修改教案，重新分組並重新教學 (Reteach) 過去式，或者針對少數落後的學生提供小組鷹架輔導。

27. What are some factors that determine a student's candidacy for an ESOL program?

(有哪些因素決定了學生是否具備參加 ESOL 計畫的資格？)

👩‍🏫 老師詳解：
- 這是一個法定的標準流程 (Standardized identification process)：
  1. 母語調查表 (Home-Language Survey)： 這是第一步。只要家長在表上填寫了家裡有使用「英文以外的語言」，學校就有法律義務進行下一步測試。
  2. 安置測驗成績 (Placement Test Scores)： 學生必須接受州政府認可的「初步英語能力測驗 (Screener)」（例如 WIDA Screener）。如果成績低於「精熟 (Proficient)」的標準，該生就正式具備 ESOL 資格。
  3. 之前的學業紀錄 (Prior Academic Records)： 如果學生有來自其他學區的轉學紀錄，顯示其為 EL，也會作為資格判斷的依據。

28. What criteria should be used to determine whether an ESOL student is ready to be exited from an ESOL program?

(應該使用哪些標準來決定 ESOL 學生是否準備好「退出」ESOL 計畫？)

👩‍🏫 老師詳解：
- 這題跟我們稍早討論的「國家標準」互相呼應。記住最高指導原則：絕對不能只看單一考試成績！必須使用「多重標準 (Multiple Criteria)」。
- 具體標準包含：
  1. 州立標準化英語測驗 (Annual ELP Assessment)： 聽、說、讀、寫總分必須達到州政府規定的「精熟 (Proficient)」門檻。
  2. 主流課程的學業表現 (Academic Performance in mainstream classes)： 學生在一般科目的成績必須證明他們「在沒有語言輔助的情況下也能成功」。
  3. 教師評估與推薦 (Teacher Recommendation)： 由 ESOL 老師和一般科老師共同認可學生已具備足夠的獨立學習能力。

29. What important factors contribute to the decision to advance an ESOL student to the next level of instruction or retain the student for further instruction at the current level?

(在決定要讓 ESOL 學生「升級」還是「留級」時，哪些是重要的考量因素？)

👩‍🏫 老師詳解：
- 考試大陷阱： 在美國教育法規中，「因為英文不好而讓學生留級 (Retention)」是被強烈禁止且可能違法的！
- 真正該考量的因素：
  1. 內容學科的掌握度 (Content Mastery)：如果學生在使用母語或適當的語言融通下，能夠理解該年級的數學、科學概念，他們就應該升級。
  2. 教學介入的有效性： 學校必須問自己：「我們有沒有提供他足夠的語言支援 (Scaffolding/Accommodations)？」如果沒有，那是學校的錯，不是學生的錯，絕不能因此留級。
  3. 整體認知與社會發展 (Cognitive and Social Development)： 必須全面評估學生的身心發展是否符合升級條件，而不是單看語言。

30. How can assessment results be communicated to parents who are not proficient in English?

(如何將評量結果溝通給「不精通英文的家長」？)

👩‍🏫 老師詳解：
- 法定權利： 根據民權法案，學校有義務以家長「能理解的語言」提供重要資訊。
- 破題關鍵： 無障礙溝通 (Accessible communication) 與合規性。
- 正確溝通策略：
  1. 提供專業翻譯 (Qualified Interpreters & Translators)： 學校必須安排專業口譯員進行家長會。絕對不能叫學生幫自己的父母翻譯成績單！ (這會破壞親子關係與資訊準確度)。
  2. 避免使用「教育行話」 (Avoid Teacher Jargon)： 不要對家長說 "Your child failed the formative criterion-referenced assessment."，要用白話文說 "Your child needs more help with reading."。
  3. 使用視覺輔助 (Visual Aids)： 使用圖表、分數長條圖、或是直接拿出學生的「學習檔案 (Portfolio)」實體作品，讓家長「看見」進步，而不用受限於文字描述。

📚 Praxis 5362 Assessment & Evaluation 終極比較表

評量類型	中文	核心概念	問自己什麼問題？	Key Words（高頻關鍵字）	常見例子	與誰比較？
Norm-Referenced Assessment	常模參照評量	跟其他學生比較	我比別人好嗎？	percentile, ranking, national average, bell curve, compared to peers, top 10%, norm group	SAT、ACT、IQ Test	其他學生
Criterion-Referenced Assessment	標準參照評量	是否達到標準	我達標了嗎？	mastery, proficiency, benchmark, objective, standard, learning target, meets expectations	Unit Test、State Assessment、州學習標準測驗	學習標準
Formative Assessment	形成性評量	學習中的回饋	學生現在學得如何？	feedback, monitor progress, check understanding, ongoing assessment, during instruction, exit ticket	Exit Ticket、Kahoot、白板作答、課堂提問	不比較
Summative Assessment	總結性評量	學習成果總結	最後學會了嗎？	final exam, end-of-unit, final project, report card, end-of-course	期末考、單元測驗、學期成績	不比較
Diagnostic Assessment	診斷性評量	找出起點	學生已經會什麼？	pretest, prior knowledge, baseline data, strengths and weaknesses, before instruction	前測、閱讀能力測驗	不比較
Alternative Assessment alter = 改變、另一個Alternative = Another Way 在腦中喊一聲「Alt 鍵！」，然後告訴自己：這就是 Plan B (另一個選擇)。	另類評量	非傳統紙筆測驗(不一定要做給老師看)	除了考卷還能如何評量？	portfolio, journal, reflection, self-assessment, peer assessment, observation	學習檔案、反思日誌、自評表	不比較
Performance-Based Assessment *Performance-Based Assessment 又是 Alternative Assessment 的其中一種。	表現評量	展示能力	能做給我看嗎？	demonstrate, perform, create, produce, presentation, role-play, experiment	演講、科學實驗、戲劇表演	表現標準
Authentic Assessment	真實評量	真實情境應用	真實生活中會做嗎？	real-world task, authentic task, practical application, simulation, real-life problem	模擬商店、商業企劃、模擬法庭	真實情境標準

Alternative Assessment

│

├── Performance-Based Assessment

│ ├── Oral Presentation

│ ├── Speech

│ ├── Role Play

│ ├── Debate

│ └── Demonstration

│

├── Portfolio Assessment

├── Self-Assessment

├── Peer Assessment

├── Observation

└── Project-Based Assessment

Praxis 5362 Section IV 評量與評估雙語互動閃卡

https://gemini.google.com/share/397e0fb0695f
https://gemini.google.com/share/73ef1af5c545
Alternative Assessment（另類評量／替代性評量） 是指：不只是用紙筆測驗（paper-and-pencil tests）來評量學生，而是透過實際表現、作品、觀察等方式來了解學生是否真正學會。(但 Performance-Based Assessment 又是 Alternative Assessment 的其中一種。)

不只是用紙筆測驗（paper-and-pencil tests）來評量學生，而是透過實際表現、作品、觀察等方式來了解學生是否真正學會。

Praxis 5362: Assessment & Evaluation Bilingual Quiz

模擬測驗:https://gemini.google.com/share/2c52bc71100f
進階版測驗:https://gemini.google.com/share/7472f750d02f

Praxis 5362-IV評量與評估核心問答 (Assessment & Evaluation) Discussion

1. Primary uses of individual and group literacy assessments?

👩‍🏫 老師詳解：

2. Advantages and disadvantages of each?

👩‍🏫 老師詳解：

👩‍🏫 老師詳解：

4. When and how is a home-language survey used?

5. What kinds of assessments best focus on ESOL students' comprehension skills in all four domains of language acquisition?

6. What types of formative and summative assessments are effective for measuring ELs' knowledge and/or skills?

7. What is one test task that could be used to assess productive language skills?

8. How can a portfolio assessment be an effective tool to evaluate ELs' progress?

9. What criteria should be taken into account when selecting the appropriate assessment instrument for EL skills?

10. What is the difference between a needs assessment and a diagnostic assessment?

11. What different means of evaluation can teachers use to measure their students' progress toward meeting state and national standards?

12. How do state and national requirements affect the reporting of ESOL students' scores on standardized tests?

13. What are some formal and informal techniques that could be used to assess how well students are progressing in content-area learning?

14. What is one assessment on the Industrial Revolution that is appropriate for an intermediate-level EL?

15. Why is it important for teachers to model techniques for self-assessment?

16. What is the value of peer assessment?

17. How can language-proficiency skills affect the outcome of an assessment of cognitive achievement?

18. What accommodations can be given to ESOL students to accurately measure their linguistic and academic proficiencies?

19. How do special education needs factor into decisions about ESOL student placement?

20. What kind of evidence can indicate that an EL might be a candidate for a gifted program?

21. What are examples of concrete evidence that indicate that an EL has cognitive difficulties in addition to language-learning difficulties?

22. How might vastly different scores achieved by the same ESOL student on the same test material be explained?

23. How can cultural bias affect the scores of ESOL students on standardized tests?

📊 核心概念比較表格 (Comparison Table)

24-25. What are the characteristics of a criterion-referenced assessment? For what purposes are norm-referenced assessments used?

26. How can assessment results be used to modify classroom instruction to meet students' needs?

27. What are some factors that determine a student's candidacy for an ESOL program?

28. What criteria should be used to determine whether an ESOL student is ready to be exited from an ESOL program?

29. What important factors contribute to the decision to advance an ESOL student to the next level of instruction or retain the student for further instruction at the current level?

30. How can assessment results be communicated to parents who are not proficient in English?

Praxis 5362 Section IV 評量與評估雙語互動閃卡

Praxis 5362: Assessment & Evaluation Bilingual Quiz

Recent Posts

Comments

1. Primary uses of individual and group literacy assessments?

👩‍🏫 老師詳解：

2. Advantages and disadvantages of each?

👩‍🏫 老師詳解：

👩‍🏫 老師詳解：

4. When and how is a home-language survey used?

5. What kinds of assessments best focus on ESOL students' comprehension skills in all four domains of language acquisition?

6. What types of formative and summative assessments are effective for measuring ELs' knowledge and/or skills?

7. What is one test task that could be used to assess productive language skills?

8. How can a portfolio assessment be an effective tool to evaluate ELs' progress?

9. What criteria should be taken into account when selecting the appropriate assessment instrument for EL skills?

10. What is the difference between a needs assessment and a diagnostic assessment?

11. What different means of evaluation can teachers use to measure their students' progress toward meeting state and national standards?

12. How do state and national requirements affect the reporting of ESOL students' scores on standardized tests?

13. What are some formal and informal techniques that could be used to assess how well students are progressing in content-area learning?

14. What is one assessment on the Industrial Revolution that is appropriate for an intermediate-level EL?

15. Why is it important for teachers to model techniques for self-assessment?

16. What is the value of peer assessment?

17. How can language-proficiency skills affect the outcome of an assessment of cognitive achievement?

18. What accommodations can be given to ESOL students to accurately measure their linguistic and academic proficiencies?

19. How do special education needs factor into decisions about ESOL student placement?

20. What kind of evidence can indicate that an EL might be a candidate for a gifted program?

21. What are examples of concrete evidence that indicate that an EL has cognitive difficulties in addition to language-learning difficulties?

22. How might vastly different scores achieved by the same ESOL student on the same test material be explained?

23. How can cultural bias affect the scores of ESOL students on standardized tests?

📊 核心概念比較表格 (Comparison Table)

24-25. What are the characteristics of a criterion-referenced assessment? For what purposes are norm-referenced assessments used?

26. How can assessment results be used to modify classroom instruction to meet students' needs?

27. What are some factors that determine a student's candidacy for an ESOL program?

28. What criteria should be used to determine whether an ESOL student is ready to be exited from an ESOL program?

29. What important factors contribute to the decision to advance an ESOL student to the next level of instruction or retain the student for further instruction at the current level?

30. How can assessment results be communicated to parents who are not proficient in English?

Praxis 5362 Section IV 評量與評估 雙語互動閃卡

Praxis 5362: Assessment & Evaluation Bilingual Quiz

Comments

Praxis 5362 Section IV 評量與評估雙語互動閃卡