Coursework | JC4003: Natural Language Processing - Group Assessment: Understanding and Generating Explanations from the RuozhiBa Dataset

NLP Group Assessment: Assignment Writing Service

Understanding and Generating Explanations from the RuozhiBa Dataset Assignment Writing Service

1 Assessment Overview Assignment Writing Service

In this group assessment, you will explore and experiment with traditional machine learning and deep learning models, including large language models (LLMs), to generate accurate meanings and explanations for the samples provided in the RuozhiBa dataset. The purpose of this exercise is to apply your knowledge from the course to a real-world dataset, practicing your skills in data annotation, model design, and evaluation. Assignment Writing Service

2 Objectives
2.1 Data Annotation Assignment Writing Service

Each student will be responsible for annotating a portion of the RuozhiBa dataset in Chinese to provide clear explanations for each sample. This will aid in understanding the actual meaning behind the data samples. Assignment Writing Service

2.2 Model Training Assignment Writing Service

Working in groups, you will build models to generate accurate meanings or explanations for unseen data samples using the annotated dataset. You may choose from traditional machine learning methods, deep learning models, or large language models. Assignment Writing Service

2.3 Model Evaluation
Your model’s performance will be assessed using both automatic evaluation metrics, Assignment Writing Service

such as BLEU and ROUGE, and human evaluation conducted by your group. 2.4 Presentation & Report Assignment Writing Service

Each group will present their model, explain their design choices, and demonstrate their model’s performance. Additionally, you will submit a detailed report on your process. Assignment Writing Service

1 Assignment Writing Service

3 Steps & Requirements 3.1 Data Annotation Assignment Writing Service

The dataset will be divided, and each student will receive a subset for annotation with everyone’s student ID as filename. You should annotate each data sample in Chinese, explaining its meaning in clear and concise terms. These annotated samples will be combined to form the final dataset, which will be split into training and test datasets. Assignment Writing Service

Here are some examples to help you understand better: Assignment Writing Service

Example 1: Assignment Writing Service

Original data: 根据牛顿第一定律，我推算出本次世界百大物理学家排名，爱因斯坦只能屈居第二 Assignment Writing Service
Annotated result: 牛顿第一定律本意指的是牛顿提出的第一条被公认的物理定律，而不是牛顿排名第一的定律 Assignment Writing Service

Example 2: Assignment Writing Service

• Original data: 浴霸打了一个响指，给全世界一半的人洗了澡
• Annotated result: 这里浴霸打响指借用了复仇者联盟中灭霸一个响指可以消灭全世 Assignment Writing Service
```
   界一半人口的概念进行类比，因此有了给全世界一半的人洗澡的结果
```
3.2 Forming Groups Assignment Writing Service

You will form groups of 4-6 students. Each group will work collaboratively on building a model to generate the meanings for the data samples. Group formation is flexible within each programme (cross-programme group is not allowed), but must be completed by Monday, September 23, 2024. Each group leader should send the member list to the corresponding course coordinator after the deadline. Assignment Writing Service

3.3 Model Development Assignment Writing Service

You are free to use traditional machine learning models (e.g., Naive Bayes, Logis- tic Regression) or deep learning models (e.g., RNN, LSTM, Transformers). For those interested in LLMs, the recommended approach is to use prompt engineering techniques to guide the LLM in generating accurate meanings for the dataset samples. Groups that wish to challenge themselves can attempt to fine-tune LLMs using the annotated RuozhiBa dataset to improve their model’s performance. Assignment Writing Service

3.4 Model Evaluation Assignment Writing Service

Each group’s model will be evaluated using: Assignment Writing Service

2 Assignment Writing Service

• Automatic evaluation metrics: such as BLEU, ROUGE, and other applicable metrics to assess the generated meanings’ accuracy. Assignment Writing Service

• Human evaluation: where your group will assess the quality of the outputs based on specific criteria (e.g., fluency, accuracy, relevance). Assignment Writing Service

3.5 Presentation & Report Assignment Writing Service

Each group will prepare a presentation to explain the design of their model, demonstrate its performance on the test dataset, and discuss challenges faced and solutions implemented. Assignment Writing Service

You will also submit a detailed report that covers: Assignment Writing Service

Introduction & Objectives: Why you chose your model(s) and what you aimed to Assignment Writing Service

achieve. Assignment Writing Service
Methodology: A step-by-step explanation of your approach, from annotation to model design and training. Assignment Writing Service
Experiments & Results: Your evaluation results, observations, and any adjustments you made to improve your model. Assignment Writing Service
Discussion: Insights, challenges, and future work you would consider. Assignment Writing Service

The report should be between 3000-5000 words and include references to the tools, li- braries, and models you used. Each group member’s contribution and percentage should be highlighted at the beginning of the report. Assignment Writing Service

4 Evaluation Criteria 4.1 Data Annotation (20%) Assignment Writing Service

Goal: Evaluate the clarity, accuracy, and comprehensiveness of the annotated explanations for the RuozhiBa dataset samples. Assignment Writing Service

Criteria Weight Description Assignment Writing Service

Clarity of Explanation Assignment Writing Service	5% Assignment Writing Service	The annotations should provide clear, eas- ily understandable explanations of each sam- ple’s meaning. No ambiguity or vagueness should be present. Assignment Writing Service
Accuracy of Annotation Assignment Writing Service	5% Assignment Writing Service	The meaning of each sample should be anno- tated correctly in line with its context. This involves capturing nuances and key elements accurately. Assignment Writing Service

3 Assignment Writing Service

Consistency of Terminology Assignment Writing Service	5% Assignment Writing Service	The use of terminology should be consistent throughout the annotations, especially when describing similar concepts across different samples. Assignment Writing Service
Completeness Assignment Writing Service	5% Assignment Writing Service	All samples in the assigned portion of the dataset should be annotated. No gaps or skipped samples should be present. Assignment Writing Service

4.2 Presentation (40%)
Goal: Assess how effectively the group explains their methodology, model design, and results Assignment Writing Service

in a clear, professional manner. Assignment Writing Service

Criteria Weight Description Assignment Writing Service

Introduction & Objectives Assignment Writing Service	8% Assignment Writing Service	The group provides a clear introduction to their approach, objectives, and rationale for choosing their models and methods. Assignment Writing Service
Methodology Explanation Assignment Writing Service	12% Assignment Writing Service	Clear and logical explanation of the method- ology. This includes the model design, choice of algorithms, training processes, and evalu- ation setup. Assignment Writing Service
Results & Demonstration Assignment Writing Service	12% Assignment Writing Service	The group demonstrates their model’s per- formance on the test dataset. Includes a clear discussion of automatic and human evalua- tion metrics. Assignment Writing Service
Visual Aids & Communica- tion Assignment Writing Service	4% Assignment Writing Service	The presentation is well-organized, with clear slides, diagrams, or visual aids. The group communicates confidently and explains key points clearly. Assignment Writing Service
Q&A Handling Assignment Writing Service	4% Assignment Writing Service	The group effectively handles questions from the audience or instructor, demonstrating understanding of their model and results. Assignment Writing Service

4.3 Report (40%)
Goal: Assess the depth of the group’s understanding, analytical rigor, and ability to Assignment Writing Service

communicate their work in a structured, professional format. Assignment Writing Service