10 October 2022
During the first week, the team members introduced themselves to each other and began discussing our project's content. Our project focused on developing a Python library, TrueLearn, from the existing algorithm logic and creating a dashboard to visualize the algorithm's parameters.
We were introduced to the concept of Human-Computer Interaction (HCI). We spent two weeks learning about:
To learn more about our project, we did some research on the project initiators and the project itself.
We went through the Knowledge4All website, learned about their mission and some of their ongoing projects, and discovered some of their products, such as videolectures.net, as well as their relationship with the X5GON organization.
We then investigated the TrueLearn family of algorithms. We found that it is a set of algorithms that builds a knowledge model of each user from the implicit data those users generate. The knowledge model built from the library can be an important part of an educational recommendation system.
These background studies gave us some high-level overviews of the project.
24 October 2022
As we are building a library and a list of different visualizations around the library, we decided to focus on the visualizations side of the project for our HCI assignment which requires us to “design and evaluate a prototype for [our] software system.”
In the first week, we spoke to our potential users to better understand their needs. We discussed some of their requirements, their experiences and pain points of using the existing platform, and their expectations of the new platform. Based on these interviews, we collected two sample responses from our targeted users (students and teachers) and put the responses in our assignment.
Based on the gathered requirements, we tried to represent our users more systematically by identifying their goals, motivations, pain points, and characteristics. Using the attributes we identified, we created personas for teachers and students and conceptualized how they would use our product in several scenarios.
In response to the pain points we gathered from our users, we started working on our design sketches. We produced two versions of the design, analyzed their strengths and weaknesses based on the feedback collected from users, and improved them.
Before the first meeting, we put together a list of questions to ask the client, covering the following areas:
After we had completed our meetings with the client, we had a basic understanding of the following concepts:
- The functionalities, input, and output of the TrueLearn algorithms
- How the input data is collected and pre-processed
- Background information about the Python library and its potential users
- What kind of visualization needs to be created and who the users of the visualization are
7 November 2022
After we had finished our sketches, we started making our prototype of the visualization dashboard. We used Balsamiq to draw the prototype and split it into two parts: a prototype for students and a prototype for teachers.
Considering our limited time after making the prototype, we chose to perform heuristic evaluations based on the 10 Usability Heuristics for User Interface Design proposed by Jakob Nielsen. Based on these 10 heuristics, we identified the following problems in our prototype:
# | Heuristic | Problem | Solution | Severity |
---|---|---|---|---|
1 | Visibility of system status | There is no text indicating the meaning of each diagram on the analytics page for both teacher and student. | Each diagram in analytics should be grouped into categories with a subtitle indicating their meaning and text explaining what they are showing. | 4 |
2 | Visibility of system status | It is not clear to the user how to open the various sub-sections of the “My Profile” and “My Content” in the sidebar. | Add an icon to indicate that the sub-sections can be opened by clicking the icon. | 2 |
3 | Visibility of system status | Courses in the video uploaded tab should be labelled with name. | Add the course name below each picture. | 2 |
4 | Aesthetic and minimalist design | The topics in the home and history pages are scattered over many rows at the top. | Topics can be grouped in a carousel so that they are not spread over multiple rows. | 1 |
5 | Consistency and standards | In the teacher’s dashboard, there are two “Analytics” entries, which might be confusing for the user. | Rename the user profile “Analytics” to “My Progress” and the teacher “Analytics” to “Content Insights”. | 2 |
We immediately performed a round of iteration on the prototype to resolve the problems above.
During these two weeks, we had a second meeting with the client. This meeting focused on how to write our literature review and on understanding the technical details of the library and visualizations.
For the literature review, after a discussion with the client, we decided to split the whole report into two parts: the Python library and the visualizations, and to research similar projects and technologies for each part.
For the Python library, we briefly discussed the mechanism behind the TrueLearn algorithm, Bayesian knowledge tracing. The client also suggested we look at how pyBKT, a project similar to TrueLearn, is implemented.
For the visualizations, the client introduced us to the open learner model, encouraging us to read about what types of visualizations are available and how each motivates learners. In addition, the client envisaged us building some dynamic visualizations via React, as this could be easily integrated into their existing video platform x5learn. He also presented some libraries for building the visualizations, including D3.js.
21 November 2022
This fortnight we have been investigating the tools needed to develop Python libraries and visualizations.
From the perspective of building a library that is easy for developers to use and learn, we believe that the Python library should:
For each of these objectives, we investigated the available tools. For documentation, we looked at Sphinx, pdoc, and pydoctor, comparing them on functionality, ease of use, and the UI (User Interface) of the generated output, and we finally chose Sphinx as our documentation generation tool. For testing, we focused on unit testing, doc testing, and generating test coverage; this led us to select pytest as our testing framework, as it is easy to use and supports additional features via its plugin system, and coverage.py as our test coverage report generator, as it is far more powerful than its alternative, trace. Regarding linters and formatters, we chose Prospector over PyLint and Flake8 because it integrates the most publicly available Python linters and works out of the box.
From the point of view of our target users, we believe that visualization should provide the richest possible information in both a static and dynamic way. So, we looked at static and dynamic visualization separately. We ended up choosing several libraries for comparison:
Where / Visualization type | Static | Dynamic |
---|---|---|
Front-end | D3.js and Chart.js | D3.js and Chart.js |
Back-end | matplotlib, seaborn, plotly | plotly |
In terms of ease of development, Plotly is the winner, as it supports both static and dynamic visualization generation on the back end, so we decided to use it as the back-end visualization library. For the front end, Chart.js is somewhat less capable than D3.js, despite integrating better with React, so from a functional point of view we decided to use D3.js for the front-end visualizations.
We were also informed by our client during this week’s meeting that the library should be extensible, meaning that it is easy for other developers to add more features to the library and integrate the library into their systems. He suggested we read the paper “API design for machine learning software: experiences from the scikit-learn project”, written by the developers of scikit-learn.
After the last meeting with the client, we started to build a MoSCoW list to refine some of the requirements for our project. We divided the requirements into functional requirements and non-functional requirements. Inside functional requirements, we listed different requirements for
Inside non-functional requirements, we focused on usability, compatibility, maintainability, and performance as we need to ensure our library and visualization are accessible and can be easily used by the targeted users.
We decided to have a conversation with the client next week and finalize the requirements at the end of this term.
During these two weeks, we also did some preliminary planning of the project and completed the first two parts of the Gantt chart:
5 December 2022
During these two weeks, we discussed our requirements with our client, who gave us some advice, such as:
Following these suggestions, we finalized our requirements and published them on the project website.
After the last meeting, we read about how scikit-learn designed their API and used this as a basis for designing TrueLearn's API. We expect our API to follow the same principles used in scikit-learn:
Implementing these principles in scikit-learn relies on the use of the estimator, predictor, and transformer interfaces, which we decided to deploy in the TrueLearn library. This design allows us to reduce the learning cost of the user and makes TrueLearn easier to extend, maintain and use.
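As a rough illustration of what this estimator/predictor style looks like in Python (a sketch only, not TrueLearn's actual classes):

```python
class ExampleClassifier:
    """Estimator: hyperparameters are set in __init__ and exposed as attributes."""

    def __init__(self, threshold: float = 0.5) -> None:
        self.threshold = threshold

    def fit(self, x, y) -> "ExampleClassifier":
        # Learn internal state from the data and return self so calls can be chained.
        self._positive_rate = sum(y) / len(y)
        return self

    def predict(self, x) -> bool:
        # Predictor: use the fitted state to produce an output for new data.
        return self._positive_rate > self.threshold

    def get_params(self) -> dict:
        # Inspection: hyperparameters can be read back programmatically.
        return {"threshold": self.threshold}
```

Keeping construction, fitting, and prediction separate like this is what later allows different classifiers to be swapped for one another.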
Combining our API design with the final requirements, we finalized our system design as the following:
9 January 2023
After agreeing on the project's criteria and deliverables in the previous term, the focus has now shifted to the development of the TrueLearn library. The project can be split into two distinct components:
After gathering the necessary files for the project, we began developing our workflow to make development more efficient. This mainly involved setting up GitHub Actions to automatically run certain tests and checks on our repository's code. We wanted three checks:
All the above have been set up to run on certain triggers (i.e., events), and the reports produced are integrated directly into GitHub for easy access.
Check status is automatically updated and available on the repository's ‘homepage.’
Errors are formatted using the GitHub Checks API.
Moving forward, we aim to make our proposed plan concrete, detailing how we would refactor the existing code. One element we have discussed is the use of interfaces to define shared behaviour between the AI models used to provide these recommendations. However, this would constrain future development of the project to that specific interface. Another approach, already used by the machine learning library scikit-learn, is the programming paradigm of duck typing. This more flexible approach allows developers to add functionality to the models without worrying about interface constraints.
In terms of the visualisations, we plan to determine what the three key visualisations are and what data structures we can use to represent the data we would like to model.
In terms of general design, we have proposed a first version of the 'truelearn' package structure and structured it as shown below:
- `bayesian_models`: contains all the classifiers that we need to implement: knowledge, novelty, interest, and INK (meta classifier)
- `preprocessing`: contains the Wikifier code that uses the Wikifier API to extract the top-n topics from some given texts
- `unit_tests`: contains all the unit tests of the package
- `visualisations`: contains the visualization code

In preparation for the upcoming elevator pitch, the group held two online meetings to discuss the script, design the PowerPoint, and rehearse. We ended up with the following design.
23 January 2023
Before we started implementing the library, we had another meeting with the client to discuss how to structure truelearn as we felt that the current design was too simple to accommodate some of the “could have” features we wanted to implement.
In terms of library structure, we discussed several points during the meeting:
- Use `learning` instead of `bayesian_models`.
- Add `models`, which contains the user model and event model.

As a result of the design considerations above, we finalized our package structure:
- `truelearn/learning`: contains all the classifiers.
- `truelearn/models`: contains the implementation of the learner model and event model.
- `truelearn/preprocessing`: contains the pre-processing functions, such as the wikifier.
- `truelearn/util`: contains some utility sub-packages.
  - `metrics`: contains methods to calculate precision, accuracy, recall, and F1 score.
  - `visualizations`: contains methods to visualize learner models. It supports bar charts, line charts, pie charts, bubble charts, etc.
- `truelearn/tests`: contains unit tests for each package shown above.

You can refer to PR #5, #8 for more details.
We took our first step of refactoring in #9.
In this PR, we implement the first three baseline classifiers presented in the TrueLearn paper: EngageClassifier (always predicts that the learner will engage with the given event), PersistentClassifier (predicts based on the last label), and MajorityClassifier (predicts engagement if the number of engagements is greater than the number of non-engagements).
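To give a flavour of how simple these baselines are, here is a sketch of the majority-style logic (a simplification, not the library's actual implementation):

```python
class MajorityClassifier:
    """Predicts engagement iff past engagements outnumber non-engagements."""

    def __init__(self) -> None:
        self._engagement = 0
        self._non_engagement = 0

    def fit(self, x, y: bool) -> "MajorityClassifier":
        # x (the learnable unit) is ignored; only the label counts matter.
        if y:
            self._engagement += 1
        else:
            self._non_engagement += 1
        return self

    def predict(self, x) -> bool:
        return self._engagement > self._non_engagement
```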
To make our API easy to use, we added type hints to our methods' return types and plan to add type hints to the parameters after we finalize the implementation of the learner model. We encountered some problems when adding type hints for the return value of instance methods, as we needed to annotate the return type with the class itself (the `fit` method). We initially used quoted annotations as a workaround to wrap the return type and support Python 3.6+. However, after a discussion with the client, we decided to only support Python 3.7+, as Python 3.6 is end-of-life, which allows us to resolve the problem with a single import:

```python
from __future__ import annotations
```
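To illustrate why the import helps (a minimal sketch with a placeholder `Knowledge` class, not the library's actual code):

```python
from __future__ import annotations


class Knowledge:
    """Placeholder for the knowledge of a learnable unit (illustration only)."""


class KnowledgeClassifier:
    def fit(self, x: Knowledge, y: bool) -> KnowledgeClassifier:
        # Without the __future__ import, annotating the return type with the
        # enclosing class would raise a NameError, because the class name is
        # not yet bound while its body is being evaluated.
        return self
```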
We then started a long journey of exploring the implementation of the learner model.
Initially, we switched back and forth between two different implementations that use four classes: `Topic`, `KnowledgeComponent`, `Knowledge`, and `LearnerModel`.

You can refer to #10 for more details. We discussed how we should implement `__repr__` and `__hash__` for `Topic`, how we should store the mapping (should we use `Topic` or another kind of mapping?), and how different ways of mapping potentially affect the usability of other components in our library (visualization).
After some discussions among the team and the client, we decided to switch to the following design:
- `AbstractKnowledgeComponent` defines an abstract interface that can be inherited by developers to implement their own knowledge components.
- `KnowledgeComponent` represents a knowledge component in the learning process. It contains information about the mean, variance, title, description and URL of the knowledge component.
- `Knowledge` stores a dictionary mapping a `Hashable` type (e.g. a topic id) to a `KnowledgeComponent`.
- `LearnerModel` stores the `Knowledge`.

Based on the learner model, we implemented our first version of `KnowledgeClassifier` and fixed all the type hints for the parameters. Now, `x` in `fit` and `predict` is of type `Knowledge`.
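To make the relationships between these classes concrete, here is a minimal sketch (field names are simplified and illustrative, not the exact library definitions):

```python
from dataclasses import dataclass, field
from typing import Dict, Hashable


@dataclass
class KnowledgeComponent:
    # A single knowledge component: a Gaussian skill estimate plus metadata.
    mean: float
    variance: float
    title: str = ""
    description: str = ""
    url: str = ""


@dataclass
class Knowledge:
    # Maps a hashable key (e.g. a Wikipedia topic id) to a knowledge component.
    knowledge: Dict[Hashable, KnowledgeComponent] = field(default_factory=dict)


@dataclass
class LearnerModel:
    # The learner is described by their accumulated knowledge.
    knowledge: Knowledge = field(default_factory=Knowledge)
```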
Now, all the classifiers have the following public APIs:
- `fit(x: Knowledge, y: bool)`: train the classifier using the knowledge of the learnable unit (`x`) and a label (`y`) that indicates whether the learner engages with the learnable unit.
- `predict(x: Knowledge)`: predict (output `True`/`False`) whether the learner will engage with the learnable unit represented by the knowledge.
- `predict_proba(x: Knowledge)`: predict (output a probability between 0 and 1) whether the learner will engage with the learnable unit represented by the knowledge.
- `get_params()`: return the parameters of the classifier in a dictionary (name => value).
  - For `EngageClassifier`, this returns an empty dictionary.
  - For `PersistentClassifier`, this returns a dictionary with only one key-value pair, storing whether the learner engaged with the last `Knowledge`.
  - For `MajorityClassifier`, this returns a dictionary with two key-value pairs, storing the number of engagements and non-engagements.
  - For `KnowledgeClassifier`, this returns a dictionary like this:

```python
{
    "threshold": self.__threshold,
    "init_skill": self.__init_skill,
    "def_var": self.__def_var,
    "beta": self.__beta,
    "positive_only": self.__positive_only,
}
```
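For example, a training loop over a stream of learning events might look like the following sketch (illustrative usage only, assuming `events` is an iterable of `(Knowledge, bool)` pairs):

```python
from truelearn.learning import KnowledgeClassifier  # package layout as described above

classifier = KnowledgeClassifier()

for knowledge, engaged in events:  # events: an iterable of (Knowledge, bool) pairs
    # Predict engagement before seeing the label, then train on the outcome.
    print(classifier.predict_proba(knowledge))
    classifier.fit(knowledge, engaged)

print(classifier.get_params())
```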
A note from the future: currently, you may notice that `KnowledgeClassifier` is not very customizable or powerful (e.g. it lacks public methods to set parameters). However, we will gradually enhance these APIs and add more public methods as we move towards implementing all of the classifiers.
6 February 2023
While implementing the model and classifier, we also started to implement the wikifier API.
The main functionality of the `Wikifier` is to call the API provided by Wikifier and convert the returned JSON into a list of topics, each represented by a dictionary containing keys like title, URL, cosine, PageRank and id.
As we need to parse the JSON and convert it into our data structures, we need a library to load it. For this, we experimented with the Python standard `json` library, UltraJSON, orjson and python-rapidjson. Based on our experiments, orjson is the best of the four, roughly 10x faster than the `json` module in the Python standard library.
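As a rough sketch of this parsing step (the field names follow our description of the Wikifier response above, but may not match the real payload exactly):

```python
import orjson


def parse_wikifier_response(raw: bytes) -> list:
    """Convert a raw Wikifier JSON response into a list of topic dictionaries."""
    response = orjson.loads(raw)
    return [
        {
            "id": annotation.get("wikiDataItemId"),
            "title": annotation.get("title"),
            "url": annotation.get("url"),
            "cosine": annotation.get("cosine"),
            "pagerank": annotation.get("pageRank"),
        }
        for annotation in response.get("annotations", [])
    ]
```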
In #13, we implement all the classifiers, finishing 10+ listed tasks.
This PR aims to implement NoveltyClassifier, InterestClassifier and INKClassifier, and to refactor the structure of the library.
The following steps are required to complete this pull request:
- Augment LearnerModel: include engagement data to help the impl of draw probability. Currently, the impl is incorrect.
- Augment Abstract/KnowledgeComponent: include timestamps for the impl of interests.
- Create an EventModel => record the knowledge representation of the learnable unit and the timestamp when the learning event happens (useful in interests)
- Create BaseClassifier (define abstract methods) and common base class for Knowledge, Novelty, and Interest Classifier as they share many helper methods
- Fix the draw probability implementation
- Implement NoveltyClassifier
- Implement InterestClassifier
- Utilize BaseClassifier for type checking
- Implement INKClassifier
There are some non-functional refactorings to make our library better:
- Use `@dataclass` to implement `LearnerModel` and `EventModel`.
- Extract some methods from the class hierarchy and make them free functions (e.g. `team_sum_quality`, `select_topic_kc_pairs`). These methods are not closely related to the internal state of the classifier, nor are they part of the classifier’s behaviour.
- Remove the default argument in `InterestNoveltyKnowledgeBaseClassifier`. The default should be set in the base class.
- Use keyword arguments to make `AbstractKnowledgeComponent` difficult to use incorrectly (a minimal sketch follows this list).
- Include the `typing_extensions` package (its support of types like `Self` and `Final` is beneficial to the library implementation) and rewrite some of the type hints.
- Remove `AbstractKnowledge` and only keep `AbstractKnowledgeComponent`.
- Switch to Google-style docstrings as they are easier to write and read (no need to write `-------` separators or maintain type information by hand).
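To illustrate the keyword-argument point from the list above (a sketch, not the actual constructor):

```python
class KnowledgeComponent:
    def __init__(self, *, mean: float, variance: float, title: str = "") -> None:
        # The bare `*` forces keyword arguments, so callers cannot silently
        # swap positional values such as mean and variance.
        self.mean = mean
        self.variance = variance
        self.title = title


# KnowledgeComponent(0.5, 1.0) raises a TypeError; callers must be explicit:
kc = KnowledgeComponent(mean=0.5, variance=1.0, title="Machine learning")
```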
In summary, the main achievements of this PR include:
- `truelearn.learning`
  - Implement `NoveltyClassifier`, `InterestClassifier` and `INKClassifier`.
  - Add `BaseClassifier` and implement type-based constraint checking, `validate_params()`, `get_params()` and `set_params()` in it.
  - Add `InterestNoveltyKnowledgeBaseClassifier`, which implements the shared methods used by the three classifiers.
- `truelearn.models`
  - Augment the models to support `InterestClassifier` and `INKClassifier`.
  - Separate the `Knowledge` that represents the learnable unit from an `EventModel` that models a learning event. In the event model, we store the timestamp when the event happened. This timestamp is used in `InterestClassifier` and `INKClassifier`.
  - Implement the models with `@dataclass`.
- Add `typing_extensions` to our dependencies as it brings more types from later Python versions to Python 3.7 (i.e. `Self` and `Final`).
- Use `|`, `tuple` and `dict` in the type hints.

There are also some discussions around type hints and private/public variables. You can find them in the PR.
We implement the following metrics:
- `get_precision_score`
- `get_recall_score`
- `get_accuracy_score`
- `get_f1_score`

These functions are implemented by importing and re-exporting from the scikit-learn library. We have included them as part of the `truelearn` package because we envisage the need to add more metrics here in the future.
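A minimal sketch of what such a re-export can look like (illustrative module path and wrappers; the real code may differ):

```python
# truelearn/utils/metrics.py (illustrative)
from sklearn.metrics import accuracy_score, f1_score, precision_score, recall_score


def get_precision_score(y_true, y_pred) -> float:
    # Thin wrapper so users do not need to import scikit-learn directly.
    return precision_score(y_true, y_pred)


def get_recall_score(y_true, y_pred) -> float:
    return recall_score(y_true, y_pred)


def get_accuracy_score(y_true, y_pred) -> float:
    return accuracy_score(y_true, y_pred)


def get_f1_score(y_true, y_pred) -> float:
    return f1_score(y_true, y_pred)
```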
As we progressed on classifiers and models, we felt we needed to provide some ways for developers to experiment with different classifiers.
We intend to provide APIs that mimic those in scikit-learn. The dataset we use is PEEK-Dataset, which is described in this paper and hosted here.
We provide two methods, `load_peek_dataset()` and `load_peek_dataset_raw()`, to load the PEEK dataset in parsed/raw format.
To implement these two methods, we initially wanted to include the datasets inside our package, like some of the datasets in scikit-learn. However, we soon realised that this was not feasible, as the dataset is over 30 MB. Including this non-essential resource would have inflated our package and made it take longer for users to download.
We have therefore implemented a basic downloader that can download PEEK datasets and validate their sha256sum. Users can use it to download datasets as needed. It also provides caching: when you call `load_peek_dataset/_raw()` multiple times, the data will only be downloaded once.
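The idea behind the downloader, in a rough sketch (URLs, file names and checksums here are placeholders, not the real ones):

```python
import hashlib
import os
import urllib.request


def download_file(url: str, filename: str, expected_sha256: str) -> str:
    """Download a file once, verify its sha256, and reuse the cached copy afterwards."""
    if not os.path.exists(filename):  # caching: skip the download if the file exists
        urllib.request.urlretrieve(url, filename)

    with open(filename, "rb") as file:
        digest = hashlib.sha256(file.read()).hexdigest()
    if digest != expected_sha256:
        raise ValueError(f"sha256 mismatch for {filename}: got {digest}")
    return filename
```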
When implementing `truelearn.datasets`, we also made a PR to the upstream PEEK-Dataset. The motivation for this PR was to add some additional information (i.e. title and description) for each topic. Adding this information to the mapping gives us more choices when implementing the visualization.
We implemented a crawler in Python to fetch the title from the URL in the PEEK-Dataset mapping and then used the fetched title to request another Wikipedia API (`https://en.wikipedia.org/w/rest.php/v1/search/title?q={title}&limit=10`) to get the relevant description.
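A simplified sketch of that lookup (error handling and rate limiting omitted; response fields follow the Wikipedia REST search API):

```python
import urllib.parse
import urllib.request

import orjson


def fetch_description(title: str) -> str:
    """Query the Wikipedia REST search API for the best-matching description."""
    url = (
        "https://en.wikipedia.org/w/rest.php/v1/search/title?"
        + urllib.parse.urlencode({"q": title, "limit": 10})
    )
    with urllib.request.urlopen(url) as response:
        pages = orjson.loads(response.read()).get("pages", [])
    if not pages:
        return ""
    # The first result is usually the closest match to the requested title.
    return pages[0].get("description") or ""
```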
We processed 30366 URLs in the dataset and discovered something interesting about Wikipedia and the first version of the dataset. We will present them briefly below:
The `limit` in the API that provides the description is not simply a limit on the length of the data. Sometimes, you will find that the first results of `limit=1` and `limit=10` are different, and the latter is more accurate. This is probably related to some voting algorithm (like KNN).
In Wikipedia, many topics lack descriptions.
Wikipedia and Wikimedia, though both hosted by the Wikimedia Foundation, provide different descriptions of the same topics.
Some topics (their id shown below) are deleted from Wikipedia for various reasons (lack of evidence, promotion, etc.):
```python
broken_links = [
    "1256",
    "3203",
    "4924",
    "6057",
    "8543",
    "13172",
    "16347",
    "20258",
    "25968",
]
```
From the future: after the upstream merged the changes, we utilized the title and description in our implementation (#27).
As we already have the docstrings in the source files, we started to implement some CIs to build documentation automatically for each new commit/release.
We used Sphinx to automatically generate documentation based on some pre-generated templates and the docstrings in the source files. In #14, we set up a basic template at `docs`.
Later, after discussing with the client, we decided to switch to the state-of-the-art documentation-hosting platform `readthedocs` and set up the configuration in `readthedocs.yaml`. (This is still a work in progress.)
Based on the discussion in #32, we decided to make GitHub Annotations available only when the tests or linting fail, which reduces the number of checks shown when all the tests are successful.
Also, CI caching is enabled for static analysis, unit testing and code coverage. It reduces the time to install the dependencies every time we run these actions.
`prospector.yml` was slightly adjusted to reflect the changes in #13, where we implemented the classifiers. We limit the line length to 88 and exclude some checks in `pydocstyle` because we switched to Google-style docstrings.
In #16, we made the following changes related to packaging and project structure:
- Replace `requirements.txt` and `setup.py` with `pyproject.toml`
27 February 2023
As mentioned before, we raised a PR to the upstream of the PEEK Dataset, and it was merged. In week 17, we applied the upstream changes to truelearn. Now, truelearn can provide titles and descriptions that match the Wikipedia topic id.
With most of the implementation completed, we gradually started to refactor the existing code.
We started by replacing the `ABC` abstract classes in `truelearn.models` with `Protocol`s, which makes our library more extensible: developers implementing their own `KnowledgeComponent` don’t necessarily need to inherit from our `AbstractKnowledgeComponent`; they just need to implement the `AbstractKnowledgeComponent` APIs. This is the benefit of duck typing. (To be precise, this is explicit duck typing, since we define our API explicitly via `Protocol`.)
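A minimal sketch of what this looks like with `typing.Protocol` (member names simplified; the real `AbstractKnowledgeComponent` exposes more):

```python
from typing import Protocol  # typing_extensions provides this on Python 3.7


class AbstractKnowledgeComponent(Protocol):
    """Anything providing these members satisfies the protocol; no inheritance needed."""

    mean: float
    variance: float

    def update(self, *, mean: float, variance: float) -> None:
        ...


class MyKnowledgeComponent:
    # Defined completely independently of truelearn, yet usable wherever an
    # AbstractKnowledgeComponent is expected (explicit duck typing).
    def __init__(self, mean: float, variance: float) -> None:
        self.mean = mean
        self.variance = variance

    def update(self, *, mean: float, variance: float) -> None:
        self.mean, self.variance = mean, variance
```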
Second, to serve time-related visualizations, we designed the `HistoryAwareKnowledgeComponent`, which inherits from `KnowledgeComponent` and can store previous updates in a history buffer.
In addition to the changes to `truelearn.models`, we formally decided to remove `truelearn.utils.persistent` because:

- Methods like `pickle.dump` require only one line of code from the user to save the class locally. Therefore, there is no need for truelearn to encapsulate these functions in its own subpackage.
- Users can already choose from several serialization tools, such as `pickle`, `joblib` and `skops`.
To improve the usability and reliability of the library, we have added examples to the public classes/methods of `models`, `learning` and `datasets` in #27. These examples can be used not only to quickly help users understand how to use the classes and methods, but also as tests to ensure that our implementation provides consistent and accurate values.
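These examples are plain doctests, along the lines of the sketch below (a hypothetical helper and docstring, not one taken from the library):

```python
def engagement_rate(labels) -> float:
    """Return the fraction of events the learner engaged with.

    Examples:
        >>> engagement_rate([True, False, True, True])
        0.75
    """
    return sum(labels) / len(labels)
```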
In addition to the doctests embedded in the docstrings, we started implementing unit tests for all the classes and methods in the truelearn package.
In the process of implementing the unit tests, we experimented with several advanced features of pytest:

- `capsys` (to capture `stdout`) and `monkeypatch` (to patch some standard libraries, allowing us to simulate exceptional situations). These fixtures allow us to implement unit tests more concisely.
- `pytest_socket` (allows us to simulate network disconnections for testing) and `pytest_cov` (allows us to write tests targeting uncovered code).

After implementing all the classifiers and merging the upstream updates of the datasets, we started exploring how to use the datasets and the existing classifiers to generate visualizations.
At the beginning of the project, we had some simple ideas for visualization:
Through our later literature review, user studies, and discussions with the client, we have added some ideas:
To implement so many different types of visualisations, we define the following classes:
- `BasePlotter`: defines the API that each type of plot needs to implement (the following is the public API):
  - `plot`: plot something to the figure
  - `show`: show the image in a newly opened window
- `LinePlotter`, `PiePlotter`, `RosePlotter`, `BarPlotter`, and `WordPlotter`…

To use these plotters, the user only needs to pass the `Knowledge` in the `LearnerModel` to the `plot` function.
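A minimal sketch of this plotter hierarchy (the bodies are simplified placeholders rather than the real drawing code):

```python
from abc import ABC, abstractmethod


class BasePlotter(ABC):
    """Public API every plotter must provide."""

    @abstractmethod
    def plot(self, content) -> "BasePlotter":
        """Draw the given content (e.g. the learner's Knowledge) onto the figure."""

    @abstractmethod
    def show(self) -> None:
        """Open the resulting figure in a new window."""


class LinePlotter(BasePlotter):
    def __init__(self) -> None:
        self._series = []

    def plot(self, content) -> "LinePlotter":
        self._series.append(content)
        return self

    def show(self) -> None:
        print(f"Rendering a line chart with {len(self._series)} series...")
```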
In #25, #34, #38 we optimized the workflow of our CI:
- Replaced `requirements.txt` and `setup.py` with `pyproject.toml`.

In addition to the above improvements, we have introduced two new GitHub Actions in #44 and #45:
Also, to ensure our library supports Python 3.7+ and is cross-platform (Mac, Linux, Windows), we let the unit-test CI run truelearn on each of these three systems with five Python versions (3.7, 3.8, 3.9, 3.10, 3.11).
This change allows us to quickly find and fix code that is not cross-platform or not compatible with Python 3.7+.
In #41, we added constraint-based checking to truelearn. This method is inspired by scikit-learn, which defines various constraints and uses `validate_params` to ensure that all parameters of the `__init__` method satisfy the specified constraints.
Before #41, we only used simple type-based checks for parameters, which led us to leave some complex checks to classifier implementers, such as checking that the value of `draw_proba_type` must be either static or dynamic. This left the various checks scattered across different classifiers, making the code more difficult to manage.
Also, type-based checks do not fit well with `set_params`. When we audited the code, we found that in many classifiers we had incorrect assumptions about the values of the variables: we assumed that the classifier’s variables would not be changed to an illegal value after initialization. However, this conflicts with the functionality provided by our `set_params`. `set_params` is the API we provide for modifying classifier parameters, and it only does type-based checking (which makes illegal values possible) to enforce the “correctness” of the classifiers. To fix this, we would need to do a value-based check whenever a variable is used, which is inefficient. With more powerful tools for describing constraints, we can validate the parameters in `set_params`, since the user should only modify the internal state of the classifier through `set_params`.
After #41, developers can use three ways to define constraints:
By using the above architecture, we can solve the two main problems described above and make our library easier to extend.
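A rough sketch of the idea (constraint classes and names here are made up for illustration and are not truelearn's actual implementation):

```python
class TypeConstraint:
    """The parameter must be an instance of one of the given types."""

    def __init__(self, *types):
        self.types = types

    def satisfies(self, value) -> bool:
        return isinstance(value, self.types)


class ValueConstraint:
    """The parameter must take one of the allowed values."""

    def __init__(self, *values):
        self.values = values

    def satisfies(self, value) -> bool:
        return value in self.values


PARAM_CONSTRAINTS = {
    "threshold": [TypeConstraint(float)],
    "draw_proba_type": [TypeConstraint(str), ValueConstraint("static", "dynamic")],
}


def validate_params(params: dict) -> None:
    # Shared by __init__ and set_params, so illegal values are rejected in one
    # place instead of being re-checked in every classifier.
    for name, value in params.items():
        for constraint in PARAM_CONSTRAINTS.get(name, []):
            if not constraint.satisfies(value):
                raise ValueError(f"Invalid value {value!r} for parameter {name!r}.")
```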
13 March 2023
Before officially delivering the product, we did one last major refactoring. This refactoring is based on the suggestions from the client’s code review.
In #46, #48, we mainly did the following:
For users to understand how to use our sub-packages together, we provide many examples, including but not limited to:
- `wikify`: how to use the wikifier to convert text into Wikipedia topics.
- `truelearn_experiments`: a simple version that mimics the experiments in the TrueLearn paper.

We finally merged #31 into the main branch. In this change, we completely switched to `readthedocs` and made a lot of improvements. Some highlights are:

- Links to the `examples` in the repository.

You can view our documentation here.
We are also pleased to present some of our visualizations:
We’re ready to release TrueLearn 1.0.0!
Thank you for reading our blog!