Let's see how many judgments we have per unit

Let's remove the units that have only one judgment

data = data[~data['_unit_id'].isin(a)] len(data)

Basic aggregation

Quantitative variables

If we are also doing a per-worker analysis, we can compute values from the worker

Categorical variables

Now we can't do the following because the following is a categorical variable:

Let's explore what is this column and decide what to do

The majority vote of an array is simply the mode

How is the variable distributed?

Let's compute the majority voting

Sometimes this returns two values, let's get the first in that case (better way would be random)

Weighted measures

Weighted mean

Weighted majority voting

Now we need, for each unit, to find the category with the highest trust score

Creating a summary table

Free text

Now we analyse the case in which we have free text

We can't use the weighted majority voting here! We need first to assign a score to this values.

Exercise