Callback: Components Models

Emmanuel Duguet

2025-03-03

Difference estimators

In standard experiments, discrimination can be measured directly by the difference in callback proportions. When there is more than one source of discrimination, differences in differences may be used. Components models can be interpreted as a generalization of all these difference estimators. They can be used to test restrictions on the discrimination coefficients, and they provide an optimal estimator when such a restriction holds. Consider a simple gender discrimination example. Let \(p_m\) be the callback probability of men and \(p_f\) the callback probability of women. They can be written:

\[\begin{align*} p_m &= p_0\\ p_f &= p_0 + \delta_g \end{align*}\]

where \(p_0\) is the benchmark probability and \(\delta_g\) the gender discrimination coefficient. It is negative when there is discrimination. By convention, the candidate who does not suffer from discrimination is the benchmark candidate, since his callback rate measures how the labor market should work. This is why we took the male candidate as the benchmark candidate: he should not suffer from gender discrimination. The probability of the female candidate, \(p_f\), is the sum of the benchmark probability (\(p_0\)) and of the gender discrimination coefficient (\(\delta_g\)). Discrimination occurs whenever \(\delta_g<0\Leftrightarrow p_f<p_m\). In this simple case, the discrimination coefficient is obtained by the difference estimator:

\[\begin{align*} p_f - p_m &= \delta_g \end{align*}\]

Now, consider a more complicated case: gender and origin. The probabilities are now written \(p_{ij}\), where \(i\) is gender and \(j\) is origin. Discrimination can take place in three ways: gender (\(\delta_g\)), origin (\(\delta_o\)) and the combination of the two (\(\delta_{go}\)), often called intersectionality. Let \(\ell\) designate the local origin candidate and \(f\) the foreign origin candidate. There are four candidates, with probabilities:

\[\begin{align*} p_{m\ell}&=p_0\\ p_{mf}&=p_0+\delta_o\\ p_{f\ell}&=p_0+\delta_g\\ p_{ff}&=p_0+\delta_o+\delta_g+\delta_{go} \end{align*}\]

The local origin male candidate (\(m\ell\)) is the benchmark (callback probability \(p_0\)) because we do not expect this candidate to be discriminated against because of his gender or origin. The male foreign candidate (\(mf\)) could be discriminated against because of his origin (\(\delta_o\)) but not because of his gender. The local origin female candidate (\(f\ell\)) could be discriminated against because of her gender (\(\delta_g\)) but not because of her origin. And the last candidate (\(ff\)) could be discriminated against both because of her gender and because of her origin. The last term, \(\delta_{go}\), measures intersectionality. If \(\delta_{go}=0\), the two discrimination sources simply add up: this is the additive case. If \(\delta_{go}<0\), the two sources lead to a stronger discrimination than the sum of the gender and origin discriminations: this is the superadditive case. Last, when \(\delta_{go}>0\), having both characteristics leads to a smaller discrimination than the sum: this is the subadditive case. The solution of the previous system is:

\[\begin{align*} \delta_o&=p_{mf}-p_{m\ell}\\ \delta_g&=p_{f\ell}-p_{m\ell}\\ \delta_{go}&=p_{ff}-p_{mf}-(p_{f\ell}-p_{m\ell}) \end{align*}\]
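
As a quick numerical illustration, these formulas can be computed directly in R (the callback rates below are hypothetical, chosen only for the example):

p_ml <- 0.12   # local origin man (benchmark)
p_mf <- 0.09   # foreign origin man
p_fl <- 0.10   # local origin woman
p_ff <- 0.05   # foreign origin woman

p_mf - p_ml                     # delta_o:  -0.03
p_fl - p_ml                     # delta_g:  -0.02
(p_ff - p_mf) - (p_fl - p_ml)   # delta_go: -0.02 (superadditive)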

The two discrimination coefficients are still measured by differences, but the interaction term requires a difference in differences estimation. More complex situations can arise. The previous model is said to be just identified because there is only one way to get from the callback probabilities to the discrimination coefficients. Sometimes, there are parameter restrictions, like \(\delta_{go}=0\), and the system becomes overidentified. In this situation, there are several ways to retrieve the discrimination coefficients from the callback probabilities. Consider the case \(\delta_{go}=0\). One can estimate, for example, \(\delta_o\) in two ways:

\[\begin{align*} \delta_o&=p_{mf}-p_{m\ell}\\ \delta_o&=p_{ff}-p_{f\ell}=p_0+\delta_o+\delta_g-(p_0+\delta_g) \end{align*}\]

In such a case, one should use an optimal estimator that exploits the redundancy implied by the constraint. Intuitively, it should provide some weighted average of the available definitions.
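
For intuition, suppose (with invented numbers) that the two definitions of \(\delta_o\) yield different sample estimates with different variances; a precision-weighted average combines the information (the covariance between the two estimates is ignored here for simplicity):

d1 <- -0.034; v1 <- 0.0010   # delta_o estimated from the male candidates, with its variance
d2 <- -0.027; v2 <- 0.0016   # delta_o estimated from the female candidates, with its variance
w1 <- (1/v1)/(1/v1 + 1/v2)   # precision weight of d1
w2 <- (1/v2)/(1/v1 + 1/v2)   # precision weight of d2
w1*d1 + w2*d2                # weighted average: about -0.031

The optimal estimator presented in the next section generalizes this idea to any overidentified system.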

Components models

Components models are useful when there is more than one source of discrimination. We consider that the discrimination coefficients \(\delta=(\delta_o,\delta_g,\delta_{go})\) are the parameters of interest, since their estimation is the goal of the analysis, and that the callback proportions are the auxiliary parameters, since they are the inputs of the estimation method. The constraints between the parameters of interest and the auxiliary parameters can be written:

\[\begin{align*} \left( \begin{array}{c} p_{mf}-p_{m\ell}\\ p_{f\ell}-p_{m\ell}\\ p_{ff}-p_{m\ell} \end{array}\right) &= \left( \begin{array}{ccc} 1&0&0\\ 0&1&0\\ 1&1&1 \end{array}\right) \left( \begin{array}{c} \delta_o\\ \delta_g\\ \delta_{go} \end{array}\right) \Leftrightarrow \pi = A \delta \end{align*}\]

where \(\pi\) is the vector of the auxiliary parameters (here, the callback rate differences). The matrix \(A\) is called the Boolean matrix because it simply indicates the presence or absence of a component in the callback rate differences. If \(A\) is square and invertible, the system is just identified. Here, its determinant equals 1, so this is the case. Now, consider the constraint \(\delta_{go}=0\): the Boolean matrix is not square anymore:

\[\begin{align*} \left( \begin{array}{c} p_{mf}-p_{m\ell}\\ p_{f\ell}-p_{m\ell}\\ p_{ff}-p_{m\ell} \end{array}\right) &= \left( \begin{array}{cc} 1&0\\ 0&1\\ 1&1 \end{array}\right) \left( \begin{array}{c} \delta_o\\ \delta_g \end{array}\right) \Leftrightarrow \pi = A \delta \end{align*}\]

and we should look at the column rank of the Boolean matrix. The two columns are linearly independent, so the model is overidentified.
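
These identification checks are easy to reproduce in base R (a minimal sketch, using the two Boolean matrices written above):

# just identified case: square Boolean matrix
A1 <- matrix(c(1,0,0,
               0,1,0,
               1,1,1), nrow = 3, byrow = TRUE)
det(A1)       # 1: invertible, just identified

# overidentified case (delta_go = 0): rectangular Boolean matrix
A2 <- matrix(c(1,0,
               0,1,
               1,1), nrow = 3, byrow = TRUE)
qr(A2)$rank   # 2 = ncol(A2): full column rank, overidentified

In the general case, we get the following linear relationship: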

\[\begin{align*} \pi = A \delta \end{align*}\]

with \(A\) of full column rank. This will lead us to a minimum distance estimator (Asymptotic Least Squares). In practice, we do not observe \(\pi\) but its estimate from the correspondence test, denoted \(\hat{\pi}\), so that there is an error term, denoted \(\omega\), defined as:

\[\begin{align*} \hat{\pi}=\pi+\omega\Leftrightarrow \hat{\pi}=A\delta +\omega \end{align*}\]

with \(\hat{\Omega}=\hat{\mathbb{V}}(\omega)=\hat{\mathbb{V}}(\hat{\pi})\), computable directly from the callback data. The Ordinary Least Squares estimator \(\hat{b}\) gives the difference estimators when the system is just identified:

\[\begin{align*} \hat{b}=(A^\prime A)^{-1}A^\prime \hat{\pi} \end{align*}\]

and the optimal estimator \(b^\ast\) should be used when the system is overidentified:

\[\begin{align*} b^\ast&=(A^\prime\hat{\Omega}^{-1} A)^{-1}A^\prime \hat{\Omega}^{-1} \hat{\pi}. \end{align*}\]

We also have the property that \(\hat{b}=b^\ast\) when the system is just identified.
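
Both estimators are one-liners in base R. A minimal sketch for the overidentified case, with hypothetical values of \(\hat{\pi}\) and \(\hat{\Omega}\) (in a real application, they come from the callback data):

# inputs (hypothetical, for illustration)
A         <- matrix(c(1,0,
                      0,1,
                      1,1), nrow = 3, byrow = TRUE)
pi_hat    <- c(-0.031, -0.022, -0.048)       # estimated callback rate differences
Omega_hat <- diag(c(0.0010, 0.0012, 0.0015)) # estimated variance of pi_hat

# OLS estimator: (A'A)^{-1} A' pi_hat
b_hat <- solve(t(A) %*% A, t(A) %*% pi_hat)

# optimal estimator: (A' Omega^{-1} A)^{-1} A' Omega^{-1} pi_hat
W <- solve(Omega_hat)
b_star <- solve(t(A) %*% W %*% A, t(A) %*% W %*% pi_hat)

When \(A\) is square and invertible, both formulas reduce to solve(A, pi_hat), which is why \(\hat{b}=b^\ast\) in the just identified case.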

Application

Standard case

We use the data from an experiment about mobility, for the profession of management controller in the Paris area. The candidates can be women or men, and have either no driving license or both the car and motorcycle licenses. The data set is:

library(callback)
m <- mobility1
str(m)
#> 'data.frame':    1200 obs. of  12 variables:
#>  $ offer    : Factor w/ 300 levels "1","2","3","4",..: 1 1 1 1 2 2 2 2 3 3 ...
#>  $ firstn   : Factor w/ 4 levels "Audrey","Christophe",..: 4 3 1 2 3 4 1 2 3 4 ...
#>  $ lastn    : Factor w/ 4 levels "Bonnet","Michel",..: 2 3 1 4 3 2 1 4 3 2 ...
#>  $ licenses : Factor w/ 2 levels "No","Yes": 2 2 1 1 2 2 1 1 2 2 ...
#>  $ cv       : Factor w/ 2 levels "A","B": 1 1 1 1 1 1 1 1 1 1 ...
#>  $ sentorder: int  4 2 1 3 2 4 1 3 2 4 ...
#>  $ ansorder : int  1 5 5 5 5 5 5 5 5 5 ...
#>  $ callback : logi  TRUE FALSE FALSE FALSE FALSE FALSE ...
#>  $ gender   : Factor w/ 2 levels "Man","Woman": 1 2 2 1 2 1 2 1 2 1 ...
#>  $ paris    : Factor w/ 2 levels "No","Yes": 2 2 2 2 1 1 1 1 1 1 ...
#>  $ cont     : Factor w/ 2 levels "LTC","STC": 1 1 1 1 1 1 1 1 1 1 ...
#>  $ date     : Factor w/ 6 levels "December 2008",..: 6 6 6 6 6 6 6 6 6 6 ...

and the raw callback rates are:

cb <- callback(data = m, cluster = "offer", candid = c("gender","licenses"), callback = "callback")
r <- stat_raw(cb)
print(r)
#> 
#>  Proportions: raw callback rates 
#>  Confidence intervals: Student at 95 %
#>  
#>           tests callback inf_p_callback p_callback sup_p_callback
#> Man.No      300       28     0.06022661 0.09333333     0.12644006
#> Man.Yes     300       30     0.06585751 0.10000000     0.13414249
#> Woman.No    300       37     0.08591095 0.12333333     0.16075572
#> Woman.Yes   300       21     0.04096215 0.07000000     0.09903785
plot(r)

We see that women without a driving license were preferred to the other candidates. In order to investigate this issue, we write the following components model:

\[\begin{align*} p_{m1}&=p_0\\ p_{m0}&=p_0+\delta_\ell\\ p_{f0}&=p_0+\delta_g\\ p_{f1}&=p_0+\delta_\ell+\delta_g+\delta_{g\ell} \end{align*}\]

where \(p_0\) is the benchmark probability, \(\delta_\ell\) the license component (a penalty if negative), \(\delta_g\) the gender component and \(\delta_{g\ell}\) the intersectionality parameter, attached here to the woman holding both licenses.

Consider the model on the probability differences:

\[\begin{align*} \left( \begin{array}{c} p_{m0}-p_{m1}\\ p_{f1}-p_{m1}\\ p_{f0}-p_{m1} \end{array}\right) = \left( \begin{array}{ccc} 1&0&0\\ 1&1&1\\ 0&1&0 \end{array}\right) \left( \begin{array}{c} \delta_\ell\\ \delta_g\\ \delta_{g\ell} \end{array}\right) \end{align*}\]

Before creating the components model, we check the reference levels of our factors:

levels(m$gender)
#> [1] "Man"   "Woman"
levels(m$licenses)
#> [1] "No"  "Yes"

so that the reference candidate is the man with no license. This won’t work because our reference candidate is the man with both licenses. We have to make the callback object again after fixing this:

m2 <- m
m2$licenses <- relevel(m2$licenses,ref="Yes")
levels(m2$licenses)
#> [1] "Yes" "No"

Now, we can create the components model, an object of class callback_comp. The first equation is omitted since it only defines the benchmark candidate, and \(p_0\) should not be indicated since it is present in all the equations. Starting with the second equation:

model <- list( c("licenses"),                   # Man.No
               c("licenses","gender","inter"),  # Woman.Yes
               c("gender"))                     # Woman.No
  
cpm <- callback_comp(data = m2, cluster = "offer", 
                   candid = c("gender","licenses"), 
                 callback = "callback", 
                    model = model)

and the model is checked by printing the object:

print(cpm)
#> 
#> 
#> Your components model:
#> ----------------------
#>  
#> Man.Yes = reference
#> Man.No = reference + licenses
#> Woman.Yes = reference + licenses + gender + inter
#> Woman.No = reference + gender
#>  
#> 
#> Boolean matrix (A):
#> -------------------
#>  
#>           reference licenses gender inter
#> Man.Yes           1        0      0     0
#> Man.No            1        1      0     0
#> Woman.Yes         1        1      1     1
#> Woman.No          1        0      1     0
#>  
#> Determinant of A'A = 1, the model is just identified.

Since the model is just identified, we can proceed to the estimation:

estim <- reg(cpm)
print(estim)
#> 
#>  Components:
#>  
#>    reference     licenses       gender        inter 
#>  0.100000000 -0.006666667  0.023333333 -0.046666667 
#> 
#>  Use summary() for the regression table.

The reference candidate had a 10% callback rate; the license component would have a negative effect and gender a positive one. But we need to know which effects are significant:

summary(estim)
#> 
#>  Component model estimation:
#>  
#>               Estimate Std. Error    t value     Pr(>|t|)    
#> reference  0.100000000 0.01734945  5.7638722 8.220569e-09 ***
#> licenses  -0.006666667 0.01888375 -0.3530372 7.240606e-01    
#> gender     0.023333333 0.01792984  1.3013691 1.931322e-01    
#> inter     -0.046666667 0.02615166 -1.7844629 7.434848e-02 .  
#> 
#>  Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.10 ' ' 1
#> 
#>  Model just identified.
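
Since the model is just identified, these estimates can also be recovered by hand from the raw callback rates printed by stat_raw() above, by solving \(\hat{\pi}=A\hat{\delta}\) in base R (a sketch; the counts come from the table in the previous section):

# raw callback rates from the table above
p <- c(Man.Yes = 30/300, Man.No = 28/300,
       Woman.Yes = 21/300, Woman.No = 37/300)

# differences with the reference candidate (Man.Yes)
pi_hat <- p[c("Man.No","Woman.Yes","Woman.No")] - p["Man.Yes"]

# Boolean matrix without the reference row and column
A <- matrix(c(1,0,0,
              1,1,1,
              0,1,0), nrow = 3, byrow = TRUE,
            dimnames = list(names(pi_hat), c("licenses","gender","inter")))
solve(A, pi_hat)   # licenses -0.0067, gender 0.0233, inter -0.0467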

We see that the licenses do not have a significant effect on their own, but they do in interaction with gender (\(\delta_{g\ell}<0\)). Women would be discriminated against when they hold both driving licenses. In order to complete the study, we would like to drop the terms that are not significant. In a components model, this implies regrouping the candidates.

Candidates grouping

Dropping \(\delta_\ell\) from the model implies that:

\[\begin{align*} p_{m1}=p_{m0} \end{align*}\]

so that all the male candidates should be put together. The model becomes:

\[\begin{align*} \left( \begin{array}{c} p_{f0}-p_{m}\\ p_{f1}-p_{m} \end{array}\right) &= \left( \begin{array}{cc} 1&0\\ 1&1 \end{array}\right) \left( \begin{array}{c} \delta_g\\ \delta_{g\ell} \end{array}\right) \end{align*}\]

We first redefine our data set with the new factor:

m2 <- m
m2$cand <- as.factor(ifelse(m2$gender == "Man","m",
                            ifelse(m2$licenses == "Yes","f1","f0")))
m2$cand <- relevel(m2$cand,ref = "m") 

levels(m2$cand)
#> [1] "m"  "f0" "f1"

and write the new components model:

model <- list( c("gender"),          # f0: women without a license
               c("gender","inter"))  # f1: women with both licenses
  
cpm2 <- callback_comp(data = m2, cluster = "offer", 
                   candid = "cand", 
                 callback = "callback", 
                    model = model)

summary(reg(cpm2))
#> 
#>  Component model estimation:
#>  
#>              Estimate Std. Error   t value     Pr(>|t|)    
#> reference  0.09666667 0.01424294  6.786987 1.144995e-11 ***
#> gender     0.02666667 0.01447200  1.842638 6.538184e-02 .  
#> inter     -0.05333333 0.01802595 -2.958697 3.089426e-03 ** 
#> 
#>  Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.10 ' ' 1
#> 
#>  Model just identified.

All the components are significant at the 10% level. While women experience an advantage over men in this job (+2.7%), they more than lose it when they hold both the motorcycle and car driving licenses (-5.3%).
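
As a hand check, these grouped estimates follow directly from the raw callback counts reported earlier:

p_m  <- (30 + 28)/600   # men pooled: 0.0967 (reference)
p_f0 <- 37/300          # women without a license: 0.1233
p_f1 <- 21/300          # women with both licenses: 0.0700

p_f0 - p_m    # gender:  0.0267
p_f1 - p_f0   # inter:  -0.0533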

References

Duguet E., du Parquet L., L’Horty Y., Petit P. (2018) “Counterproductive hiring discrimination against women: Evidence from a French correspondence test”, International Journal of Manpower, Vol. 39 Issue: 1, pp.37-50, https://doi.org/10.1108/IJM-01-2016-0004

Duguet E., Le Gall R., L’Horty Y., Petit P. (2018) “How does labour market history influence the access to hiring interviews?”, International Journal of Manpower, Vol. 39 Issue:4, pp.519-533, https://doi.org/10.1108/IJM-09-2017-0231