Dataset | Train | Dev | Test |
---|---|---|---|
Arabic Entries | 45200 | 6400 | 6410 |
Arabic English mapped dictionary | 2843 | 299 | 1213 |
{
"id":"ar.45",
"word":"عين",
"gloss":"عضو الإبصار في ...",
"pos":"n",
"electra":[0.4, 0.3, …],
"sgns":[0.2, 0.5, …],
"enId": "en.150"
}
The value associated with the "id" key gives the language and the unique identifier of this example.
The value associated with the "gloss" key is a definition, as you would find in a classical
dictionary. It is to be used as the source in the RD task.
The value associated with the "enId" key is the identifier of the mapped entry in the English dictionary.
The embedding keys ("sgns", "electra") hold arrays of floats representing the components of
each embedding. Any of them can serve as the target for the RD task.
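As a minimal sketch of how these fields fit together for the RD task, the snippet below pairs each gloss (the source) with one embedding (the target). It assumes the entries are available as a JSON array of objects shaped like the example above. The `SAMPLE` string, the two-component vectors, and the helper name `rd_pairs` are illustrative and not part of the dataset.

```python
import json

# Hypothetical in-memory sample; real embeddings have many more components.
SAMPLE = """[
  {"id": "ar.45", "word": "عين", "gloss": "عضو الإبصار في ...", "pos": "n",
   "electra": [0.4, 0.3], "sgns": [0.2, 0.5], "enId": "en.150"}
]"""

EMBEDDING_KEYS = ("sgns", "electra")


def rd_pairs(entries, target="sgns"):
    """Return (gloss, embedding) pairs: gloss is the source, embedding the target."""
    if target not in EMBEDDING_KEYS:
        raise ValueError(f"unknown embedding key: {target}")
    return [(entry["gloss"], entry[target]) for entry in entries]


entries = json.loads(SAMPLE)
pairs = rd_pairs(entries, target="electra")
```

Switching `target` between "sgns" and "electra" selects which embedding a model would be trained to predict.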
{
"id":"ar.45",
"arword":"عين",
"argloss":"عضو الإبصار في ...",
"arpos":"n",
"electra":[0.4, 0.3, …],
"sgns":[0.2, 0.5, …],
"enId":"en.150",
"word":"eye",
"gloss":"One of the two ...",
"pos":"n"
}
The value associated with the "id" key is the unique identifier of the entry in the Arabic
dictionary.
The values associated with the "argloss" and "gloss" keys are the Arabic and English definitions,
as you would find in an Arabic and an English dictionary, respectively. The "gloss" is to be
used as the source in the CLRD task.
The value associated with the "enId" key is the identifier of the mapped entry in the English dictionary.
The embedding keys ("sgns", "electra") hold arrays of floats representing the components of
each embedding. Any of them can serve as the target for the CLRD task.
{
"id":"en.150",
"word":"eye",
"gloss":"One of the two ...",
"pos":"n",
"electra":[0.7, 0.1, …],
"sgns":[0.2, 0.8, …]
}
The English dictionary uses the same format as the Arabic dictionary and can be used in the
second task (CLRD).
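The "enId" key is what ties an Arabic entry to an English one. The sketch below shows one way the mapped format could be assembled from the two dictionaries by looking up "enId" in an index of English entries. It is an illustration only: the source does not describe how the mapped dictionary was actually built, and the helper names `index_by_id` and `to_mapped` and the two-component vectors are hypothetical.

```python
def index_by_id(entries):
    """Index dictionary entries by their "id" value for constant-time lookup."""
    return {entry["id"]: entry for entry in entries}


def to_mapped(ar_entry, en_index):
    """Combine an Arabic entry with its English counterpart, or return None."""
    en_entry = en_index.get(ar_entry.get("enId"))
    if en_entry is None:
        return None  # no mapped English entry for this Arabic entry
    return {
        "id": ar_entry["id"],
        "arword": ar_entry["word"],
        "argloss": ar_entry["gloss"],
        "arpos": ar_entry["pos"],
        # The embeddings kept here are the Arabic ones, as in the mapped example.
        "electra": ar_entry["electra"],
        "sgns": ar_entry["sgns"],
        "enId": ar_entry["enId"],
        "word": en_entry["word"],
        "gloss": en_entry["gloss"],
        "pos": en_entry["pos"],
    }


# Hypothetical sample entries with shortened embeddings.
ar_entry = {"id": "ar.45", "word": "عين", "gloss": "عضو الإبصار في ...", "pos": "n",
            "electra": [0.4, 0.3], "sgns": [0.2, 0.5], "enId": "en.150"}
en_entries = [{"id": "en.150", "word": "eye", "gloss": "One of the two ...", "pos": "n",
               "electra": [0.7, 0.1], "sgns": [0.2, 0.8]}]

en_index = index_by_id(en_entries)
mapped = to_mapped(ar_entry, en_index)
```

Returning `None` for an unmatched "enId" keeps the caller in control of how to treat Arabic entries that have no English mapping.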
This shared task includes 2 tracks: RD (Task 1) and CLRD (Task 2).
Task | No. epochs | Dev cosine similarity | Dev mean squared error (MSE) | Dev rank | Test cosine similarity | Test mean squared error (MSE) | Test rank |
---|---|---|---|---|---|---|---|
Task1 (RD) (Sgns) | 200 | 35.61 | 5.03 | 38.52 | 40.58 | 4.49 | 36.28 |
Task1 (RD) (Electra) | 200 | 48.84 | 24.94 | 31.27 | 50.79 | 23.04 | 31.87 |
Task2 (CLRD) (Sgns) | 300 | 26.22 | 4.92 | 50.16 | 25.21 | 4.85 | 49.95 |
Task2 (CLRD) (Electra) | 300 | 54.09 | 22.10 | 36.22 | 51.66 | 23.81 | 40.72 |
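For readers unfamiliar with two of the reported metrics, here is a minimal sketch of cosine similarity and mean squared error between a predicted and a reference embedding, using their standard textbook definitions. The source does not state how the scores in the table are scaled or aggregated, nor how the rank metric is defined, so this sketch does not attempt to reproduce those numbers. The function names are illustrative.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two vectors of equal length."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # convention chosen here for zero vectors
    return dot / (norm_u * norm_v)


def mean_squared_error(u, v):
    """Average of squared component-wise differences."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    return sum((a - b) ** 2 for a, b in zip(u, v)) / len(u)


predicted = [1.0, 2.0]
reference = [1.0, 4.0]
cos = cosine_similarity(predicted, reference)
err = mean_squared_error(predicted, reference)  # (0 + 4) / 2 = 2.0
```

Higher cosine similarity and lower MSE both indicate a prediction closer to the reference embedding.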