Help pages for GEMDE: Grid Enabled ethnic Minority Data Environment
Quick links:
- Entering the system as a 'guest'
- Entering the system as a registered user
- Using the browse facility
- Using the search facility
- Our 'most useful things' when working with ethnicity
GEMDE is a resource for supporting quantitative data broadly related to the analysis of ethnicity for social science research (spanning topics such as ethnic itentity, nationality and national origins, religion, immigration, language). It boils down to a means of providing access to organised, easy to use statistical resources on the topic. We refer to these resources as either 'MUGs' or 'MIRs' (see our FAQs).
We are trying to make GEMDE easy to use, but there are lots of features to the service which are likely to be new to most people. This page gives detailed instructions including screenshots illustrating features of the GEMDE online 'portal' environment where the data resources are made available: the GEMDE portal (needs a login).
The portal iself does have some help notices and instructions for users within its pages, but you will probably benefit from using these webpages in combination to make sense of the portal. We are also preparing pdf instruction guides (to appear under our GEMDE references list).
The portal should work smoothly for most mainstream internet browsers, but we have only tested it extensively for Internet Explorer (v8 or above) and Firefox. Please let us know if you experience difficulties with the portal and we will address them if we can.




GEMDE PORTAL SERVICES
If you haven't already read them, it is probably helpful to first read our notes on the background to GEMDE and our higher-level description of using GEMDE
.This note provides more extended descriptions of the following GEMDE services:
- Entering GEMDE: Guest login instructions
- Entering GEMDE: Login for registered users
- Using the browse facility
- Using the search facility
- Submitting data (1): Initial submission of data on a MUG or MIR
- Submitting data (2): Further editing and metadata provision
- Entering and using user-ratings
- Using the 'live data analysis' facility
We're still developing GEMDE. Please contact us with feedback/comments on the service and its usability.

Guest login instructions
You can login to the GEMDE portal as a guest. This will allow you to search and browse across, and subsequently download data from, the portal resources. As a guest, however, you won't be able to access the functionality which covers uploading files to GEMDE or making other contributions to the service. For many users, 'guest' level access will be all that you need from the system.
To login as a guest, you do still need to go through an authentication process as if you were a named user. This can seem a little cumbersome, but at present it is the only mechanism available to us. So, to login as a guest:
- 1. Go to the GEOMDE portal login site: https://dames.nesc.gla.ac.uk/web/guest/gemde
- 2. The link should take you automatically to the Shibboleth authentication page for the 'National e-Science centre'
- No further action is normally needed, however in some circumstances your browser could remember previous authentications via Shibboleth; it is sometimes necessary to close down all your browsers then reopen them to get a 'fresh slate' here.
- 3. Enter the following (guest level) username and password:
- User: dames
- Pass: dames
- 4. Done! After a temporary screen providing update details on the process, you ought to be logged into the portal with guest level access rights.

Login for registered users
If you login to the GEMDE portal as a named user, there are several functions available to you which are not available at the 'guest' level. These are to upload new records and data resources; to edit and revise existing resources; and to provide comments and user feedback on resources through the 'expert ratings' option.
To login as a named user, you first have to contact the GEMDE project group to request a personal account. Then, you need to go through a Shibboleth authentication process . At the time of writing (June 2010) we have a transitory arrangement in place whereby you need to manually request a personal account by email, then register as part of the 'National e-Science Centre' (i.e. rather than your own institution). In the longer term we do plan to use institutional login accounts, to make the process simpler. So, for the time being, to login as a named user:
- 1. Request a personalised account for GEMDE by email to John Watt j.watt@nesc.gla.ac.uk
- 2. Go to the GEOMDE portal login site: https://dames.nesc.gla.ac.uk/web/guest/gemde
- 3. The link should take you automatically to the Shibboleth authentication page for the 'National e-Science centre'
- No further action is normally needed, however in some circumstances your browser could remember previous authentications via Shibboleth; it is sometimes necessary to close down all your browsers then reopen them to get a 'fresh slate' here.
- 4. Enter the appropriate personalised username and password:
- User: yourusername
- Pass: yourpassword
- In the long term, we will use your standard institutional identity for this login, but in the short term you require a personalised account entered via the National e-Science Centre
- 5. Done! After a temporary screen providing update details on the process, you ought to be logged into the portal with registered user level access rights.
Here are some graphics illustrating the steps involved:
![]() |
|
| (a) Click on the link to the portal | (b) Entering details for guest level access |
![]() |
|
| (c) Entering named account details | (d) Image of the portal when logged in as a registered user |

Using the browse facility
'Browsing' through GEMDE refers to searching through resources registered at GEMDE in a structured way. The browse interface works in a fairly conventional way on the GEMDE system. When you go to the browse page, you will be presented with lists of categories and subcategories, which you can click accordingly to take you to further information on relevant resources.
There are a couple of important points to note however. These can often cause confusion the first time you use the Browse system:
- The browse trees for MUGs (ethnic Minority Unit Groups) and MIRs (ethnic Minority Information Resources) are separate. You are either browsing one or the other set of resources, but not both
- Remember, a MUG (Minority Unit Group) is a systematic listing of different categories from a measure or definition related to ethnicity
- A MIR (ethnic Minority Information Resource) is a data file containing some relevant information about the categories of a MUG (e.g. a table of statistics about categories.
- You need to look across MUGs and MIRs separately; sometimes the resource you're looking for might be available as a MIR but not as a MUG, or vice-versa.
- The default placement within the Browse service is at the last known results (rather than the front page). Sometimes you may forget which search you are undertaking (of MUGs or MIRs) for this reason - remember to look at the surrounding descriptive text to understand the results. The radio buttons above and below the Browse form can also be used to to move around the tree
- Not all resources at GEMDE have been sufficiently classified in a way which allows the browse service to detect them . Resources that can be found on Browse are those for which the depositor of the information explicitly added in a categorisation. It is possible that some other resources on GEMDE apply to the relevant definition (e.g. country or ethnic referent), even though the browse functionality doesn't identify them. If you don't find the relevant resource in the Browse system, it is often worth trying a free text search via the 'search' porlet in addition.

Using the search facility
You can search for resources in GEMDE using free-text entries into the 'search' tab. By and large, the search engine works in the standard ways (described below). As with browsing, however, it is important to understand that two different types of resources can be searched (i.e. MIRs - ethnic Minority Information Resources; and MUGs - ethnic Minority Unit Groups).
- You can either search across MIRs or across MUGs, but not both at the same time
- To search across minority information resources (MIRs), click on 'Find Resources'
- To search across minority unit groups(MUGs), click on 'Find MUGs'
- Search results cover the metadata linked to a resource (i.e. all the information provided about the resource, such as its title, abstract, etc).
- ..but search results do not cover the actual content of files uploaded as MUGs and MIRs
- Search results can be sorted in order of user and expert ratings concerning the quality of the resource (see below under user ratings)
- Features of the search engine behaviour:
- The default search is an exact match: results are returned if the exact phrase features somewhere in the metadata for the relevant information resource (the MIR or the MUG). This result is regardless of surrounding text.
E.g. searching for 'land' returns results featuring England, Scotland, etc. - Search entries are not case sensitive.
E.g., searching for 'England' and for 'england' returns the same results. - Searching on multiple words generates hits with records featuring all of the words at least once anywhere in the record's metadata
- Searching on multiple words enclosed in double quotes generates exact matches to the enclosed phrase
- Wildcard expressions can be included after a word (but not at the start of a word) by using the * symbol.
E.g., entering 'Pakist*' would generate any matches on phrases including pakistani, Pakistan, etc.


Initial submission of data on a MUG or MIR
If you're a registered user, you can enter data to the GEMDE system. See access instructions to arrange user registration if required.
Please do consider adding your own data into the GEMDE system. The GEMDE portal is intended to be a collaborative, research oriented resource to support the sharing and distribution of data related to the analysis of ethnicity in the social science research community. If you have generated data - e.g. a definition of ethnic categories; a recode of an ethnicity measure; or some relevant aggregate statistics about ethnic groups - then GEMDE would be a good place to deposit it and distribute it for the wider benefit of the research community.
Don't be hesitant!The forms for initially submitting data to GEMDE are intended to be short and fairly self-explanatory, but in any case you can't do much too damage if you initially enter something wrong:
- As you fill out the form the entry is saved into the system
- Anything you've previously entered, you can return to later and edit/update/delete, via the 'edit data resource' option
- There is also an 'expert group' of editors on GEMDE who periodically review submitted resources, and will make minor edits/adjustments/clarifications to resources submitted if appropriate
- The expert group are the current investigators of the DAMES project (see personnel). The edits we might make would usually be very minor (e.g. correcting spelling errors; deleting accidentally submitted duplicate copies). If we do something more substantial, we let the depositor know
The images below try to illustrate the steps involved in depositing data at GEMDE
| Follow the 'deposit' link to submit a data resource on the GEMDE portal | |
| As an illustrative example, these images show us uploading data transcribed from a research paper by Khattab (2009). This data features both a MUG and a MIR. The MUG us a list of ethno-religious groups, supplied as a MS Word file. The MIR is a table of statistics about those grous, supplied as an MS Excel file | ![]() |
| To upload the MUG fill out the form... | |
| ...taking care to attach the data file... | |
| ...before clicking on 'upload MUG'. | |
| Likewise to upload the MIR fill out the form and attach the file, then click upload... | |
| Note that as soon as the resources are uploaded, they are visible to search terms submitted to other users... | |
| ..but we strongly encourage you to edit/update the submissions, for instance in order to nominate the categories which will be picked up under 'browse' searches | |


Further editing and metadata provision
It is very valuable to us if after completing an initial submission of a resource, you go back to it under the 'edit data resource' tab and add further information about it.
- It is particularly useful if you can indicate what country the resource is from, and if relevant what time period and ethnic referents it related to (because these are used by the 'browse' function).
- When editing data there are also opportunities to add further details or clarification to a resource, or correct small errors
- You can use the 'manage files' option to upload additional files, or delete (and overwrite) a previously uploaded file. In general, for minor ammendments (e.g. correcting spelling errors in uploaded files) it is suitable to delete and replace the relevant data; for major changes it is preferable to upload the new data as a different resource (e.g. a second edition). In any case, the system will keep a time stamp on your resource
- Note also that members of the GEMDE project also periodically review submitted resources, and will sometimes make ammendments to records if it is approriate.


Entering and using user-ratings
Tools for discerning the quality of data resources are obviously valuable in a resource provision such as GEMDE. We are deliberately 'pluralistic' in accepting most resources into the system without substantial quality checks. We do however have 'expert' and 'user' ratings systems available to help navigate according to opinions on quality.
- Whenever you find a resource on GEMDE, we strongly encourage you to fill out a user-rating (and/or submit a comment) about the quality of the resource. The links for doing this are visible on the same pages as the results are returned from a 'search' or 'browse' operation.
- When you locate resources in the system, it should be possible to see associated average user ratings and expert ratings for the resources, and to retrieve any comments that were submitted about them.
- You also have the option of sorting results according to average user ratings, which can be helpful if you have many results available to you.


Using the 'live data analysis' facility
The GEMDE portal includes a facility for running queries on selected micro-data files with relevant information on ethnicity. The files have been stored securely on the GEMDE server and the facility doesn't allow you to access the data or to run custom analyses on them. However, it is possible to submit information requests about relevant distributional features of the sample.
The examples apply to the UK since the 1970's onwards, and analyses focus upon important correlates of ethnic groups - age, gender and socio-economic measures such as education. We put this resource up because we consider it is very important for researchers to have a basic idea of correlations between ethnicity and these factors, and how they change by time and the age of a sample.
As a guest or registered user, you can follow the link to 'microdata' and identify the query you would like to run, according to your specification from a choice of a few parameters about the data you're interested. The output from the query will be a graph depicting the data, and it is also possible for you to access he log file from the analysis, including the statistics, which was used to generate the graph (in the R language). The current queries avaialable (we hope to add more) cover:
- Distribution of UK population by ethnic group, according to the data from Li and Heath (2008), for a variable period including 1972-2005, and variable age range.
- Distribution of average age per ethnic group and gender in the UK, according to the data from Li and Heath (2008), for a variable period including 1972-2005.
- Distribution of regression model coefficients (often called 'ethnic penalties') for ethnic groups for a model of socio-economic attainment in the UK, according to the data from Li and Heath (2008), for a variable period including 1972-2005, and variable age range.


WORKING WITH DATA ON ETHNICITY: MOST USEFUL THINGS
A consistent form of feedback we've recieved during the development of GEMDE is that most users would simply like a fairly firm steer on what they should be doing with measures of ethnicity in a particular context. This often isn't what GEMDE offers - the GEMDE portal services concentrate upon allowing researchers to find out about all sorts of different approaches that might be taken with data on the topic, in a rather pluralistic way. After spending some time working in the topic domain, though, there are certain principles or resources we would tend to highlight to people as particular useful to know about or as good practice to follow. So, with apologies for introducing another daft acronym, here our our 'MUTs':
- Simplifying categories: Although most of us do it intuitively, it is rarely ideal to collapse together different ethnic categories just because they are sparsely represented. If you do do this, try to use the same collapse as has been done in another cited reference. Alternatively, consider a device like effect proportional scaling in a dimension of difference as a feasible alternative to grouping many categories together. There are resources on both previous collapsed measures, and effect proportional scaling, deposited as 'MIRs' within GEMDE
- Comparative research: There have already been some efforts to think systematically about comparing measures of ethnicity across time or between countries. In the UK, Bosveld et al (2006) is a good staring point. For cross-national research, a paper by Lambert(2005) is available via on the GEMDE portal.
- Multivariate analysis: Consider age differences! In many countries, there are substantial differences in age profiles between ethnic groups (due to immigration cohort differences) so socio-economic analysis of ethnic differences need to be careful to fully control for age differences in order that they don't conflate age or life-course stage effects with ethnicity effects.
- Statistical modelling: Consider interaction terms! A lot of the time, the substantive analysis of interest is actually an interaction between an ethnicity difference and some other factor (e.g. the hypothesis is that the effects of education operate differently for different ethnic groups, so an interaction term could naturally test for this).
- Statistical modelling: Since most measures of ethnicity have quite skewed distributions, it is valuable to use a 'quasi-variance' style approach to reporting parameter estimates for different ethnic group effects, because of the 'reference category problem' (for explanation see our own QV webpages, which include some illustrative worked examples).

Last updated 18/DEC/2010, by Paul Lambert
















