Expert Identification

Expert Identification workflow consists of two steps. In the first step, the user extracts concepts from an abstract. The user can review and modify the list of extracted concepts and then feed it into the actual expert identification workflow.

Todo

Check out the API LAB notebook Expert Identification with the Dimensions API - An Introduction , for more detailed examples of how to use this API feature.

Step 1: Concept Extraction

Extracting concepts is implemented using the extract_concepts function:

    extract_concepts("document abstract")
    ----------------------------------------
    extract_concepts(<phrase>)

This query will return a list of extracted concepts, ordered by weight, in descending order.

Step 2: Expert Identification

Extracted concepts, from step one, can be used in identify experts queries, for example:

    identify experts from concepts "+malaria OR \"effective malaria vaccine\" OR \"effective prevention\""
        using publications
        where research_org_countries is not empty
            and year >= 2013
        return experts[basics]
        limit 20 skip 0
    --------------------------------------------------------------------------------------------------------------------------------------------
    identify experts from concepts [<list of extracted concepts>] using <source> <where-filter>? return experts<[fields]> <paging>?

Experts identification can be executed on publications or grants sources. using publications may be omitted and publications source is used by default

It is possible to specify a where clause, but it is not required. Fields available for filtering are exactly the same as they are using standard search expression.

Similarly, Paginating Results is optional. It can be used to retrieve more reviewer candidates than twenty by default.

Returned experts are ordered by their relevance.

Annotating results

Annotating results with organizational and/or coauthorship conflicts will produce another object for each identified expert, with keys organizational and coauthorship where these will contain a boolean value if an organizational conflict with provided researchers is found, or a number if a coauthorship conflict is found.

Organizational conflict is defined as a boolean value that is true if the expert and the researchers from the query have the same current research organization.

Coauthorship conflict is defined as the number a of documents the expert has coauthored with any of the researchers provided in the query, in the last three years.

    identify experts from concepts "+malaria OR \"effective malaria vaccine\" OR \"effective prevention\""
        using publications
        where research_org_countries is not empty
            and year >= 2013
        return experts[basics]
        limit 20 skip 0
        annotate organizational, coauthorship overlap
            with ["ur.016204724721.35", "ur.012127355561.32"]
    --------------------------------------------------------------------------------------------------------------------------------------------
    identify experts from concepts [<list of extracted concepts>] using <source> <where-filter>? return experts<[fields]> <paging>? <annotate?>

Restricting the set of experts returned

It is possible to restrict the set of experts returned to a subset of researchers of interest, simply by specifying a where clause and a list of researcher IDs. For example:

  identify experts from concepts "+malaria OR \"effective malaria vaccine\" OR \"effective prevention\""
      using publications
          where researchers in ["ur.01225135650.70", "ur.07447331037.97"]
          and year >= 2013
      return experts[basics]
          limit 20 skip 0

Note

The where researchers restriction determines which publications to use for the expert identification. As a result, the number of experts returned may be greater than the number of researchers provided, as all the authors of those publications will be considered.