    This is a time-limited position for an Astrophysicist in the NASA Astrophysics Data System (ADS) Project, High Energy Astrophysics Division of the Smithsonian Astrophysical Observatory. The ADS serves a corpus of more than 15M scientific articles with more than 130M citations to a worldwide community of astronomers, researchers, and librarians, handling hundreds of thousands of requests per day.

    The primary responsibility of the employee is to act as a technical advisor and provide related support by performing statistical analysis and preparing datasets of enriched/tagged full-text articles to be used as training and validation material for machine/deep learning models created by ADS developers. This work will enable the ADS to create new services that will impact the astronomy and astrophysics community. The enriched datasets will be publicly shared to engage a broader audience in creating better models, fueling research in other domains such as computer science or bibliometrics.

    Duties may include, but are not limited to:

    • Function as part of a team to develop and execute statistical analyses of the ADS corpus and external data such as the Unified Astronomy Thesaurus and/or the AAS Facilities keywords
    • Manually label, tag, and classify documents, paragraphs and words, as well as enrich the ADS data holdings following certain directives
    • Prepare training and validation datasets for machine/deep learning models to perform tasks such as paper classification, sentiment analysis, and/or named entity recognition
    • Document all the steps involved in creating datasets and participate in the organization of public machine / deep learning challenges
    • Perform other duties as may be required

    Occasional travel to scientific conferences, observing facilities, and/or technical collaboration meetings may be required.

