Entities (Nodes)

Complete reference for all entity types and their attributes in ResearchArcade.

Overview

Entities are the fundamental building blocks of ResearchArcade's knowledge graph. Each entity type represents a distinct concept in academic research, from papers and authors to reviews and decisions. For every entity type, this page lists each attribute with its data type, whether it is required, and a short description.

Note: Entities are stored across both ArXiv and OpenReview datasets. See Schema Details for information on schema organization.

Entity Categories

Content Entities: papers, sections, paragraphs, figures, and tables

People Entities: authors and reviewers

Review Entities: reviews, rebuttals, decisions, and revisions

Paper

The core entity representing an academic paper from ArXiv or OpenReview. Contains metadata, abstract, and links to all paper components.

Attributes

| Attribute | Type | Required | Description |
|---|---|---|---|
| paper_id | string | Yes | Unique identifier (primary key) |
| title | string | Yes | Paper title |
| abstract | text | Yes | Paper abstract |
| publish_date | date | No | Publication or submission date |
| source | string | Yes | Data source (arxiv or openreview) |
| arxiv_id | string | No | ArXiv identifier (e.g., 2301.12345) |
| doi | string | No | Digital Object Identifier |
| categories | array | No | ArXiv categories (e.g., cs.AI, cs.LG) |
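
As an illustration of how the required and optional attributes above might be modeled in client code, here is a minimal sketch using a Python dataclass. The class name `Paper`, the constant `ALLOWED_SOURCES`, and the sample values are assumptions made for this example; they are not part of a documented ResearchArcade API.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import List, Optional

# The two values listed for the `source` attribute
ALLOWED_SOURCES = {"arxiv", "openreview"}


@dataclass
class Paper:
    # Required attributes
    paper_id: str
    title: str
    abstract: str
    source: str
    # Optional attributes default to None or an empty list
    publish_date: Optional[date] = None
    arxiv_id: Optional[str] = None
    doi: Optional[str] = None
    categories: List[str] = field(default_factory=list)

    def __post_init__(self) -> None:
        # Reject empty values for required attributes
        for name in ("paper_id", "title", "abstract", "source"):
            if not getattr(self, name):
                raise ValueError(f"missing required attribute: {name}")
        if self.source not in ALLOWED_SOURCES:
            raise ValueError(f"unknown source: {self.source!r}")


paper = Paper(
    paper_id="p-001",  # hypothetical identifier
    title="An Example Paper",
    abstract="A placeholder abstract.",
    source="arxiv",
    arxiv_id="2301.12345",
    categories=["cs.AI", "cs.LG"],
)
```

Validating in `__post_init__` keeps the required/optional split in one place, so a record with a missing title or an unexpected `source` fails at construction time rather than later in a pipeline.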

Author

Represents an author of academic papers. May include affiliation and contact information.

Attributes

| Attribute | Type | Required | Description |
|---|---|---|---|
| author_id | string | Yes | Unique identifier (primary key) |
| name | string | Yes | Author full name |
| email | string | No | Contact email address |
| affiliation | string | No | Institutional affiliation |
| orcid | string | No | ORCID identifier |
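
Because `orcid` is an optional free-form string, it can be useful to check a value before relying on it. The sketch below validates the ORCID check character, which uses the ISO 7064 MOD 11-2 algorithm. It assumes the identifier is stored in the hyphenated form `0000-0000-0000-000X`; how ResearchArcade actually stores the value is not specified here, and the function names are hypothetical.

```python
import re

# Assumed storage format: four hyphen-separated groups of four characters
_ORCID_RE = re.compile(r"[0-9]{4}-[0-9]{4}-[0-9]{4}-[0-9]{3}[0-9X]")


def orcid_check_digit(base_digits: str) -> str:
    """Compute the ISO 7064 MOD 11-2 check character for the 15 base digits."""
    total = 0
    for ch in base_digits:
        total = (total + int(ch)) * 2
    result = (12 - total % 11) % 11
    return "X" if result == 10 else str(result)


def is_valid_orcid(orcid: str) -> bool:
    """Return True if the format matches and the check character is correct."""
    if not _ORCID_RE.fullmatch(orcid):
        return False
    compact = orcid.replace("-", "")
    return orcid_check_digit(compact[:15]) == compact[15]


print(is_valid_orcid("0000-0002-1825-0097"))
print(is_valid_orcid("0000-0002-1825-0098"))
```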

Section

Hierarchical sections within papers (e.g., Introduction, Methods, Results). Supports nested structure.

Attributes

| Attribute | Type | Required | Description |
|---|---|---|---|
| section_id | string | Yes | Unique identifier (primary key) |
| paper_id | string | Yes | Reference to parent paper |
| title | string | Yes | Section heading |
| section_type | string | No | Type (e.g., introduction, methods, conclusion) |
| depth | integer | No | Nesting level in hierarchy |
| position | integer | No | Order within parent |
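
The attribute list has no explicit reference to a parent section, so one way to recover the nested structure is to walk the sections in document order and use `depth` to decide where each one attaches. The following is a minimal sketch under two assumptions: the rows are already in document order, and top-level sections share the smallest `depth` value (0 in the sample). `SectionNode` and `build_section_tree` are hypothetical names.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class SectionNode:
    section_id: str
    title: str
    depth: int
    children: List["SectionNode"] = field(default_factory=list)


def build_section_tree(rows: List[dict]) -> List[SectionNode]:
    """Nest sections by `depth`; `rows` must already be in document order."""
    roots: List[SectionNode] = []
    stack: List[SectionNode] = []  # path from a root to the most recent section
    for row in rows:
        node = SectionNode(row["section_id"], row["title"], row.get("depth") or 0)
        # Pop until the top of the stack is strictly shallower than the new node
        while stack and stack[-1].depth >= node.depth:
            stack.pop()
        if stack:
            stack[-1].children.append(node)
        else:
            roots.append(node)
        stack.append(node)
    return roots


# Sample rows (hypothetical data)
rows = [
    {"section_id": "s1", "title": "Introduction", "depth": 0},
    {"section_id": "s2", "title": "Methods", "depth": 0},
    {"section_id": "s3", "title": "Data", "depth": 1},
    {"section_id": "s4", "title": "Training", "depth": 1},
    {"section_id": "s5", "title": "Results", "depth": 0},
]
tree = build_section_tree(rows)
```

In this sample, "Data" and "Training" end up as children of "Methods", while "Introduction" and "Results" remain top-level sections with no children.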

Paragraph

Text content at the paragraph level within sections. Enables fine-grained text analysis and citation tracking.

Attributes

| Attribute | Type | Required | Description |
|---|---|---|---|
| paragraph_id | string | Yes | Unique identifier (primary key) |
| section_id | string | Yes | Reference to parent section |
| text | text | Yes | Paragraph text content |
| position | integer | No | Order within section |
| word_count | integer | No | Number of words |
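
To show how `position` and `word_count` might be used together, the sketch below reassembles a section's text from its paragraphs and recomputes a word count. Counting whitespace-separated tokens is an assumed definition; the source does not say how `word_count` is actually computed. The helper names and sample records are hypothetical.

```python
from typing import List


def word_count(text: str) -> int:
    """Count whitespace-separated tokens (an assumed definition)."""
    return len(text.split())


def section_text(paragraphs: List[dict]) -> str:
    """Join paragraph texts in `position` order; missing positions sort last."""
    ordered = sorted(
        paragraphs,
        key=lambda p: (p.get("position") is None, p.get("position") or 0),
    )
    return "\n\n".join(p["text"] for p in ordered)


# Sample paragraph records (hypothetical data), deliberately out of order
paragraphs = [
    {"paragraph_id": "par-2", "section_id": "s1",
     "text": "Second paragraph here.", "position": 2},
    {"paragraph_id": "par-1", "section_id": "s1",
     "text": "First paragraph.", "position": 1},
]

full_text = section_text(paragraphs)
total_words = sum(word_count(p["text"]) for p in paragraphs)
```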

Figure

Images, diagrams, and visualizations within papers.

Attributes

| Attribute | Type | Required | Description |
|---|---|---|---|
| figure_id | string | Yes | Unique identifier (primary key) |
| paper_id | string | Yes | Reference to parent paper |
| caption | text | No | Figure caption |
| file_path | string | No | Path to image file |
| position | integer | No | Order within paper |

Table

Tabular data and structured information within papers.

Attributes

| Attribute | Type | Required | Description |
|---|---|---|---|
| table_id | string | Yes | Unique identifier (primary key) |
| paper_id | string | Yes | Reference to parent paper |
| caption | text | No | Table caption |
| content | json | No | Structured table data |
| position | integer | No | Order within paper |

Review

Peer reviews from OpenReview, including ratings, confidence scores, and review text.

Attributes

| Attribute | Type | Required | Description |
|---|---|---|---|
| review_id | string | Yes | Unique identifier (primary key) |
| paper_id | string | Yes | Reference to reviewed paper |
| reviewer_id | string | No | Reference to reviewer (may be anonymous) |
| rating | integer | No | Numerical rating |
| confidence | integer | No | Reviewer confidence level |
| review_text | text | No | Review content |
| review_date | date | No | When review was submitted |
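
Since `rating` is optional, any aggregation over reviews has to tolerate missing values. The sketch below computes a mean rating per paper and skips reviews without a rating. It assumes reviews are available as plain dictionaries keyed by the attribute names above; `mean_rating_by_paper` and the sample data are hypothetical, and the rating scale is not specified in this reference.

```python
from collections import defaultdict
from typing import Dict, List


def mean_rating_by_paper(reviews: List[dict]) -> Dict[str, float]:
    """Average the non-missing ratings for each paper_id."""
    ratings = defaultdict(list)
    for review in reviews:
        rating = review.get("rating")
        if rating is not None:  # skip reviews with no numerical rating
            ratings[review["paper_id"]].append(rating)
    return {pid: sum(vals) / len(vals) for pid, vals in ratings.items()}


# Sample reviews (hypothetical data)
reviews = [
    {"review_id": "r1", "paper_id": "p1", "rating": 6, "confidence": 4},
    {"review_id": "r2", "paper_id": "p1", "rating": 8, "confidence": 3},
    {"review_id": "r3", "paper_id": "p1", "rating": None},
    {"review_id": "r4", "paper_id": "p2", "rating": None},
]
means = mean_rating_by_paper(reviews)
```

A paper whose reviews all lack a rating (`p2` in the sample) is simply absent from the result, which avoids reporting a misleading average of zero.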

Decision

Editorial decisions on paper submissions (accept, reject, etc.).

Attributes

| Attribute | Type | Required | Description |
|---|---|---|---|
| decision_id | string | Yes | Unique identifier (primary key) |
| paper_id | string | Yes | Reference to paper |
| decision_type | string | Yes | Type (accept, reject, etc.) |
| decision_date | date | No | When decision was made |
| comments | text | No | Additional comments |

Rebuttal

Author responses to reviews during the peer review process.

Attributes

| Attribute | Type | Required | Description |
|---|---|---|---|
| rebuttal_id | string | Yes | Unique identifier (primary key) |
| review_id | string | Yes | Reference to review being addressed |
| content | text | Yes | Rebuttal text |
| rebuttal_date | date | No | When rebuttal was submitted |
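
A rebuttal references a review rather than a paper, so finding all rebuttals for a paper takes one extra hop through `review_id`. The following sketch shows that lookup over in-memory dictionaries; `rebuttals_by_paper` and the sample records are hypothetical and only illustrate the relationship implied by the attributes above.

```python
from typing import Dict, List


def rebuttals_by_paper(reviews: List[dict], rebuttals: List[dict]) -> Dict[str, List[str]]:
    """Group rebuttal_ids under the paper_id of the review they address."""
    review_to_paper = {r["review_id"]: r["paper_id"] for r in reviews}
    grouped: Dict[str, List[str]] = {}
    for rebuttal in rebuttals:
        paper_id = review_to_paper.get(rebuttal["review_id"])
        if paper_id is None:
            continue  # rebuttal points at a review we do not have
        grouped.setdefault(paper_id, []).append(rebuttal["rebuttal_id"])
    return grouped


# Sample records (hypothetical data)
sample_reviews = [
    {"review_id": "r1", "paper_id": "p1"},
    {"review_id": "r2", "paper_id": "p1"},
    {"review_id": "r3", "paper_id": "p2"},
]
sample_rebuttals = [
    {"rebuttal_id": "b1", "review_id": "r1", "content": "Response to r1."},
    {"rebuttal_id": "b2", "review_id": "r3", "content": "Response to r3."},
    {"rebuttal_id": "b3", "review_id": "r9", "content": "Orphaned response."},
]
grouped = rebuttals_by_paper(sample_reviews, sample_rebuttals)
```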

Revision

Different versions of papers as they are updated and revised.

Attributes

| Attribute | Type | Required | Description |
|---|---|---|---|
| revision_id | string | Yes | Unique identifier (primary key) |
| paper_id | string | Yes | Reference to paper |
| version | integer | Yes | Version number |
| revision_date | date | No | When revision was made |
| changes | text | No | Description of changes |
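
Because `version` is a required integer, the most recent revision of each paper can be selected without relying on the optional `revision_date`. Here is a minimal sketch, assuming higher version numbers mean later revisions; `latest_revisions` and the sample records are hypothetical.

```python
from typing import Dict, List


def latest_revisions(revisions: List[dict]) -> Dict[str, dict]:
    """Return the highest-version revision record for each paper_id."""
    latest: Dict[str, dict] = {}
    for rev in revisions:
        current = latest.get(rev["paper_id"])
        if current is None or rev["version"] > current["version"]:
            latest[rev["paper_id"]] = rev
    return latest


# Sample revisions (hypothetical data)
revisions = [
    {"revision_id": "v1", "paper_id": "p1", "version": 1},
    {"revision_id": "v2", "paper_id": "p1", "version": 3},
    {"revision_id": "v3", "paper_id": "p1", "version": 2},
    {"revision_id": "v4", "paper_id": "p2", "version": 1},
]
latest = latest_revisions(revisions)
```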

Venue

Publication venues such as conferences and journals.

Attributes

| Attribute | Type | Required | Description |
|---|---|---|---|
| venue_id | string | Yes | Unique identifier (primary key) |
| name | string | Yes | Full venue name |
| abbreviation | string | No | Short name (e.g., ICLR, NeurIPS) |
| venue_type | string | No | Type (conference, journal, workshop) |
| year | integer | No | Year of venue instance |