Overview
Entities are the fundamental building blocks of ResearchArcade's knowledge graph. Each entity type represents a distinct concept in academic research, from papers and authors to reviews and decisions. This page provides a comprehensive reference for all entity types, their attributes, and data types.
Note:
Entities are stored across both ArXiv and OpenReview datasets. See Schema Details for information on schema organization.
Paper
The core entity representing an academic paper from ArXiv or OpenReview. Contains metadata, abstract, and links to all paper components.
Attributes
| Attribute | Type | Required | Description |
|---|---|---|---|
| paper_id | string | Yes | Unique identifier (primary key) |
| title | string | Yes | Paper title |
| abstract | text | Yes | Paper abstract |
| publish_date | date | No | Publication or submission date |
| source | string | Yes | Data source (arxiv or openreview) |
| arxiv_id | string | No | ArXiv identifier (e.g., 2301.12345) |
| doi | string | No | Digital Object Identifier |
| categories | array | No | ArXiv categories (e.g., cs.AI, cs.LG) |
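As a rough illustration of how the required and optional attributes above might be enforced in client code, here is a minimal Python sketch. The `Paper` dataclass, the `ALLOWED_SOURCES` set, and the sample values are assumptions made for this example; they are not part of ResearchArcade's own API.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import List, Optional

# The two data sources named in the attribute table.
ALLOWED_SOURCES = {"arxiv", "openreview"}


@dataclass
class Paper:
    """Hypothetical in-memory mirror of the Paper entity."""

    # Required attributes
    paper_id: str
    title: str
    abstract: str
    source: str
    # Optional attributes
    publish_date: Optional[date] = None
    arxiv_id: Optional[str] = None
    doi: Optional[str] = None
    categories: List[str] = field(default_factory=list)

    def __post_init__(self) -> None:
        # Reject empty values for attributes marked "Required: Yes".
        for name in ("paper_id", "title", "abstract", "source"):
            if not getattr(self, name):
                raise ValueError(f"{name} is required")
        if self.source not in ALLOWED_SOURCES:
            raise ValueError(f"source must be one of {sorted(ALLOWED_SOURCES)}")


# Illustrative record; the identifier values are made up.
example_paper = Paper(
    paper_id="p-001",
    title="An Example Paper",
    abstract="A short abstract.",
    source="arxiv",
    arxiv_id="2301.12345",
    categories=["cs.AI", "cs.LG"],
)
```

Optional attributes default to `None` (or an empty list for `categories`), so a record with only the four required fields is still valid.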
Author
Represents an author of academic papers. May include affiliation and contact information.
Attributes
| Attribute | Type | Required | Description |
|---|---|---|---|
| author_id | string | Yes | Unique identifier (primary key) |
| name | string | Yes | Author full name |
| email | string | No | Contact email address |
| affiliation | string | No | Institutional affiliation |
| orcid | string | No | ORCID identifier |
Section
Hierarchical sections within papers (e.g., Introduction, Methods, Results). Supports nested structure.
Attributes
| Attribute | Type | Required | Description |
|---|---|---|---|
| section_id | string | Yes | Unique identifier (primary key) |
| paper_id | string | Yes | Reference to parent paper |
| title | string | Yes | Section heading |
| section_type | string | No | Type (e.g., introduction, methods, conclusion) |
| depth | integer | No | Nesting level in hierarchy |
| position | integer | No | Order within parent |
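The nested structure can be rebuilt from `depth` alone if sections are processed in reading order. The sketch below assumes exactly that, plus a top-level depth of 0; the schema above states neither, and the `children` field and `build_tree` helper are hypothetical names introduced for this example.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Section:
    """Hypothetical in-memory mirror of the Section entity."""

    section_id: str
    paper_id: str
    title: str
    section_type: Optional[str] = None
    depth: int = 0      # assumption: 0 means top level
    position: int = 0
    # Not a schema attribute; added here only to hold the rebuilt tree.
    children: List["Section"] = field(default_factory=list)


def build_tree(sections_in_reading_order: List[Section]) -> List[Section]:
    """Nest each section under the closest preceding shallower section."""
    roots: List[Section] = []
    stack: List[Section] = []  # chain of currently open ancestors
    for section in sections_in_reading_order:
        # Close every open section that is at the same depth or deeper.
        while stack and stack[-1].depth >= section.depth:
            stack.pop()
        if stack:
            stack[-1].children.append(section)
        else:
            roots.append(section)
        stack.append(section)
    return roots


# Illustrative data; identifiers are made up.
sections = [
    Section("s1", "p-001", "Introduction", "introduction", depth=0, position=0),
    Section("s2", "p-001", "Methods", "methods", depth=0, position=1),
    Section("s3", "p-001", "Data", depth=1, position=0),
    Section("s4", "p-001", "Model", depth=1, position=1),
    Section("s5", "p-001", "Results", depth=0, position=2),
]
roots = build_tree(sections)
```

Here `Data` and `Model` end up as children of `Methods`, while `Introduction`, `Methods`, and `Results` remain top-level.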
Paragraph
Text content at the paragraph level within sections. Enables fine-grained text analysis and citation tracking.
Attributes
| Attribute | Type | Required | Description |
|---|---|---|---|
| paragraph_id | string | Yes | Unique identifier (primary key) |
| section_id | string | Yes | Reference to parent section |
| text | text | Yes | Paragraph text content |
| position | integer | No | Order within section |
| word_count | integer | No | Number of words |
Figure
Images, diagrams, and visualizations within papers.
Attributes
| Attribute | Type | Required | Description |
|---|---|---|---|
| figure_id | string | Yes | Unique identifier (primary key) |
| paper_id | string | Yes | Reference to parent paper |
| caption | text | No | Figure caption |
| file_path | string | No | Path to image file |
| position | integer | No | Order within paper |
Table
Tabular data and structured information within papers.
Attributes
| Attribute | Type | Required | Description |
|---|---|---|---|
| table_id | string | Yes | Unique identifier (primary key) |
| paper_id | string | Yes | Reference to parent paper |
| caption | text | No | Table caption |
| content | json | No | Structured table data |
| position | integer | No | Order within paper |
Review
Peer reviews from OpenReview, including ratings, confidence scores, and review text.
Attributes
| Attribute | Type | Required | Description |
|---|---|---|---|
| review_id | string | Yes | Unique identifier (primary key) |
| paper_id | string | Yes | Reference to reviewed paper |
| reviewer_id | string | No | Reference to reviewer (may be anonymous) |
| rating | integer | No | Numerical rating |
| confidence | integer | No | Reviewer confidence level |
| review_text | text | No | Review content |
| review_date | date | No | When review was submitted |
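Because `rating` is optional, any aggregation over reviews has to tolerate missing values. The following sketch averages ratings per paper and skips reviews without one. It assumes review records are available as plain dictionaries keyed by the attribute names above; the `mean_rating_by_paper` helper and the sample data are hypothetical.

```python
from collections import defaultdict
from statistics import mean
from typing import Dict, List


def mean_rating_by_paper(reviews: List[dict]) -> Dict[str, float]:
    """Average the available ratings for each paper_id."""
    ratings = defaultdict(list)
    for review in reviews:
        rating = review.get("rating")
        if rating is not None:  # rating is not a required attribute
            ratings[review["paper_id"]].append(rating)
    # Papers whose reviews all lack a rating are left out of the result.
    return {paper_id: mean(values) for paper_id, values in ratings.items()}


# Illustrative records; identifiers and scores are made up.
sample_reviews = [
    {"review_id": "r1", "paper_id": "p-001", "rating": 6, "confidence": 4},
    {"review_id": "r2", "paper_id": "p-001", "rating": 8, "confidence": 3},
    {"review_id": "r3", "paper_id": "p-001"},  # no rating recorded
    {"review_id": "r4", "paper_id": "p-002", "rating": 5},
    {"review_id": "r5", "paper_id": "p-003"},  # no rating recorded
]
summary = mean_rating_by_paper(sample_reviews)
```

The rating scale is not specified in the table, so the result is only meaningful when compared within a single venue's scale.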
Decision
Editorial decisions on paper submissions (accept, reject, etc.).
Attributes
| Attribute | Type | Required | Description |
|---|---|---|---|
| decision_id | string | Yes | Unique identifier (primary key) |
| paper_id | string | Yes | Reference to paper |
| decision_type | string | Yes | Type (accept, reject, etc.) |
| decision_date | date | No | When decision was made |
| comments | text | No | Additional comments |
Rebuttal
Author responses to reviews during the peer review process.
Attributes
| Attribute | Type | Required | Description |
|---|---|---|---|
| rebuttal_id | string | Yes | Unique identifier (primary key) |
| review_id | string | Yes | Reference to review being addressed |
| content | text | Yes | Rebuttal text |
| rebuttal_date | date | No | When rebuttal was submitted |
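A rebuttal references a review rather than a paper, so collecting every rebuttal for a paper takes two steps: find the paper's reviews, then match rebuttals on `review_id`. The sketch below shows this with plain dictionaries; the `rebuttals_for_paper` helper and the sample records are assumptions made for illustration.

```python
from typing import List


def rebuttals_for_paper(
    paper_id: str, reviews: List[dict], rebuttals: List[dict]
) -> List[dict]:
    """Return rebuttals whose review belongs to the given paper."""
    # Step 1: collect the review IDs attached to this paper.
    review_ids = {r["review_id"] for r in reviews if r["paper_id"] == paper_id}
    # Step 2: keep the rebuttals that address one of those reviews.
    return [rb for rb in rebuttals if rb["review_id"] in review_ids]


# Illustrative records; identifiers and text are made up.
reviews = [
    {"review_id": "r1", "paper_id": "p-001"},
    {"review_id": "r2", "paper_id": "p-001"},
    {"review_id": "r3", "paper_id": "p-002"},
]
rebuttals = [
    {"rebuttal_id": "b1", "review_id": "r1", "content": "We added the ablation."},
    {"rebuttal_id": "b2", "review_id": "r3", "content": "We clarified Section 3."},
    {"rebuttal_id": "b3", "review_id": "r2", "content": "We fixed the notation."},
]
matched = rebuttals_for_paper("p-001", reviews, rebuttals)
```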
Revision
Different versions of papers as they are updated and revised.
Attributes
| Attribute | Type | Required | Description |
|---|---|---|---|
| revision_id | string | Yes | Unique identifier (primary key) |
| paper_id | string | Yes | Reference to paper |
| version | integer | Yes | Version number |
| revision_date | date | No | When revision was made |
| changes | text | No | Description of changes |
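Since `version` is a required integer, the most recent revision of each paper can be selected without relying on the optional `revision_date`. This is a minimal sketch that assumes higher version numbers are newer, which the table implies but does not state; the `latest_revisions` helper and the sample records are hypothetical.

```python
from typing import Dict, List


def latest_revisions(revisions: List[dict]) -> Dict[str, dict]:
    """Map each paper_id to its revision with the highest version number."""
    latest: Dict[str, dict] = {}
    for revision in revisions:
        current = latest.get(revision["paper_id"])
        if current is None or revision["version"] > current["version"]:
            latest[revision["paper_id"]] = revision
    return latest


# Illustrative records; identifiers are made up.
revisions = [
    {"revision_id": "v1", "paper_id": "p-001", "version": 1},
    {"revision_id": "v3", "paper_id": "p-001", "version": 3},
    {"revision_id": "v2", "paper_id": "p-001", "version": 2},
    {"revision_id": "w1", "paper_id": "p-002", "version": 1},
]
newest = latest_revisions(revisions)
```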
Venue
Publication venues such as conferences and journals.
Attributes
| Attribute | Type | Required | Description |
|---|---|---|---|
| venue_id | string | Yes | Unique identifier (primary key) |
| name | string | Yes | Full venue name |
| abbreviation | string | No | Short name (e.g., ICLR, NeurIPS) |
| venue_type | string | No | Type (conference, journal, workshop) |
| year | integer | No | Year of venue instance |