Overview
Entities are the fundamental building blocks of ResearchArcade's knowledge graph. Each entity type represents a distinct concept in academic research, from papers and authors to reviews and revisions. This page provides a comprehensive reference for all entity types, their attributes, and data types.
Note: Entities are stored across both the ArXiv and OpenReview datasets. See Schema Details for information on schema organization.
arxiv.papers
Academic papers from ArXiv with comprehensive metadata including version tracking and submission dates.
Attributes
| Attribute | Type | Required | Description |
| --- | --- | --- | --- |
| id | integer | Yes | Auto-generated unique identifier (primary key) |
| arxiv_id | string | Yes | Full ArXiv identifier with version (e.g., 1706.03762v7) |
| base_arxiv_id | string | Yes | Base ArXiv identifier without version (e.g., 1706.03762) |
| version | integer | Yes | Version number of the paper |
| title | string | Yes | Paper title |
| abstract | text | No | Paper abstract |
| submit_date | date | No | Publication or submission date |
| metadata | json | No | Additional metadata in JSON format |
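The arxiv_id, base_arxiv_id, and version attributes are related: the full identifier is the base identifier followed by `v` and the version number. As a minimal sketch of that relationship, the hypothetical helper below (`split_arxiv_id` is not part of ResearchArcade) splits a versioned identifier into the other two fields. It assumes the identifier always ends in a `v<number>` suffix.

```python
import re
from typing import NamedTuple

# Assumption: a versioned identifier is "<base>v<digits>", as in 1706.03762v7.
_VERSIONED_ID = re.compile(r"^(?P<base>.+?)v(?P<version>\d+)$")


class ArxivIdParts(NamedTuple):
    base_arxiv_id: str
    version: int


def split_arxiv_id(arxiv_id: str) -> ArxivIdParts:
    """Split a full ArXiv identifier into its base identifier and version."""
    match = _VERSIONED_ID.match(arxiv_id)
    if match is None:
        raise ValueError(f"no version suffix in {arxiv_id!r}")
    return ArxivIdParts(match.group("base"), int(match.group("version")))


parts = split_arxiv_id("1706.03762v7")
print(parts.base_arxiv_id, parts.version)  # 1706.03762 7
```

Identifiers without a version suffix raise `ValueError` rather than defaulting to a version, since the table marks version as required.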
arxiv.authors
Authors of ArXiv papers with Semantic Scholar integration for persistent identification.
Attributes
| Attribute | Type | Required | Description |
| --- | --- | --- | --- |
| id | integer | Yes | Auto-generated unique identifier (primary key) |
| semantic_scholar_id | string | Yes | Semantic Scholar author identifier (unique key) |
| name | string | Yes | Author full name |
| homepage | string | No | Author's homepage URL from Semantic Scholar |
arxiv.categories
ArXiv subject categories (e.g., cs.AI, cs.LG, math.CO) for classifying papers.
Attributes
| Attribute | Type | Required | Description |
| --- | --- | --- | --- |
| id | integer | Yes | Auto-generated unique identifier (primary key) |
| name | string | Yes | Category name (e.g., cs.AI, cs.LG) |
| description | text | No | Category description |
arxiv.sections
Hierarchical sections within ArXiv papers (e.g., Introduction, Methods, Results) with full text content.
Attributes
| Attribute | Type | Required | Description |
| --- | --- | --- | --- |
| id | integer | Yes | Auto-generated unique identifier (primary key) |
| content | text | Yes | Full text content of the section |
| title | string | Yes | Section heading/title |
| appendix | boolean | Yes | Whether this section is part of the appendix |
| paper_arxiv_id | string | Yes | Reference to parent paper (foreign key) |
| section_in_paper_id | integer | No | Sequential order within the paper |
arxiv.paragraphs
Individual paragraphs within ArXiv paper sections, enabling fine-grained text analysis and citation tracking.
Attributes
| Attribute | Type | Required | Description |
| --- | --- | --- | --- |
| id | integer | Yes | Auto-generated unique identifier (primary key) |
| section_id | integer | Yes | Reference to parent section (foreign key) |
| content | text | Yes | Paragraph text content |
| paragraph_in_section_id | integer | No | Sequential order within the section |
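Sections and paragraphs form a two-level hierarchy: paragraphs point to a section through section_id, and sections point to a paper through paper_arxiv_id. The sketch below shows one way to rebuild a paper's reading order from flat rows. The dataclasses mirror the attribute tables, but `assemble_paper` and `_position_key` are hypothetical helpers, and sorting rows with a missing position last is an assumption (both ordering attributes are optional).

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Paragraph:
    id: int
    section_id: int  # foreign key to Section.id
    content: str
    paragraph_in_section_id: Optional[int] = None


@dataclass
class Section:
    id: int
    content: str
    title: str
    appendix: bool
    paper_arxiv_id: str  # foreign key to the parent paper
    section_in_paper_id: Optional[int] = None
    paragraphs: List[Paragraph] = field(default_factory=list)


def _position_key(position: Optional[int], row_id: int):
    # Assumption: rows without a position sort last; ties fall back to id.
    return (position is None, position or 0, row_id)


def assemble_paper(paper_arxiv_id: str, sections: List[Section],
                   paragraphs: List[Paragraph]) -> List[Section]:
    """Return one paper's sections in order, each with ordered paragraphs."""
    own = [s for s in sections if s.paper_arxiv_id == paper_arxiv_id]
    by_id = {s.id: s for s in own}
    for s in own:
        s.paragraphs = []
    for p in paragraphs:
        if p.section_id in by_id:
            by_id[p.section_id].paragraphs.append(p)
    for s in own:
        s.paragraphs.sort(
            key=lambda p: _position_key(p.paragraph_in_section_id, p.id))
    return sorted(own, key=lambda s: _position_key(s.section_in_paper_id, s.id))


# Placeholder rows for illustration only.
sections = [
    Section(2, "Methods text.", "Methods", False, "1706.03762v7", 2),
    Section(1, "Intro text.", "Introduction", False, "1706.03762v7", 1),
]
paragraphs = [
    Paragraph(11, 1, "Second paragraph.", 2),
    Paragraph(10, 1, "First paragraph.", 1),
    Paragraph(12, 2, "Method details.", 1),
]
ordered = assemble_paper("1706.03762v7", sections, paragraphs)
print([s.title for s in ordered])  # ['Introduction', 'Methods']
```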
arxiv.figures
Images, diagrams, and visualizations within ArXiv papers with captions and labels.
Attributes
| Attribute | Type | Required | Description |
| --- | --- | --- | --- |
| id | integer | Yes | Auto-generated unique identifier (primary key) |
| paper_arxiv_id | string | Yes | Reference to parent paper (foreign key) |
| path | string | Yes | File path to the figure image |
| caption | text | No | Figure caption text |
| label | string | No | LaTeX label for cross-referencing (e.g., fig:architecture) |
| name | string | No | Figure identifier or name |
arxiv.tables
Tabular data and structured information within ArXiv papers.
Attributes
| Attribute | Type | Required | Description |
| --- | --- | --- | --- |
| id | integer | Yes | Auto-generated unique identifier (primary key) |
| paper_arxiv_id | string | Yes | Reference to parent paper (foreign key) |
| path | string | No | File path to table source |
| caption | text | No | Table caption text |
| label | string | No | LaTeX label for cross-referencing (e.g., tab:results) |
| table_text | text | No | Table content in text format |
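Figure and table rows both carry an optional LaTeX label (fig:architecture, tab:results), which makes it possible to resolve a cross-reference found in a paper's text. The following is a rough sketch, assuming rows arrive as plain dictionaries keyed by attribute name and that a label is unique within one paper. `build_label_index` is a hypothetical helper, and the paths and captions are placeholders.

```python
from typing import Dict, Iterable, Tuple

LabelKey = Tuple[str, str]  # (paper_arxiv_id, label)


def build_label_index(rows: Iterable[dict]) -> Dict[LabelKey, dict]:
    """Index figure or table rows by paper and LaTeX label."""
    index: Dict[LabelKey, dict] = {}
    for row in rows:
        label = row.get("label")
        if label:  # label is optional, so unlabeled rows are skipped
            # Assumption: the first row wins if a label repeats in a paper.
            index.setdefault((row["paper_arxiv_id"], label), row)
    return index


figures = [
    {"id": 1, "paper_arxiv_id": "1706.03762v7", "path": "figs/a.png",
     "caption": "Placeholder caption.", "label": "fig:architecture"},
    {"id": 2, "paper_arxiv_id": "1706.03762v7", "path": "figs/b.png",
     "caption": None, "label": None},
]
tables = [
    {"id": 1, "paper_arxiv_id": "1706.03762v7", "path": None,
     "caption": "Placeholder caption.", "label": "tab:results",
     "table_text": "..."},
]
labels = build_label_index(figures + tables)
print(labels[("1706.03762v7", "fig:architecture")]["path"])  # figs/a.png
```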
openreview.papers
Academic papers submitted to OpenReview venues with review decisions and PDF links.
Attributes
| Attribute | Type | Required | Description |
| --- | --- | --- | --- |
| venue | string | Yes | Conference or venue identifier (e.g., ICLR.cc/2024) |
| paper_openreview_id | string | Yes | Unique OpenReview paper identifier (composite key with venue) |
| title | string | Yes | Paper title |
| abstract | text | Yes | Paper abstract |
| paper_decision | string | Yes | Editorial decision (Accept, Reject, etc.) |
| paper_pdf_link | string | Yes | URL to paper PDF |
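Unlike the ArXiv entities, openreview.papers has no auto-generated id. A row is identified by the pair (venue, paper_openreview_id). A small sketch of what that means for lookups, assuming rows are plain dictionaries; `PaperKey` and `index_papers` are hypothetical names, and the identifiers, titles, and the second venue are made-up sample values.

```python
from typing import Dict, Iterable, NamedTuple


class PaperKey(NamedTuple):
    venue: str
    paper_openreview_id: str


def index_papers(rows: Iterable[dict]) -> Dict[PaperKey, dict]:
    """Index OpenReview paper rows by their composite key."""
    index: Dict[PaperKey, dict] = {}
    for row in rows:
        key = PaperKey(row["venue"], row["paper_openreview_id"])
        if key in index:
            raise ValueError(f"duplicate composite key: {key}")
        index[key] = row
    return index


rows = [
    {"venue": "ICLR.cc/2024", "paper_openreview_id": "paperA",
     "title": "Example A"},
    # Same paper identifier under a different venue is a different row.
    {"venue": "ICLR.cc/2023", "paper_openreview_id": "paperA",
     "title": "Example B"},
]
papers = index_papers(rows)
print(papers[PaperKey("ICLR.cc/2024", "paperA")]["title"])  # Example A
```

Looking a paper up by paper_openreview_id alone is ambiguous under this scheme, so both parts of the key should travel together.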
openreview.authors
Authors of OpenReview papers with affiliation and contact information.
Attributes
| Attribute | Type | Required | Description |
| --- | --- | --- | --- |
| venue | string | Yes | Conference or venue identifier |
| author_openreview_id | string | Yes | Unique OpenReview author identifier (composite key with venue) |
| author_full_name | string | Yes | Author full name |
| email | string | Yes | Contact email address |
| affiliation | string | Yes | Institutional affiliation |
| homepage | string | Yes | Personal homepage URL |
| dblp | string | Yes | DBLP profile identifier |
openreview.paragraphs
Individual paragraphs extracted from OpenReview papers, organized by section for detailed content analysis.
Attributes
| Attribute | Type | Required | Description |
| --- | --- | --- | --- |
| venue | string | Yes | Conference or venue identifier |
| paper_openreview_id | string | Yes | Reference to parent paper (foreign key) |
| paragraph_idx | integer | Yes | Sequential index of paragraph (composite key with venue and paper_openreview_id) |
| section | string | Yes | Section name where paragraph appears |
| content | text | Yes | Paragraph text content |
openreview.reviews
Peer reviews and comments from OpenReview with structured content including ratings, strengths, and weaknesses.
Attributes
| Attribute | Type | Required | Description |
| --- | --- | --- | --- |
| venue | string | Yes | Conference or venue identifier |
| review_openreview_id | string | Yes | Unique review identifier (composite key with venue) |
| replyto_openreview_id | string | Yes | ID of the paper or comment being replied to (foreign key) |
| writer | string | Yes | Reviewer identifier (may be anonymous) |
| title | string | Yes | Review title |
| content | json | Yes | Structured review content (ratings, summary, strengths, weaknesses, questions) |
| time | string | Yes | Timestamp when review was submitted |
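Because replyto_openreview_id can point either at a paper or at another review or comment, review rows form a discussion tree rooted at the paper. The sketch below walks that tree depth-first. It assumes rows are plain dictionaries and that replies stay within one venue; `build_reply_index` and `walk_thread` are hypothetical helpers, and the identifiers and writer names are made-up sample values.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple

ParentKey = Tuple[str, str]  # (venue, replyto_openreview_id)


def build_reply_index(reviews: Iterable[dict]) -> Dict[ParentKey, List[dict]]:
    """Group review rows by the item they reply to."""
    children: Dict[ParentKey, List[dict]] = defaultdict(list)
    for review in reviews:
        key = (review["venue"], review["replyto_openreview_id"])
        children[key].append(review)
    return children


def walk_thread(children: Dict[ParentKey, List[dict]], venue: str,
                root_id: str, depth: int = 0) -> Iterator[Tuple[int, dict]]:
    """Yield (depth, review) pairs below root_id in depth-first order."""
    for review in children.get((venue, root_id), []):
        yield depth, review
        yield from walk_thread(children, venue,
                               review["review_openreview_id"], depth + 1)


reviews = [
    {"venue": "ICLR.cc/2024", "review_openreview_id": "rev1",
     "replyto_openreview_id": "paperA", "writer": "Reviewer_1"},
    {"venue": "ICLR.cc/2024", "review_openreview_id": "com1",
     "replyto_openreview_id": "rev1", "writer": "Authors"},
    {"venue": "ICLR.cc/2024", "review_openreview_id": "rev2",
     "replyto_openreview_id": "paperA", "writer": "Reviewer_2"},
]
index = build_reply_index(reviews)
for depth, review in walk_thread(index, "ICLR.cc/2024", "paperA"):
    print("  " * depth + review["review_openreview_id"])
```

Top-level reviews appear at depth 0 and replies to them at depth 1, so the loop prints `rev1`, an indented `com1`, then `rev2`.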
openreview.revisions
Different versions of OpenReview papers as they are updated and revised during the review process.
Attributes
| Attribute | Type | Required | Description |
| --- | --- | --- | --- |
| venue | string | Yes | Conference or venue identifier |
| original_openreview_id | string | Yes | Reference to original paper (foreign key) |
| revision_openreview_id | string | Yes | Unique revision identifier (composite key with venue) |
| content | json | Yes | Revision content and metadata as structured data |
| time | string | Yes | Timestamp when revision was made |
openreview.arxiv
Cross-platform links connecting OpenReview papers to their corresponding ArXiv versions.
Attributes
| Attribute | Type | Required | Description |
| --- | --- | --- | --- |
| venue | string | Yes | OpenReview venue identifier |
| paper_openreview_id | string | Yes | OpenReview paper identifier (composite key with venue) |
| arxiv_id | string | Yes | Corresponding ArXiv identifier |
| title | string | Yes | Paper title for reference |
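A link row can be followed from an OpenReview paper to the matching arxiv.papers rows through arxiv_id. The page does not say whether the stored identifier includes a version suffix, so this sketch assumes either form may appear and matches against both arxiv_id and base_arxiv_id. `resolve_arxiv_versions` is a hypothetical helper, and the identifiers are made-up sample values.

```python
from typing import Iterable, List


def resolve_arxiv_versions(link: dict, arxiv_papers: Iterable[dict]) -> List[dict]:
    """Return the arxiv.papers rows a link row points to, oldest version first."""
    target = link["arxiv_id"]
    matches = [
        paper for paper in arxiv_papers
        # Assumption: the link may hold a versioned or an unversioned ID.
        if target in (paper["arxiv_id"], paper["base_arxiv_id"])
    ]
    return sorted(matches, key=lambda paper: paper["version"])


arxiv_papers = [
    {"arxiv_id": "2401.00001v2", "base_arxiv_id": "2401.00001", "version": 2},
    {"arxiv_id": "2401.00001v1", "base_arxiv_id": "2401.00001", "version": 1},
    {"arxiv_id": "2401.00002v1", "base_arxiv_id": "2401.00002", "version": 1},
]
link = {"venue": "ICLR.cc/2024", "paper_openreview_id": "paperA",
        "arxiv_id": "2401.00001", "title": "Example A"}

versions = resolve_arxiv_versions(link, arxiv_papers)
print([paper["arxiv_id"] for paper in versions])
# ['2401.00001v1', '2401.00001v2']
```

An unversioned link resolves to every stored version of the paper, while a versioned link resolves to at most one row.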