ScienceStudio is a research-focused IDE that bridges the gap between traditional word processing and modern AI-assisted writing. Built on VS Code's robust infrastructure, it provides researchers with semantic document editing, intelligent PDF analysis, and context-aware AI assistance while maintaining compatibility with Microsoft Word and LaTeX workflows.
- PhD Students in psychology, biology, medicine, and social sciences
- Academic Researchers writing papers, grants, and thesis documents
- Programmer-Researchers who work with both code and academic documents
- Postdocs managing multiple publications
- Industry Researchers producing technical reports
- Professors responding to reviewers and managing collaborative papers
- Rich Text Editing: ProseMirror-based editor that understands document structure
- Document Elements:
- Sections with hierarchical structure
- Citations as live objects linked to source PDFs
- Figures and tables with captions
- Track changes and comments preservation
- Style preservation (APA, MLA, Chicago, etc.)
- Semantic PDF Parsing: Extract structured information from research papers
- Automatic section detection (Abstract, Methods, Results, Discussion)
- Figure and table extraction with captions
- Citation graph building
- Claim and evidence tagging
- PDF Library Management: Organize and search through 50-100+ papers
- Context Window: Index entire literature library for AI assistance
- Location-Aware Editing: AI understands document structure and can:
- Strengthen arguments in specific sections
- Find evidence from PDF library for claims
- Suggest citations from loaded papers
- Verify citation accuracy
- Match writing style and tone
- search_memory: Semantic search across PDF library
- verify_citation: Check claims against source material
- suggest_evidence: Find supporting evidence for arguments
- append_citation: Add properly formatted citations
- plan_update: Track research progress in RESEARCH.md
- "Find evidence for this claim in my PDFs"
- "Strengthen the causal argument using Study A & B"
- "Convert all citations from APA 6 to APA 7"
- "Check if this claim matches the cited paper"
- "What methods did Smith 2023 use?"
- Perfect Round-Tripping:
- Import .docx with comments and track changes
- Export clean .docx that preserves all formatting
- Support for LaTeX import/export
- Collaboration Features:
- Maintain supervisor comments
- Track changes compatibility
- Version control integration
- Focus Mode: Hide VS Code complexity for non-programmers
- Split View: PDF reader alongside document editor
- Citation Sidebar: Quick access to bibliography
- AI Chat Panel: Context-aware assistance
- BibTeX Integration:
- Real-time monitoring of references.bib file
- Automatic parsing and indexing of citations
- Export bibliography in multiple formats
- Smart Citation Autocomplete:
- Trigger with "@" symbol in editor
- Search by author, year, title, or keywords
- Preview citation format before insertion
- Citation Nodes:
- Immutable objects in document (not plain text)
- Preserve citation metadata and linking
- One-click navigation to source PDF
- PDF Annotation System:
- Sidecar JSON files (filename.pdf.json)
- Preserve original PDFs untouched
- Git-versioned annotations and highlights
- Sync between PDF viewer and document citations
- Semantic Understanding: Not just text editing, but understanding research document structure
- PDF-to-Document Flow: Direct connection between reading and writing
- VS Code Foundation: Leverage existing ecosystem while hiding complexity
- Research Workflows: Built for academic processes, not generic writing
- User loads 50 PDFs into library
- Searches for "cognitive load in children"
- AI finds relevant sections across all papers
- User highlights findings and adds to document with auto-citations
- Import .docx with reviewer comments
- AI suggests evidence from PDF library to address concerns
- User strengthens arguments with AI assistance
- Export revised .docx with track changes
- User writes claim in document
- AI flags missing citation
- Suggests relevant papers from library
- Auto-inserts citation in correct format
- Frontend (The Studio): VS Code Extension providing the UI and editing experience
- Backend (The Brain): AI processing layer handling document analysis and intelligence
- Communication: IPC/WebSocket between frontend and backend for non-blocking operations
- Import: mammoth.js for .docx parsing with format preservation
- Editor: ProseMirror with custom schema for academic documents
- Export: Pandoc integration for multi-format support
- Model Context Protocol (MCP): For standardized AI tool exposure
- LangChain.js: For complex AI orchestration
- Local LLM Support: Via Ollama for privacy-conscious users
- Documents: File system with Git-based version control
- Vector Store: ChromaDB (dev) / LanceDB (production)
- Metadata: SQLite for citation management and document relationships
- Cross-platform (Windows, Mac, Linux)
- Based on VS Code Extension framework
- Local-first processing with optional cloud
- Sub-2 second document load times
- Handle 1000+ PDF libraries
- VS Code Web Extension compatibility
- Progressive Web App capabilities
- WebAssembly for performance-critical operations
- IndexedDB for client-side storage
- Adoption: 10,000+ active researchers within first year
- Retention: 60%+ monthly active usage
- Document Completion: 50% reduction in time to complete papers
- Citation Accuracy: 90%+ improvement in citation verification
- Must maintain perfect .docx compatibility
- Cannot break existing Word workflows
- Must handle large PDFs (100+ pages) efficiently
- Must preserve all formatting and metadata
- Must work offline for desktop version