Long Short-Term Memory
Sepp Hochreiter, Jurgen Schmidhuber
1997 · Neural Computation
Long Short-Term Memory
Problem
- The paper addresses sequence modeling, as suggested by the title Long Short-Term Memory and the note tags (
nlp, sequence).
- More specific problem framing is not yet captured in the local evidence, so this section should remain high-level.
Prior Papers
- The local note records @rumelhartLearningRepresentationsBackpropagating1986 as prior work in canonical metadata.
- Based on the available receipts, this paper is positioned in relation to earlier backpropagation-based neural network work.
- Local evidence is limited beyond this recorded linkage, so a fuller literature comparison is not yet supported here.
Proposed Method
- The paper introduces a method named Long Short-Term Memory.
- Beyond the title, the local structured evidence does not provide reliable technical detail about the architecture or training procedure.
- Accordingly, the method can only be described cautiously here: it proposes a memory-oriented sequence model, but the local note does not yet capture the mechanism in detail.
Evaluation
- The paper appeared in Neural Computation (1997).
- Local evidence for the experimental setup, datasets, baselines, and quantitative results is currently very limited.
- A detailed evaluation summary is therefore not yet supported by the local note.
Method Strengths and Weaknesses
Strengths
- The paper appears to introduce a distinct sequence-modeling method with a clear conceptual identity, as indicated by its title.
- Its age (1997) together with the recorded follow-on links suggests lasting influence, though the local evidence does not document the exact reasons in detail.
Weaknesses
- Local evidence is too limited to substantiate specific technical strengths or failure modes.
- The current note does not yet capture architectural details, empirical limitations, or comparisons to alternatives, so any stronger judgment would be speculative.
Further Research
- The canonical metadata records the following follow-on connections:
- @choGRU2014
- @sutskeverSeq2Seq2014
- @petersELMo2018
- These links suggest that later recurrent-gating, sequence-to-sequence, and contextual representation work are relevant next papers to review.
- The local evidence supports these connections as graph metadata, but does not yet explain the precise technical lineage in detail.
Links
Prior Papers (Royal Road)
Further Papers (Royal Road)