Was sind Long Short-Term Memory Networks? (LSTM)

Long Short-Term Memory (LSTM) Netzwerke sind eine spezielle Art von rekurrenten neuronalen Netzwerken (RNNs), die speziell dafür entwickelt wurden, Langzeitabhängigkeiten in Sequenzdaten zu erfassen. LSTMs wurden von Hochreiter und Schmidhuber 1997 eingeführt und haben sich seitdem als sehr nützlich für eine Vielzahl von Aufgaben im Bereich des maschinellen Lernens erwiesen, insbesondere für die Verarbeitung und Vorhersage von sequenziellen Daten.

Hauptmerkmale und Funktionsweisen von LSTMs

Architektur:

Cell State: Der Cell State (Zellenzustand) ist ein wesentlicher Bestandteil von LSTMs, der Informationen über lange Zeiträume hinweg speichern kann. Er ist sozusagen das Gedächtnis des Netzwerks.
Gates: LSTMs verwenden drei verschiedene Gates, um den Informationsfluss zu steuern: Eingabegate (Input Gate), Vergessensgate (Forget Gate) und Ausgabegate (Output Gate). Diese Gates bestehen aus sigmoid- und tanh-Funktionen und bestimmen, welche Informationen hinzugefügt, entfernt oder ausgegeben werden. Die Funktionen sigmoid und tanh sind Aktivierungsfunktionen, welche für die der Steuerung des Informationsflusses der Neuronen verwendet werden.

Gates-Funktionalität:

Forget Gate: Dieses Gate entscheidet, welche Informationen aus dem Zellenzustand entfernt werden sollen. Es nimmt die vorherige Ausgabe und die aktuelle Eingabe als Eingabe und gibt einen Wert zwischen 0 und 1 aus. Ein Wert nahe 0 bedeutet, dass die Information vergessen wird, während ein Wert nahe 1 bedeutet, dass die Information beibehalten wird.

Input Gate: Dieses Gate entscheidet, welche neuen Informationen zum Zellenzustand hinzugefügt werden sollen. Es arbeitet zusammen mit einer tanh-Schicht, die neue potenzielle Werte erstellt, die dem Zellenzustand hinzugefügt werden könnten.

Output Gate: Dieses Gate entscheidet, welche Informationen aus dem Zellenzustand als Ausgabe verwendet werden sollen. Es filtert den Zellenzustand durch eine sigmoid-Schicht und multipliziert diesen mit dem tanh des Zellenzustands.

Anwendungen

LSTMs sind in der Lage, sowohl Langzeit- als auch Kurzzeitinformationen zu speichern und abzurufen. Dies ist besonders nützlich bei Aufgaben, bei denen frühere Informationen entscheidend sind. LSTMs werden häufig in Bereichen wie Sprachverarbeitung (z.B. maschinelle Übersetzung, Sprachsynthese), Zeitreihenanalyse (z.B. Aktienkursvorhersage), Videosequenzanalyse und vielen anderen verwendet, bei denen sequenzielle Informationen entscheidend sind.

Durch die Fähigkeit, Langzeitabhängigkeiten effektiv zu modellieren und die Probleme traditioneller RNNs, wie das Verschwinden und Explodieren von Gradienten, zu überwinden, haben LSTMs eine herausragende Rolle in der modernen KI und im maschinellen Lernen übernommen.

Cookie	Dauer	Beschreibung
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
CookieLawInfoConsent	1 year	Records the default button state of the corresponding category & the status of CCPA. It works only in coordination with the primary cookie.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Dauer	Beschreibung
__gads	1 year 24 days	The __gads cookie, set by Google, is stored under DoubleClick domain and tracks the number of times users see an advert, measures the success of the campaign and calculates its revenue. This cookie can only be read from the domain they are set on and will not track any data while browsing through other sites.
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_ga_8EVYKBJE0L	2 years	This cookie is installed by Google Analytics.
_ga_ECCBGK6LZQ	2 years	This cookie is installed by Google Analytics.
_gat_gtag_UA_216518707_1	1 minute	Set by Google to distinguish users.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.