Data Science Tools
From eLib.at
Current version as of 31 December 2022, 14:32
INFORMATION
More information in the full text. Language of the content: English.
- Data Science Tools
- ELib Team
- 2022
- eLib licence
- Internet text
This page describes one possible approach for newbies to get an NLP and Data Science virtual machine up and running.
Environment Setup
- DATlinux (Custom Lubuntu 22 for Data Science)
- VirtualBox
DATlinux comes with a variety of tools pre-installed, but we will use SpaCy and Python for a minimal pipeline.
Questions to ask:
- What technical tools do I want to use?
- What do I have installed on my system and what do I need to add?
- Which data questions do I want to ask and what datasets do I want to work on?
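One way to answer the second question is to ask Python itself. The following is a minimal sketch, not part of the original page: the helper name `check_environment` is made up for illustration, and it only reports whether a package can be imported and which version is installed.

```python
import importlib.metadata
import importlib.util
import sys


def check_environment(packages):
    """Map each top-level package name to its installed version, or None if missing."""
    report = {}
    for name in packages:
        if importlib.util.find_spec(name) is None:
            # The package cannot be imported at all.
            report[name] = None
            continue
        try:
            report[name] = importlib.metadata.version(name)
        except importlib.metadata.PackageNotFoundError:
            # Importable, but without distribution metadata (e.g. a standard-library module).
            report[name] = "unknown"
    return report


if __name__ == "__main__":
    print("Python", sys.version.split()[0])
    for name, version in check_environment(["spacy"]).items():
        print(name, version or "not installed")
```

If `spacy` shows up as "not installed", the HowTos below cover the installation.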
HowTos:
- Python version on Linux
- Install SpaCy
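- Add to PATH on Linux (https://linuxize.com/post/how-to-add-directory-to-path-in-linux/)
- Run the Python Shell on Linux (https://www.freecodecamp.org/news/run-python-script-how-to-execute-python-shell-commands-in-terminal/)
- SpaCy: Install language models (https://spacy.io/models/en)
- DisplaCy - Text visualisation with SpaCy language model (https://demos.explosion.ai/displacy)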
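Once the steps above are done, a minimal pipeline could look roughly like the sketch below. It is an illustration under assumptions, not the page's own code: it assumes the small English model `en_core_web_sm` was installed (for example with `python -m spacy download en_core_web_sm`), and the function names are hypothetical. If the model is missing, the sketch falls back to a blank English pipeline, which tokenizes but leaves part-of-speech tags and lemmas empty.

```python
def load_pipeline(model_name="en_core_web_sm"):
    """Load a SpaCy pipeline, falling back to a blank English one."""
    # Imported here so token_table() below stays usable without SpaCy installed.
    import spacy

    try:
        return spacy.load(model_name)
    except OSError:
        # Raised when the model package has not been downloaded yet.
        return spacy.blank("en")


def token_table(doc):
    """Return (text, part-of-speech, lemma) for every token in a parsed document."""
    return [(token.text, token.pos_, token.lemma_) for token in doc]


def render_entities_html(doc):
    """Return a standalone HTML page highlighting named entities via DisplaCy."""
    from spacy import displacy

    # jupyter=False forces the markup to be returned instead of displayed inline.
    return displacy.render(doc, style="ent", page=True, jupyter=False)
```

In the Python shell, `doc = load_pipeline()("DATlinux runs inside VirtualBox.")` followed by `token_table(doc)` lists the tokens, and the string returned by `render_entities_html(doc)` can be written to an `.html` file and opened in a browser.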
Datasets
- Reddit: pushshift.io
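As a rough sketch of how such a dataset might be read: the example below assumes a Reddit comment dump that has already been decompressed to a newline-delimited JSON file, with the comment text stored in a `body` field. The file layout, the function name and the `limit` parameter are assumptions for illustration and are not taken from this page.

```python
import json


def iter_comment_bodies(path, limit=None):
    """Yield the text of each usable comment in a newline-delimited JSON file."""
    count = 0
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            record = json.loads(line)
            body = record.get("body")
            # Skip records without text and placeholders for deleted comments.
            if not body or body in ("[deleted]", "[removed]"):
                continue
            yield body
            count += 1
            if limit is not None and count >= limit:
                break
```

Reading the file line by line keeps memory use low, which matters because such dumps can be large. Each yielded string can then be passed to a SpaCy pipeline.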