Data Science Tools: Difference between revisions

From eLib.at
(One intermediate revision by the same user is not shown)
Line 23:
==Questions to ask:==


* What technical tools do I want to use?
* What do I have installed on my system and what do I need to add?
* Which data questions do I want to ask and what datasets do I want to work on?


==HowTos:==
Line 33:
* [https://linuxize.com/post/how-to-add-directory-to-path-in-linux/ Add to PATH on Linux]
* [https://www.freecodecamp.org/news/run-python-script-how-to-execute-python-shell-commands-in-terminal/ Run the Python Shell on Linux]
 
* [https://spacy.io/models/en spaCy: Install language models]
* [https://demos.explosion.ai/displacy displaCy - Text visualisation with a spaCy language model]





Current revision as of 31 December 2022, 14:32




This page collects one possible approach for newbies to get an NLP and data science virtual machine up and running.


Environment Setup

DATlinux comes with a variety of tools pre-installed, but we will use spaCy and Python for a minimal pipeline.
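To give a rough idea of what a minimal spaCy pipeline can look like, here is a small sketch. It assumes spaCy 3.x is installed and uses a blank English pipeline with only a rule-based sentence splitter, so no language model has to be downloaded first. The function names `build_pipeline` and `sentences_and_tokens` and the sample text are made up for illustration; they are not part of spaCy or of this page's setup.

```python
def build_pipeline():
    """Build a tokenizer-only English pipeline plus a rule-based sentence splitter."""
    # Imported here so this file still loads on a system where spaCy is missing.
    import spacy

    nlp = spacy.blank("en")      # blank pipeline: tokenizer only, no trained model
    nlp.add_pipe("sentencizer")  # splits sentences on punctuation
    return nlp


def sentences_and_tokens(text):
    """Return one list of token strings per sentence in `text`."""
    nlp = build_pipeline()
    doc = nlp(text)
    return [[token.text for token in sent] for sent in doc.sents]


if __name__ == "__main__":
    sample = "DATlinux ships many tools. We only use spaCy here."
    for sentence in sentences_and_tokens(sample):
        print(sentence)
```

For tagging, parsing or named entities, a trained model such as the ones listed on the spaCy models page would replace the blank pipeline; this sketch deliberately stops at tokens and sentences.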

Questions to ask:

  • What technical tools do I want to use?
  • What do I have installed on my system and what do I need to add?
  • Which data questions do I want to ask and what datasets do I want to work on?
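The second question above (what is installed and what is missing) can be answered partly from Python itself. The following is a minimal sketch using only the standard library; the helper name `check_environment` and the module and command names passed to it are illustrative assumptions, so swap in whatever your own setup needs.

```python
import importlib.util
import shutil
import sys


def check_environment(modules, commands):
    """Report which Python modules and command-line tools are available."""
    return {
        # Version of the interpreter running this script, e.g. "3.10.6"
        "python": sys.version.split()[0],
        # find_spec returns None when a top-level module cannot be found
        "modules": {name: importlib.util.find_spec(name) is not None for name in modules},
        # shutil.which returns None when a command is not on the PATH
        "commands": {name: shutil.which(name) is not None for name in commands},
    }


if __name__ == "__main__":
    report = check_environment(modules=["spacy", "pandas"], commands=["git", "pip"])
    print("Python", report["python"])
    for kind in ("modules", "commands"):
        for name, found in report[kind].items():
            print(f"{kind[:-1]} {name}: {'found' if found else 'missing'}")
```

Anything reported as missing is a candidate for installation; a command that is installed but still reported as missing usually points to a PATH problem.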

HowTos:


Datasets

  • Reddit: pushshift.io

