Data Science Tools: Unterschied zwischen den Versionen

Aus eLib.at
Wechseln zu: Navigation, Suche
Keine Bearbeitungszusammenfassung
Keine Bearbeitungszusammenfassung
Zeile 10: Zeile 10:
|Tags=NLP
|Tags=NLP
}}
}}
This page collects one possible approach for newbies to get a NLP and Data Science virtual machine up and running. 


=Environment=
=Environment=


*DATlinux
*[https://datlinux.com/ DATlinux] (Custom Lubuntu 22 for Data Science)
*Virtualbox
*[https://www.virtualbox.org/ Virtualbox]
 
DATlinux has a variety of tools pre-installed - we will be using [https://spacy.io/ SpaCy] and Python for a minimal pipeline.
 
Questions to ask:
 
*What technical tools do I want to do?
*What do I have installed on my system and what do I need to add?
*Which questions do I want to ask and what datasets do I want to work on?
 
HowTos:
 
* [https://www.geeksforgeeks.org/check-the-version-of-the-python-interpreter/ Python version on Linux]
* [https://spacy.io/usage Install SpaCy]
* Add to PATH on Linux
 


=Datasets=
=Datasets=

Version vom 31. Dezember 2022, 13:34 Uhr

ELibToolsIdeenportal.png

 

INFORMATION

Weitere Infos im Volltext. Sprache der Inhalte: English.

 

Zitierhilfe: Zitiere diese Inhalte in verschiedenen Zitierstilen. Archivkopien aller Inhalte finden sich auch im großartigen Internet Archive (Spenden).

 

QUERVERBINDUNGEN

Verbindungen mit Personen, Orten, Dingen und Ereignissen finden sich unter Themen und Schwerpunkte.



 

TAGS & KATEGORIEN



This page collects one possible approach for newbies to get a NLP and Data Science virtual machine up and running.


Environment

DATlinux has a variety of tools pre-installed - we will be using SpaCy and Python for a minimal pipeline.

Questions to ask:

  • What technical tools do I want to do?
  • What do I have installed on my system and what do I need to add?
  • Which questions do I want to ask and what datasets do I want to work on?

HowTos:


Datasets

  • Reddit: pushshift.io


.

Endnoten