Skip to content

rjnp2/Data-Science

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DATA SCIENCE

Data-Science: What is, Basics & Process

Path for Data Science

Programming

Python_Programming
SQL programming
• Excel
• Comfortable with using the Terminal, version control in Git, and using GitHub

Calculus
Linear Algebra
Probability and Statistics:

  • Accessing database, CSV, and JSON data
  • Data cleaning and transformations using pandas
  • visualization
  • dashboards
  • Content-Based and Collaborative Filtering
  • Evaluation of Recommendation Systems. DCG, nDCG

Natural Language Processing

  • Time Series Analysis
  • Text Analytics
  • Lexical processing
  • Syntactic Processing
  • Semantic Processing

Fundamentals of Big Data Engineering

  • Hadoop and MapReduce Programming
  • NoSQL Databases and Apache HBase
  • Hive Tutorial
  • Analytics using PySpark

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages