TY  - BOOK
AU  - O'Neil, Cathy
AU  - Schutt, Rachel
TI  - Doing data science: Straight talk from the frontline
SN  - 9789351103189
U1  - 006.31 23
PY  - 2018///
CY  - Mumbai
PB  - O'Reilly, Shroff Publishers & Distributors
KW  - Data mining
KW  - Big data
KW  - Information science
KW  - Data structures (Computer science)
KW  - Database management
KW  - Cyberinfrastructure
N1  - Includes index; Introduction : What is data science? -- Statistical inference, exploratory data analysis, and the data science process -- Algorithms -- Spam filters, naive Bayes, and wrangling -- Logistic regression -- Time stamps and financial modeling -- Extracting meaning from data -- Recommendation engines : building a user-facing data product at scale -- Data visualization and fraud detection -- Social networks and data journalism -- Causality -- Epidemiology -- Lessons learned from data competitions : data leakage and model evaluation -- Data engineering : MapReduce, Pregel, and Hadoop -- The students speak -- Next-generation data scientists, hubris, and ethics
ER  - 