Quora is a regular haunt of mine these days, and a lot of my activity there is centered on topics of deep interest – usually data, engineering, aviation and technology. Here’s the first version of the Quora data science answers roundup that I posted in January 2017, soon after I was designated Quora Top Writer for 2017.
What you see below are some more of my answers from 2017, on data and related areas, from Quora.
Data science, data analysis, simulation, probability, statistics and machine learning answers:
- Some hard truths about becoming a data scientist
- The best thing about working in data science
- Important qualities for data scientists. Related posts here and here
- Relevance of the basics of ML given the presence of machine learning APIs
- Expensive boot camps for data science and justifying spending
- Nontrivial ideas from probability and statistics required for data science
- Thoughts on Andrew Ng’s deep learning course (which led to a blog post here too)
- On new and interesting research ideas in the AI space
- Managing unstructured text data and feature extraction – more here
- Managing missing data fields and null values in data science problems
- On linear programming versus stochastic searches for hyperparameter optimization
- Differentiating between fitness and loss functions
- On model interpretability in machine learning
- Characteristics of a good regression model
- Distribution modeling and probability – 1 , 2 , 3 , 4
- On data analysis and its use in the manufacturing industry
- Optimization techniques in data analysis and data science
- On the philosophy of deep learning – related answers on how deep learning algorithms learn , on weight initialization in deep neural networks
- On time series models in data analysis – more here , here , here , here , here , here and here
- Convex optimization and the use of gradient descent
- On Genetic Algorithms – 1 , 2 , 3
- Anomaly detection in financial time series data – related answer here
- Significance and difference in significance testing
- Agent based modeling for traffic simulations
Technology-specific answers on data science and analysis:
- On big data technology courses, and the lack of architecture, strategy and such courses
- On the continuing relevance of SQL/RDBMS technologies
- The develop-vs-use conundrum for building data and machine learning systems – more here
- Advice on career and certifications – 1 , 2 , 3 , 4
- Programming language specific answers – 1 , 2 , 3 , 4 , 5 , 6 , 7 , 8 , 9 , 10 , 11
- General data science books, resources, skills – 1 , 2 , 3
- On big data ecosystems and components
- Perspectives on data warehousing and big data technologies
- Contextualizing tools like Excel in the context of data analysis
Data science and management:
- The importance of BI and decision enablement tools in the data space
- Andrew Ng’s venture and how it could be differentiated from others
- Managing data science projects
Hope you enjoy reading through them and find them interesting and informative!