Eight Habits of Effective Data Scientists
The best data scientists I have hired and worked with share 8 common traits.
Vin Vashishta | Originally Published: June 5th, 2017
I am fortunate to have met with some of the pioneers of data science and machine learning early on in my career. Their thoughts shaped my interest in the field and their habits formed my daily routine. The most frequent question I am asked is some form of, ‘How do I build a career in data science or machine learning?’ It starts with forming some important habits. Here is what works for me and what I’ve seen build exceptional data scientists.
Focused Reading and Learning
There are only so many hours in the day and so much more garbage than gold to read. Develop a list of trusted sources. These are blogs, social feeds, and document aggregators that provide quality content. Do not waste your time reading the fluff pieces you find on sites like Forbes or HuffPo. Do not read every deep learning paper on Arxiv.
I follow the research being done at companies like Facebook, Google, Disney, and several others. They are frequent publishers and their work is worth reading. Universities like Stanford, MIT, Cornell, and others are also strong sources for quality research. Find bloggers who talk about topics of interest to you in a format that is easy for you to understand. Twenty different people all write/talk/video about the same topics so you have your pick of communication/presentation styles.
Optimize Your Workspace and Toolset
I am faster than most data scientists because I have streamlined the tools I use. I can deliver projects sooner because I am not wasting any time fighting with my development environment. I have default server images on Amazon for my workspaces. Each one is customized with the IDE/environmental variables I have found best fits the programming language and database(s) I need to use. I have go to data sources for most types of projects and prebuilt hooks into most internal data sources. Security settings are part of the image.
It took me a lot of trial and error to get here. I have used more configurations than I can count and experimented with several different data science and development applications. Build what works best for you and enables you to be most efficient. You will look like a professional in a field of unprepared amateurs.
Listen for Business Problems
If you do not want to be building attribution models until Google automates you out of a job, you need to find new business problems to solve. I listen to Bloomberg Business and CNBC frequently. What I am keeping an ear out for is why companies missed revenue targets. Those are the business problems they have not solved and are willing to pay for solutions to.
How can data science or machine learning tackle these challenges? I see a lot of predictive problems; something happened that the company did not anticipate. That is often a supply chain disruption or a change in customer preference. I also see data science or machine learning capabilities issues; a company does not have the capabilities to analyze their data. These are problems shared by a lot of companies. Learn how to solve these problems and advertise your abilities. You are a lot more valuable to a company if you can listen for their business problems and synthesize solutions.
Build A Professional Network
Reputation and influence are the new marketing. There are two benefits that make the work worth it. First, a professional network allows you to find people to learn from. Second, a professional network allows you to build a brand and grow your influence in the data science or machine learning community.
Social sites like Twitter and LinkedIn as well as technical sites like Stack are great places to build a professional network. Start out by following and listening. Once you start to see the types of content that the community likes to consume, move on to being an aggregator. Aggregating is as simple as sharing what you are reading that your followers will appreciate seeing as well. Begin to find your voice. As you see interesting topics that are not well covered by others or you have interesting experiences to share, start creating content. With Stack, start answering questions. As you gain expertise, think about contributing to open source projects or publishing your research.
Influence brings opportunities. Rather than cold calling for new jobs and promotions, you will be constantly promoting yourself. I have gotten speaking gigs, insider access to conventions, and several clients through my professional network. It is well worth the effort.
Take the Time to Speak
Take every opportunity you can to speak. One of the biggest reasons I have been called an influencer is my speaking engagements. I take small, private audiences. I prepare a short talk on a topic of interest to the group and spend the rest of the time answering questions. We all provide value in different ways. My style is to feed off my audience’s curiosity and direct the event towards important points around their areas of interest. Other speakers will spend most of their time on the presentation. Many audiences do not have their own questions, so it is more valuable for the speaker to take them on a guided tour. This is better suited to a conference setting than a private event.
Develop your own style, message, and audience. Share your unique experiences, projects, or vision. You will be amazed by how many people are interested in what you can share and teach them. Speaking helps develop your unique perspective. In a long career with machine learning or data science, your perspective is far more valuable than any other contribution you will make in code or algorithm.
Say No A Lot
Data scientists and machine learning experts can do a lot of different things. That leads to us getting asked to do a lot of different things. In most cases, no is the right answer. Saying no a lot really comes down to understanding what you want to be working on and choosing your professional path. I have said no to jobs, book deals, and projects that did not fit into my personal path. I am happier and more focused for it.
There is a Venn Diagram out there with one circle labeled “right for you” and another circle labeled “right in front of you.” The overlap is exceedingly small. I have found that few opportunities that land in my lap are right for me. The opportunities I want, I must chase after myself. I am the one asking for them, not the other way around. The most effective data scientists I know go get the projects, clients, and roles they want. I emulate that behavior myself and it has worked out well for me.
Adopt A Minimalist Style
The masters use the fewest lines of code, the least data, the simplest algorithm, speak briefly, and so on. Minimalism is the mark of an expert data scientist.
Speak with Purpose and To Be Understood
I spend only about 20% of my time communicating with data science and machine learning experts. Most of my communications are with non-technical audiences. They do not care about the machine learning. They have an outcome they want.
The lion’s share of speaking with purpose and clarity is first listening. The process of asking questions to get to the truth of what a person or group needs is an art form. I am still working on this one myself, but I have seen the masters dissect a problem down to its root by asking the right questions. The elements of a good dialectic are creating an environment where people are comfortable answering questions honestly and admitting what they don’t know, giving them a sense that spending the time to answer these questions will benefit them, and synthesizing their answers back to them in a way that shows comprehension.
Once I understand the question, I can answer it with a lot more certainty. I use language that anyone will understand while respecting my audience enough to expect they will grasp complex concepts if I can frame them the right way. I see it as my responsibility to express ideas and concepts in a way that my audience can understand. When they are lost or confused, that is my fault not theirs. This is the piece of my own advice I find hardest to take but that perspective on communication has helped me improve greatly.