BIG DATA & ANALYTICS

BIG DATA & ANALYTICS & AI
Our BI, big data and analytics team has supported many customers in building BI and analytics solutions to process large amounts of business data and provide real-time reports for business decisions.
Services
- Data warehouse and data mining design and implementation
- Collect and analyze data in real-time
- Data visualization
- Standard and custom reports
- Data analytics and forecasting
- Analyzing high volume structured and unstructured data
- Data migration
Technologies
-
Data Warehousing
- SSIS, SSRS, AWS Glue, AWS SQS, AWS DMS, AWS Kinesis Stream, AWS Kinesis Firehose, AWS EMR
- Hbase, HDFS
- MongoDB, CouchDB, Cassandra, Teradeta, AWS RedShift, AWS Aurora, AWS Athena, AWS Dynamo Database
-
Frameworks
- Hadoop, Spark
- Kafka, Storm, Sqoop, Flume
- Mahout, Drill, Solr
- Druid, SnappyData, Cassandra
- Map Reduce, Pentaho
-
BI and Data Visualization
- Tableau, Splunk, Pentaho, QuickSight, PowerBI, PowerApp, Cognos, Jasper, Qlikview, Pentaho BI, Power View, Datazen
- Predictive analytics: Regression, Classification, Clustering, Time Series
- Programming: Python, R, Java, Hive, Scala, SAS, SPSS
AI & Machine Learning - Skill Set
-
Machine Learning
- Descriptive Statistics
- Deep learning: TensorFlow, Keras, Yolo
- Model Optimization: TensorRT, OpenVINO
-
Reinforcement Learning
- Markov decision process
- Q-Learning and Deep Q-Networks
- Supervised Learning: Linear Regression, Neural Networks, Support Vector Machines
-
Unsupervised Learning
- K-Means Clustering
- Anomaly Detection
- Principal component analysis (PCA)
- Latent Dirichlet allocation (LDA)
- MLaaS: SAS, Google Cloud AI, Microsoft Azure AI, AWS AI
- Computer Vision: Image/video analytics
- Object Detection: Products, People, Vehicles
- NLP/OCR
- Category Theory

AI & Machine Learning - Sample Tools
- AI: Scikit Learn, TensorFlow, Caffe, MxNet, Keras, PyTorch, Theano, OpenVINO, Tensorrt, Gym, OpenCV, Pillow, Rawpy, Scikit-image
- NLP: Gensim, Underthesea, NLTK, Hugging Face, Spacy
- OCR: Tesseract, Google Vision, AWS Textract
- ML Pipeline: Ạmazon Sagemaker, Apache Airflow
- Infrastructure: Docker, Kafka, RabbitMQ, Kubernetes
Let contact our BI, big data and analytics team to discuss solution for your needs