kozistr

Machine Learning Engineer & Data Scientist

South Korea

kozistr@gmail.com

http://kozistr.tech

Dec 12, 2020 • 9 min read ☕
About ME
- #About
- #CV
Profile Service machine learning products in various domains, Audio & Speech, Vision, NLP, Recommendation Systems, Tabular, LLM application…
Dec 26, 2023 • 4 min read ☕
2023년 회고 (feat. 병특 끝)
- #Diary
Prologue 글을 쓰기 시작한 시점에서 올해가 1주일이 채 남지 않았는데, 올해를 돌아보면 중요한 사건이 끝나고 새로운 시작을 한, 그사이에 느낀 것들이 많고 놓친 것들을 되돌아보는 한 해였다. 나름 만족스러운 해였다. 큰 이벤트들을 먼저 떠올려…
May 26, 2023 • 2 min read ☕
(Kaggle) BirdCLEF 2023 - 24th (top 2%) place solution
- #Deep-Learning
- #Kaggle
Original Post : https://www.kaggle.com/competitions/birdclef-2023/discussion/412996 Architecture Here's the pipeline. pre-train on 2020, 20…
Feb 28, 2023 • 2 min read ☕
(Kaggle) Screening Mammography Breast Cancer Detection - 16th (top 1%) place solution
- #Deep-Learning
- #Kaggle
Original Post : https://www.kaggle.com/competitions/rsna-breast-cancer-detection/discussion/391133 Data Preprocessing My preprocessing code…
Jan 03, 2023 • 2 min read ☕
(Kaggle) Detecting Continuous Gravitational Waves - 22th (top 2%) place solution
- #Deep-Learning
- #Kaggle
Original Post : https://www.kaggle.com/competitions/g2net-detecting-continuous-gravitational-waves/discussion/375927 Data Pre-Processing In…
Aug 17, 2022 • 2 min read ☕
(Kaggle) Default Prediction - 135th (top 3%) place solution
- #Deep-Learning
- #Kaggle
Original Post : https://www.kaggle.com/competitions/amex-default-prediction/discussion/347996 TL;DR I couldn't spend lots of time on the co…
Nov 03, 2021 • 3 min read ☕
(Kaggle) Ventilator Pressure Prediction - 20th (top 1%) place solution
- #Deep-Learning
- #Kaggle
Original Post : https://www.kaggle.com/competitions/ventilator-pressure-prediction/discussion/285295 TL;DR Our solutions are focused on the…
Aug 09, 2021 • 2 min read ☕
(Kaggle) COVID-19 Detection - 47th (top 4%) place solution
- #Deep-Learning
- #Kaggle
Original Post : https://www.kaggle.com/competitions/siim-covid19-detection/discussion/263830 TL;DR I only got Kaggle GPU/TPU, couldn't expe…
Dec 16, 2022 • 3 min read ☕
2022년 회고
- #Diary
TL;DR 올해 회고를 시작하기 전 작년 회고를 읽었는데, 첫 줄부터 희망한 대로 흘러가진 않았다. 병특도 정착했고 2022년은 조용히 지나가나 했지만, 회사 관련해서도 큰 변화가 있었고 여러 일들이 있었다. 우리가 이라 하는 것처럼 "이젠 괜찮겠지…
Aug 24, 2022 • 2 min read ☕
MaxViT - Multi-Axis Vision Transformer
- #Deep-Learning
TL;DR paper : arXiv code : github Related Work GC ViT Introduction 최근 vision transformer연구 경향을 보면 global context를 잘 고려하는 ViT연구들이 많이 보이는데, 이…
Aug 19, 2022 • 2 min read ☕
GC ViT - Global Context Vision Transformers
- #Deep-Learning
TL;DR 최근 computer vision architecture를 보면 image 만 사용하는 게 아닌 extra training data로 text information를 활용하면서 성능을 끌어올리거나 여러 models를 ensemble 하는 …
Aug 18, 2022 • 1 min read ☕
TitaNet - Neural Model for speaker representation with 1D Depth-wise separable convolutions and global context
- #Deep-Learning
TL;DR paper : arXiv code : github Related Work contextnet paper ecapa-tdnn paper angular softmax paper Architecture architecture와 비슷한데, d…
Aug 15, 2022 • 3 min read ☕
NaturalSpeech - End-to-End Text to Speech Synthesis with Human-Level Quality
- #Deep-Learning
TL;DR 오랜만에 speech-synthesis 쪽 논문을 보다가 (LJSpeech dataset에서) MOS, CMOS metrics에서 human-level에 도달한 research 가 있는데, 거기에 최근 유행이었던 diffusion appr…
Aug 14, 2022 • 3 min read ☕
FlashAttention - Fast and Memory-Efficient Exact Attention with IO-Awareness
- #Deep-Learning
TL;DR 대부분 memory & speed 관점에서 attention 연구를 보면, full attention 하지 않는 방식이나 유사(?) attention을 만들거나 softmax 부분 연산을 줄이는 등의 시도들이 있었는데, 이번 연구는 har…
Aug 10, 2022 • 3 min read ☕
Charformer - Fast Character Transformers via Gradient-based Subword Tokenization
- #Deep-Learning
TL;DR paper : arXiv code : github Related Work mT5 paper ByT5 paper Introduction 이번엔 gradient-based subword tokenization module (GBST)를 만들었…
Aug 09, 2022 • 2 min read ☕
ByT5 - Towards a Token-Free Future with Pre-trained Byte-to-Byte Models
- #Deep-Learning
TL;DR paper : arXiv code : github Related Work CANINE mT5 paper Introduction 기존 LM 에서는 tokenizer 를 사용하고 있어 여러 측면에서 단점이 있는데, 이런 문제를 해결하기 위해 …
Mar 20, 2022 • 1 min read ☕
DeepNet - Scaling Transformers to 1,000 Layers
- #Deep-Learning
TL;DR 최근 논문을 보면 complex 한 architecture를 design 하기보다는 training recipes을 제안하거나 large-scale 모델을 더 stable하게 학습하는 방법 등의 논문들이 많이 나오는 경향입니다 (개인적으로…
Mar 19, 2022 • 3 min read ☕
토스 3 months review
- #Diary
TL;DR 벌써 토스로 이직한 지 3달이 됐네요. 사실 블로그에 글로 쓸 생각은 없었는데, 짧지만 정말 여러 가지를 생각하고 느끼기도 했고 입사 당시에는 토스 문화에 대해서 잘 알지 못했는데, 여기에 남겨보면 어떨까 해서 글을 씁니다. 토스 문화에 …
Jan 31, 2022 • 2 min read ☕
Data2Vec - A General Framework for Self-supervised Learning in Speech, Vision and Language
- #Deep-Learning
TL;DR FAIR에서 이란 논문이 나왔는데, multi-modal SSL paper라 해서 흥미가 생겨서 읽게 됐습니다. 읽기 전 궁금했던 points는 modality마다 feature extraction methods가 다를텐데, 어떤 meth…
Dec 17, 2021 • 3 min read ☕
2021년 회고
- #Diary
TL;DR 2020년도 회고한 지 얼마 안된 거 같은데 벌써 2021년이 끝나가네요. 올해에도 심경의 변화나 많은 events가 일어나진 않았지만, 최근에 쓴 퇴사부검도 그렇고 일어난 일 하나하나 굵직했던 거 같아요. 올해 키워드를 하나 뽑는다면 …
Dec 05, 2021 • 3 min read ☕
2021년 퇴사 부검
- #Diary
TL;DR 올해도 여러 큰 사건(?)들이 있었지만, 그 중 또 한 번의 이직이 가장 큰 사건이 아닐까 싶습니다. 지난 2년 동안 회사를 알아보고 이직하는 과정을 매년 하니 지치고 병특이라 총알(이직 선택지)이 많이 없어서, 일단 환경이 어떻든 버텨봐…
Aug 07, 2021 • 3 min read ☕
Anycost GANs for Interactive Image Synthesis and Editing
- #Deep-Learning
TL;DR Github에 들어가면 우측 상단에 에서 종종 재밌는 repositories를 추천해줘서 자주 구경 중인데, 도 이렇게 보다 논문까지 읽어보다 재밌어 보여서 짧게 정리해 보려고 합니다. paper : arXiv github : repo R…
Jul 26, 2021 • 2 min read ☕
VertiFocalNet - An IoU-aware Dense Object Detector
- #Deep-Learning
TL;DR 최근에 Object Detection (이하 OD) task 관련 kaggle challenge를 하다, 비슷한 대회 solutions들을 보다가 상위 랭크 solution에 (이하 )을 사용한 걸 발견해서, 요 논문을 더 자세하게 공부…
May 23, 2021 • 6 min read ☕
육군훈련소 후기 & 팁
- #army
TL;DR 지난 4월 29일 ~ 5월 20일까지 3주간 육군훈련소를 다녀왔습니다. 들어가기 1주일 전부터 훈련소에서 코로나 격리 수준이 기본권을 침해한다는 등 여러 기사와 썰 들이 돌아다니고, 최근에 다녀온 지인들도 격리주간에 꽤 힘들었다는 말을 들…
Apr 19, 2021 • 3 min read ☕
Assem-VC - Realistic Voice Conversion by Assembling Modern Speech Synthesis Techniques
- #Deep-Learning
TL;DR 최근 mindslab 에서 Cotatron에 이어 새로운 VC (Voice Conversion) 논문이 나와서 논문을 읽게 됐습니다. code는 곧 나올 예정인 듯합니다. issue를 보니 mid-june 에 release 할 가능성이 있…
Apr 02, 2021 • 2 min read ☕
EfficientNetV2 - Smaller Models and Faster Training
- #Deep-Learning
TL;DR EfficientNet 의 2번째 논문이 나왔네요. 저자는 EfficientNet 을 쓴 두 분이 쓰셨네요. 이번에 나온 논문은 효율성을 목표로 한 연구인데, NAS로 모델 훈련 속도와 파라메터 수를 엄청나게 줄이면서 성능도 compara…
Mar 25, 2021 • 3 min read ☕
ConSinGAN - Improved Techniques for Training Single-Image GANs
- #Deep-Learning
TL;DR 이전에 리뷰했던 SinGAN 후속 논문이 나왔는데, 우연히 github 메인 페이지 오른쪽에 보면 Explore repositories가 있는데, 여기에 추천 repo로 떠서 우연히 보게 됐습니다. 저자분께서 짧은 요약을 블로그에 정리해서…
Dec 23, 2020 • 4 min read ☕
2020년 회고
- #Diary
TL;DR 올해 처음으로 회고록을 적어보는데, 사실 작년부터 써야지 생각만 하다 결국 놓쳤는데, 벌써 1년이 지나 쓸 때가 온 걸 보고 시간은 정말 빨리 가는 걸 실감하며, 2020년 회고지만 지난 2년 동안 일어났던 작고 큼지막한 일들을 하나씩 적…
Sep 07, 2020 • 6 min read ☕
NVAE A Deep Hierarchical Variational Autoencoder
- #Deep-Learning
TL;DR 최근에 NVLabs 에서 VAE 관련 논문이 하나 나왔는데, 매주 월요일이 회사 짬데이라고 개인 or 팀 끼리 공부하고 싶은 주제 공부하고 공유하는 문화가 있어서, 마침 잘 돼서 논문 리뷰를 해 봅니다. paper : arXiv code …
Jun 16, 2020 • 4 min read ☕
TUNIT Rethinking the Truly Unsupervised Image-to-Image Translation
- #Deep-Learning
TL;DR 최근에 Clova AI 에서 unsupervised image 2 image translation 관련 논문이 나와서 한번 빠르게 봤습니다. 일단 제목부터가 재밌는데 TUNIT, Truly Unsupervised Image to Image…
Jun 06, 2020 • 5 min read ☕
UIS-RNN-SML SUPERVISED ONLINE DIARIZATION WITH SAMPLE MEAN LOSS FOR MULTI-DOMAIN DATA
- #Deep-Learning
TL;DR 평소에 speaker diarization task 에 정말 관심이 많고, 이전에 이쪽 분야 (speech domain 쪽 전반적으로) 업무를 하다가, 최근에 다시 이쪽 분야 trend 는 어떤지 궁금해서 예전에 UIS-RNN 기반으로 s…
May 23, 2020 • 3 min read ☕
ResNeSt Split-Attention Networks
- #Deep-Learning
TL;DR Amazon 에서 지난달에 재밌는 논문이 나왔는데요, 새로운 image classification architecture 를 제안했는데, EfficientNet 보다 더 좋은 성능을 보이는 human-made architecture 를 선…
May 10, 2020 • 3 min read ☕
Cotatron Transcription-Guided Speech Encoder for Any-to-Many Voice Conversion without Parallel Data
- #Deep-Learning
TL;DR 최근 mindslab 에서 VC (Voice Conversion)관련 논문이 나와서 오랜만에 요 쪽 domain 도 볼 겸 해서 논문을 읽게 됐습니다. 간단하게 요약하면, 유명한 google 의 TTS model 인 tacotron2 기반…
Apr 26, 2020 • 3 min read ☕
YOLOv4 Optimal Speed and Accuracy of Object Detection
- #Deep-Learning
TL;DR 이번에 리뷰할 논문은 오랜만에 나온 YOLO 4번째 버전인 YOLOv4 논문입니다. 이번 버전은 이야기가 있는(?) 버전인데, YOLO 원 저자인 Joe Redmon 님 께서 올해 2월쯤에 twit으로 CV 연구를 그만하겠다고 선언하셨는데…
Apr 11, 2020 • 2 min read ☕
Self-training with Noisy Student improves ImageNet classification
- #Deep-Learning
TL;DR 이번 포스팅에서 리뷰할 논문은 EfficientNet 기반으로 새로운 techniques 를 적용해서 ImageNet dataset 에서 SOTA 를 찍은 논문입니다. 나온지는 꽤 됐지만, 최근 TPU 에서 돌아가는 요 코드를 짜다가 생각…
Apr 11, 2020 • 2 min read ☕
ELECTRA Pre-training Text Encoders as Discriminators Rather Than Generators
- #Deep-Learning
TL;DR 이번에 리뷰할 논문은 ELECTRA 란 google ai 에서 3월에 발표한 논문인데, 재밌는 approach 를 하고 있어서 가져와 봤습니다. ELECTRA paper : OpenReview google ai blog : blog Rel…
Mar 14, 2020 • 5 min read ☕
SinGAN - Learning a Generative Model from a Single Natural Image
- #Deep-Learning
TL;DR 이번 포스팅에서는 ICCV 2019 에서 Best Paper Awards 에서 선정된 papers 중에 하나인 SinGAN 을 리뷰해 보겠습니다. 개인적으로 정말 재밌게 본 논문이고, ICCV 2019 논문들 중 최고였던거 같아요. 그래서…
Mar 14, 2020 • 6 min read ☕
StarGAN-v2 - Diverse Image Synthesis for Multiple Domains review
- #Deep-Learning
TL;DR 이번 포스팅에서는 I2I translation 를 푼 StartGAN v2 을 리뷰해 보겠습니다. 평소에 Multi-Domain I2I translation task 에 관심이 많았는데, 작년에 나온 StarGAN 후속작인 StarGAN …
Mar 14, 2020 • 4 min read ☕
StyleGAN-v2 - Analyzing and Improving the Image Quality of StyleGAN
- #Deep-Learning
TL;DR 이번 포스팅에서는 리뷰할 논문은 지난 19년 11월에 나온 StyleGAN v2를 리뷰 해 보겠습니다 StyleGAN 에 이어서 2 번째 논문인데, 이번 버전에서는 어떤 문제점들을 어떻게 해결했는지를 한번 보려고 합니다! 아래는 Style…
Mar 14, 2020 • 4 min read ☕
SAN Second-order Attention Network for Single Image Super-Resolution
- #Deep-Learning
TL;DR 이번 포스팅에서는 리뷰할 논문은 SAN (Second-order Attention Network) 이라는 Image Super Resolution task 에서 현재 여러 test set 에서 제일 높은 성능 (19년도 기준)을 보이고 있…
Jul 20, 2018 • 1 min read ☕
LK v4.16.x KASLR Bypass
- #Security
- #Linux-Kernel
TL;DR About my recent founds :) I found a bug, memory leak on v4.16.0-rc5. (KASLR Bypass). Maybe, it works on many LKs (i didn't check all …
Jul 20, 2018 • 9 min read ☕
Modern Linux Kernel 0,1-day Unkind-Exploitations Review
- #Security
- #Linux-Kernel
TL;DR Last time, I posted about 1-day vulnerability CVE-2017-5123, waitid() arbitrary R/W with null-deref on LK v4.13.x/~v4.14.0-rc4. It ju…
Jun 17, 2018 • 23 min read ☕
Linux Kernel - 2018-06-3 Founds
- #Security
- #Linux-Kernel
_decode_session6 - soft lockup Got from syzkaller & Found in LK v4.17.0-rc7. Call Trace (Dump) I'll update a post later... End rb_insert_co…
Jun 12, 2018 • 11 min read ☕
Linux Kernel - 2018-06-2 Founds
- #Security
- #Linux-Kernel
create_filter - memory leak Found on LK v4.17.x. kmemleak message pcpu_create_chunk - memory leak Found on LK v4.17.x. kmemleak message set…
Apr 21, 2018 • 39 min read ☕
Linux Kernel - 2018-04-3 Founds
- #Security
- #Linux-Kernel
__sctp_v6_cmp_addr - slab out of bounds Read Found in LK v4.17.0-rc1. slab-out-of-bounds in __sctp_v6_cmp_addr, 8 bytes read. Demo Log End …
Apr 02, 2018 • 37 min read ☕
Linux Kernel - 2018-04-1 Founds
- #Security
- #Linux-Kernel
anon_vma_chain - memory leak Found in LK v4.16.0-rc7. Call Trace (Dump) End kmalloc-1024 - slab padding/red zone overwritten Got from syzka…
Apr 02, 2018 • 18 min read ☕
Linux Kernel - 2018-04-2 Founds
- #Security
- #Linux-Kernel
skb_release_data - use after free Write Got from syzkaller & Found in LK v4.16.0. [2018-04-10] Maybe, it is reproducible under some conditi…
Mar 24, 2018 • 30 min read ☕
Linux Kernel - 2018-03-4 Founds
- #Security
- #Linux-Kernel
dev_hard_start_xmit - soft lockup Found in LK v4.16.0-rc7. stuck for 30s. Call Trace (Dump) Code Code : c8 00 00 00 48 c1 ea 03 48 01 c3 …
Mar 16, 2018 • 21 min read ☕
Linux Kernel - 2018-03-3 Founds
- #Security
- #Linux-Kernel
perf_trace_buf_alloc - warn Found in LK v4.16.0-rc5. Call Trace (Dump) Code In . Just size is over , so WARN_ONCE is just called... And all…
Mar 11, 2018 • 15 min read ☕
Linux Kernel - 2018-03-2 Founds
- #Security
- #Linux-Kernel
sctp_id2assoc - use after free Read Found in LK v4.16.0-rc4. Maybe it could be useful :) Call Trace (Dump) End init_tty - kernel panic Got …

SHOW MORE POSTS

© 2024 Hyeongchan Kim | Theme by JunhoBaik | Built with Gatsby