2020
DOI: 10.3390/app10186460
|View full text |Cite
|
Sign up to set email alerts
|

Fi-Fo Detector: Figure and Formula Detection Using Deformable Networks

Abstract: We propose a novel hybrid approach that fuses traditional computer vision techniques with deep learning models to detect figures and formulas from document images. The proposed approach first fuses the different computer vision based image representations, i.e., color transform, connected component analysis, and distance transform, termed as Fi-Fo image representation. The Fi-Fo image representation is then fed to deep models for further refined representation-learning for detecting figures and formulas from d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
16
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 17 publications
(16 citation statements)
references
References 30 publications
0
16
0
Order By: Relevance
“…Fi-Fo Detector [28] Color transform, connected component analysis, distance transform applied on images that are fed to deformable pyramid network.…”
Section: Methods Advantages Disadvantagesmentioning
confidence: 99%
See 2 more Smart Citations
“…Fi-Fo Detector [28] Color transform, connected component analysis, distance transform applied on images that are fed to deformable pyramid network.…”
Section: Methods Advantages Disadvantagesmentioning
confidence: 99%
“…Younas et al [28] exploited a similar approach by employing a deformable FPN module to detect formulas and figures in document images. Instead of providing raw input images to their deformable Faster R-CNN model, the authors have proposed an image transformation method identical to [40].…”
Section: Deformable Convolutionsmentioning
confidence: 99%
See 1 more Smart Citation
“…The authors combined deep structure prediction with a traditional approach to detecting page objects, including formulas in document images. Younas et al [20] introduced a system called Fi-Fo that detects figures and formulas in document images. The authors empirically established that deformable convolutions [42] with Feature Pyramid Networks (FPN) [24] are a better fit as compared to other object detection algorithms.…”
Section: Related Workmentioning
confidence: 99%
“…The recent success of deep learningbased methods on computer vision within the last decade also had an impact on the task of formula detection in scanned document images. Several deep learning-based formula detection approaches [17][18][19][20] have been presented in the past two years. They are mainly equipped with object detection algorithms such as Faster R-CNN [21], YOLO [22], SSD [23], and FPNs [24].…”
Section: Introductionmentioning
confidence: 99%