---
_id: '7936'
abstract:
- lang: eng
text: 'State-of-the-art detection systems are generally evaluated on their ability
to exhaustively retrieve objects densely distributed in the image, across a wide
variety of appearances and semantic categories. Orthogonal to this, many real-life
object detection applications, for example in remote sensing, instead require
dealing with large images that contain only a few small objects of a single class,
scattered heterogeneously across the space. In addition, they are often subject
to strict computational constraints, such as limited battery capacity and computing
power.To tackle these more practical scenarios, we propose a novel flexible detection
scheme that efficiently adapts to variable object sizes and densities: We rely
on a sequence of detection stages, each of which has the ability to predict groups
of objects as well as individuals. Similar to a detection cascade, this multi-stage
architecture spares computational effort by discarding large irrelevant regions
of the image early during the detection process. The ability to group objects
provides further computational and memory savings, as it allows working with lower
image resolutions in early stages, where groups are more easily detected than
individuals, as they are more salient. We report experimental results on two aerial
image datasets, and show that the proposed method is as accurate yet computationally
more efficient than standard single-shot detectors, consistently across three
different backbone architectures.'
article_number: 1716-1725
article_processing_charge: No
author:
- first_name: Amélie
full_name: Royer, Amélie
id: 3811D890-F248-11E8-B48F-1D18A9856A87
last_name: Royer
orcid: 0000-0002-8407-0705
- first_name: Christoph
full_name: Lampert, Christoph
id: 40C20FD2-F248-11E8-B48F-1D18A9856A87
last_name: Lampert
orcid: 0000-0001-8622-7887
citation:
ama: 'Royer A, Lampert C. Localizing grouped instances for efficient detection in
low-resource scenarios. In: IEEE Winter Conference on Applications of Computer
Vision. IEEE; 2020. doi:10.1109/WACV45572.2020.9093288'
apa: 'Royer, A., & Lampert, C. (2020). Localizing grouped instances for efficient
detection in low-resource scenarios. In IEEE Winter Conference on Applications
of Computer Vision. Snowmass Village, CO, United States: IEEE. https://doi.org/10.1109/WACV45572.2020.9093288'
chicago: Royer, Amélie, and Christoph Lampert. “Localizing Grouped Instances for
Efficient Detection in Low-Resource Scenarios.” In IEEE Winter Conference on
Applications of Computer Vision. IEEE, 2020. https://doi.org/10.1109/WACV45572.2020.9093288.
ieee: A. Royer and C. Lampert, “Localizing grouped instances for efficient detection
in low-resource scenarios,” in IEEE Winter Conference on Applications of Computer
Vision, Snowmass Village, CO, United States, 2020.
ista: 'Royer A, Lampert C. 2020. Localizing grouped instances for efficient detection
in low-resource scenarios. IEEE Winter Conference on Applications of Computer
Vision. WACV: Winter Conference on Applications of Computer Vision, 1716–1725.'
mla: Royer, Amélie, and Christoph Lampert. “Localizing Grouped Instances for Efficient
Detection in Low-Resource Scenarios.” IEEE Winter Conference on Applications
of Computer Vision, 1716–1725, IEEE, 2020, doi:10.1109/WACV45572.2020.9093288.
short: A. Royer, C. Lampert, in:, IEEE Winter Conference on Applications of Computer
Vision, IEEE, 2020.
conference:
end_date: 2020-03-05
location: ' Snowmass Village, CO, United States'
name: 'WACV: Winter Conference on Applications of Computer Vision'
start_date: 2020-03-01
date_created: 2020-06-07T22:00:53Z
date_published: 2020-03-01T00:00:00Z
date_updated: 2023-09-07T13:16:17Z
day: '01'
department:
- _id: ChLa
doi: 10.1109/WACV45572.2020.9093288
external_id:
arxiv:
- '2004.12623'
language:
- iso: eng
main_file_link:
- open_access: '1'
url: https://arxiv.org/abs/2004.12623
month: '03'
oa: 1
oa_version: Preprint
publication: IEEE Winter Conference on Applications of Computer Vision
publication_identifier:
isbn:
- '9781728165530'
publication_status: published
publisher: IEEE
quality_controlled: '1'
related_material:
record:
- id: '8331'
relation: dissertation_contains
status: deleted
- id: '8390'
relation: dissertation_contains
status: public
scopus_import: 1
status: public
title: Localizing grouped instances for efficient detection in low-resource scenarios
type: conference
user_id: 2DF688A6-F248-11E8-B48F-1D18A9856A87
year: '2020'
...
---
_id: '7937'
abstract:
- lang: eng
text: 'Fine-tuning is a popular way of exploiting knowledge contained in a pre-trained
convolutional network for a new visual recognition task. However, the orthogonal
setting of transferring knowledge from a pretrained network to a visually different
yet semantically close source is rarely considered: This commonly happens with
real-life data, which is not necessarily as clean as the training source (noise,
geometric transformations, different modalities, etc.).To tackle such scenarios,
we introduce a new, generalized form of fine-tuning, called flex-tuning, in which
any individual unit (e.g. layer) of a network can be tuned, and the most promising
one is chosen automatically. In order to make the method appealing for practical
use, we propose two lightweight and faster selection procedures that prove to
be good approximations in practice. We study these selection criteria empirically
across a variety of domain shifts and data scarcity scenarios, and show that fine-tuning
individual units, despite its simplicity, yields very good results as an adaptation
technique. As it turns out, in contrast to common practice, rather than the last
fully-connected unit it is best to tune an intermediate or early one in many domain-
shift scenarios, which is accurately detected by flex-tuning.'
article_number: 2180-2189
article_processing_charge: No
author:
- first_name: Amélie
full_name: Royer, Amélie
id: 3811D890-F248-11E8-B48F-1D18A9856A87
last_name: Royer
orcid: 0000-0002-8407-0705
- first_name: Christoph
full_name: Lampert, Christoph
id: 40C20FD2-F248-11E8-B48F-1D18A9856A87
last_name: Lampert
orcid: 0000-0001-8622-7887
citation:
ama: 'Royer A, Lampert C. A flexible selection scheme for minimum-effort transfer
learning. In: 2020 IEEE Winter Conference on Applications of Computer Vision.
IEEE; 2020. doi:10.1109/WACV45572.2020.9093635'
apa: 'Royer, A., & Lampert, C. (2020). A flexible selection scheme for minimum-effort
transfer learning. In 2020 IEEE Winter Conference on Applications of Computer
Vision. Snowmass Village, CO, United States: IEEE. https://doi.org/10.1109/WACV45572.2020.9093635'
chicago: Royer, Amélie, and Christoph Lampert. “A Flexible Selection Scheme for
Minimum-Effort Transfer Learning.” In 2020 IEEE Winter Conference on Applications
of Computer Vision. IEEE, 2020. https://doi.org/10.1109/WACV45572.2020.9093635.
ieee: A. Royer and C. Lampert, “A flexible selection scheme for minimum-effort transfer
learning,” in 2020 IEEE Winter Conference on Applications of Computer Vision,
Snowmass Village, CO, United States, 2020.
ista: 'Royer A, Lampert C. 2020. A flexible selection scheme for minimum-effort
transfer learning. 2020 IEEE Winter Conference on Applications of Computer Vision.
WACV: Winter Conference on Applications of Computer Vision, 2180–2189.'
mla: Royer, Amélie, and Christoph Lampert. “A Flexible Selection Scheme for Minimum-Effort
Transfer Learning.” 2020 IEEE Winter Conference on Applications of Computer
Vision, 2180–2189, IEEE, 2020, doi:10.1109/WACV45572.2020.9093635.
short: A. Royer, C. Lampert, in:, 2020 IEEE Winter Conference on Applications of
Computer Vision, IEEE, 2020.
conference:
end_date: 2020-03-05
location: Snowmass Village, CO, United States
name: 'WACV: Winter Conference on Applications of Computer Vision'
start_date: 2020-03-01
date_created: 2020-06-07T22:00:53Z
date_published: 2020-03-01T00:00:00Z
date_updated: 2023-09-07T13:16:17Z
day: '01'
department:
- _id: ChLa
doi: 10.1109/WACV45572.2020.9093635
external_id:
arxiv:
- '2008.11995'
language:
- iso: eng
main_file_link:
- open_access: '1'
url: http://arxiv.org/abs/2008.11995
month: '03'
oa: 1
oa_version: Preprint
publication: 2020 IEEE Winter Conference on Applications of Computer Vision
publication_identifier:
isbn:
- '9781728165530'
publication_status: published
publisher: IEEE
quality_controlled: '1'
related_material:
record:
- id: '8331'
relation: dissertation_contains
status: deleted
- id: '8390'
relation: dissertation_contains
status: public
scopus_import: '1'
status: public
title: A flexible selection scheme for minimum-effort transfer learning
type: conference
user_id: 2DF688A6-F248-11E8-B48F-1D18A9856A87
year: '2020'
...
---
_id: '8193'
abstract:
- lang: eng
text: 'Multiple-environment Markov decision processes (MEMDPs) are MDPs equipped
with not one, but multiple probabilistic transition functions, which represent
the various possible unknown environments. While the previous research on MEMDPs
focused on theoretical properties for long-run average payoff, we study them with
discounted-sum payoff and focus on their practical advantages and applications.
MEMDPs can be viewed as a special case of Partially observable and Mixed observability
MDPs: the state of the system is perfectly observable, but not the environment.
We show that the specific structure of MEMDPs allows for more efficient algorithmic
analysis, in particular for faster belief updates. We demonstrate the applicability
of MEMDPs in several domains. In particular, we formalize the sequential decision-making
approach to contextual recommendation systems as MEMDPs and substantially improve
over the previous MDP approach.'
acknowledgement: Krishnendu Chatterjee is supported by the Austrian ScienceFund (FWF)
NFN Grant No. S11407-N23 (RiSE/SHiNE),and COST Action GAMENET. Petr Novotn ́y is
supported bythe Czech Science Foundation grant No. GJ19-15134Y.
article_processing_charge: No
author:
- first_name: Krishnendu
full_name: Chatterjee, Krishnendu
id: 2E5DCA20-F248-11E8-B48F-1D18A9856A87
last_name: Chatterjee
orcid: 0000-0002-4561-241X
- first_name: Martin
full_name: Chmelik, Martin
id: 3624234E-F248-11E8-B48F-1D18A9856A87
last_name: Chmelik
- first_name: Deep
full_name: Karkhanis, Deep
last_name: Karkhanis
- first_name: Petr
full_name: Novotný, Petr
id: 3CC3B868-F248-11E8-B48F-1D18A9856A87
last_name: Novotný
- first_name: Amélie
full_name: Royer, Amélie
id: 3811D890-F248-11E8-B48F-1D18A9856A87
last_name: Royer
orcid: 0000-0002-8407-0705
citation:
ama: 'Chatterjee K, Chmelik M, Karkhanis D, Novotný P, Royer A. Multiple-environment
Markov decision processes: Efficient analysis and applications. In: Proceedings
of the 30th International Conference on Automated Planning and Scheduling.
Vol 30. Association for the Advancement of Artificial Intelligence; 2020:48-56.'
apa: 'Chatterjee, K., Chmelik, M., Karkhanis, D., Novotný, P., & Royer, A. (2020).
Multiple-environment Markov decision processes: Efficient analysis and applications.
In Proceedings of the 30th International Conference on Automated Planning and
Scheduling (Vol. 30, pp. 48–56). Nancy, France: Association for the Advancement
of Artificial Intelligence.'
chicago: 'Chatterjee, Krishnendu, Martin Chmelik, Deep Karkhanis, Petr Novotný,
and Amélie Royer. “Multiple-Environment Markov Decision Processes: Efficient Analysis
and Applications.” In Proceedings of the 30th International Conference on Automated
Planning and Scheduling, 30:48–56. Association for the Advancement of Artificial
Intelligence, 2020.'
ieee: 'K. Chatterjee, M. Chmelik, D. Karkhanis, P. Novotný, and A. Royer, “Multiple-environment
Markov decision processes: Efficient analysis and applications,” in Proceedings
of the 30th International Conference on Automated Planning and Scheduling,
Nancy, France, 2020, vol. 30, pp. 48–56.'
ista: 'Chatterjee K, Chmelik M, Karkhanis D, Novotný P, Royer A. 2020. Multiple-environment
Markov decision processes: Efficient analysis and applications. Proceedings of
the 30th International Conference on Automated Planning and Scheduling. ICAPS:
International Conference on Automated Planning and Scheduling vol. 30, 48–56.'
mla: 'Chatterjee, Krishnendu, et al. “Multiple-Environment Markov Decision Processes:
Efficient Analysis and Applications.” Proceedings of the 30th International
Conference on Automated Planning and Scheduling, vol. 30, Association for
the Advancement of Artificial Intelligence, 2020, pp. 48–56.'
short: K. Chatterjee, M. Chmelik, D. Karkhanis, P. Novotný, A. Royer, in:, Proceedings
of the 30th International Conference on Automated Planning and Scheduling, Association
for the Advancement of Artificial Intelligence, 2020, pp. 48–56.
conference:
end_date: 2020-10-30
location: Nancy, France
name: 'ICAPS: International Conference on Automated Planning and Scheduling'
start_date: 2020-10-26
date_created: 2020-08-02T22:00:58Z
date_published: 2020-06-01T00:00:00Z
date_updated: 2023-09-07T13:16:18Z
day: '01'
department:
- _id: KrCh
intvolume: ' 30'
language:
- iso: eng
month: '06'
oa_version: None
page: 48-56
project:
- _id: 25863FF4-B435-11E9-9278-68D0E5697425
call_identifier: FWF
grant_number: S11407
name: Game Theory
publication: Proceedings of the 30th International Conference on Automated Planning
and Scheduling
publication_identifier:
eissn:
- '23340843'
issn:
- '23340835'
publication_status: published
publisher: Association for the Advancement of Artificial Intelligence
quality_controlled: '1'
related_material:
record:
- id: '8390'
relation: dissertation_contains
status: public
scopus_import: '1'
status: public
title: 'Multiple-environment Markov decision processes: Efficient analysis and applications'
type: conference
user_id: 2DF688A6-F248-11E8-B48F-1D18A9856A87
volume: 30
year: '2020'
...
---
_id: '8092'
abstract:
- lang: eng
text: Image translation refers to the task of mapping images from a visual domain
to another. Given two unpaired collections of images, we aim to learn a mapping
between the corpus-level style of each collection, while preserving semantic content
shared across the two domains. We introduce xgan, a dual adversarial auto-encoder,
which captures a shared representation of the common domain semantic content in
an unsupervised way, while jointly learning the domain-to-domain image translations
in both directions. We exploit ideas from the domain adaptation literature and
define a semantic consistency loss which encourages the learned embedding to preserve
semantics shared across domains. We report promising qualitative results for the
task of face-to-cartoon translation. The cartoon dataset we collected for this
purpose, “CartoonSet”, is also publicly available as a new benchmark for semantic
style transfer at https://google.github.io/cartoonset/index.html.
article_processing_charge: No
author:
- first_name: Amélie
full_name: Royer, Amélie
id: 3811D890-F248-11E8-B48F-1D18A9856A87
last_name: Royer
orcid: 0000-0002-8407-0705
- first_name: Konstantinos
full_name: Bousmalis, Konstantinos
last_name: Bousmalis
- first_name: Stephan
full_name: Gouws, Stephan
last_name: Gouws
- first_name: Fred
full_name: Bertsch, Fred
last_name: Bertsch
- first_name: Inbar
full_name: Mosseri, Inbar
last_name: Mosseri
- first_name: Forrester
full_name: Cole, Forrester
last_name: Cole
- first_name: Kevin
full_name: Murphy, Kevin
last_name: Murphy
citation:
ama: 'Royer A, Bousmalis K, Gouws S, et al. XGAN: Unsupervised image-to-image translation
for many-to-many mappings. In: Singh R, Vatsa M, Patel VM, Ratha N, eds. Domain
Adaptation for Visual Understanding. Springer Nature; 2020:33-49. doi:10.1007/978-3-030-30671-7_3'
apa: 'Royer, A., Bousmalis, K., Gouws, S., Bertsch, F., Mosseri, I., Cole, F., &
Murphy, K. (2020). XGAN: Unsupervised image-to-image translation for many-to-many
mappings. In R. Singh, M. Vatsa, V. M. Patel, & N. Ratha (Eds.), Domain
Adaptation for Visual Understanding (pp. 33–49). Springer Nature. https://doi.org/10.1007/978-3-030-30671-7_3'
chicago: 'Royer, Amélie, Konstantinos Bousmalis, Stephan Gouws, Fred Bertsch, Inbar
Mosseri, Forrester Cole, and Kevin Murphy. “XGAN: Unsupervised Image-to-Image
Translation for Many-to-Many Mappings.” In Domain Adaptation for Visual Understanding,
edited by Richa Singh, Mayank Vatsa, Vishal M. Patel, and Nalini Ratha, 33–49.
Springer Nature, 2020. https://doi.org/10.1007/978-3-030-30671-7_3.'
ieee: 'A. Royer et al., “XGAN: Unsupervised image-to-image translation for
many-to-many mappings,” in Domain Adaptation for Visual Understanding,
R. Singh, M. Vatsa, V. M. Patel, and N. Ratha, Eds. Springer Nature, 2020, pp.
33–49.'
ista: 'Royer A, Bousmalis K, Gouws S, Bertsch F, Mosseri I, Cole F, Murphy K. 2020.XGAN:
Unsupervised image-to-image translation for many-to-many mappings. In: Domain
Adaptation for Visual Understanding. , 33–49.'
mla: 'Royer, Amélie, et al. “XGAN: Unsupervised Image-to-Image Translation for Many-to-Many
Mappings.” Domain Adaptation for Visual Understanding, edited by Richa
Singh et al., Springer Nature, 2020, pp. 33–49, doi:10.1007/978-3-030-30671-7_3.'
short: A. Royer, K. Bousmalis, S. Gouws, F. Bertsch, I. Mosseri, F. Cole, K. Murphy,
in:, R. Singh, M. Vatsa, V.M. Patel, N. Ratha (Eds.), Domain Adaptation for Visual
Understanding, Springer Nature, 2020, pp. 33–49.
date_created: 2020-07-05T22:00:46Z
date_published: 2020-01-08T00:00:00Z
date_updated: 2023-09-07T13:16:18Z
day: '08'
department:
- _id: ChLa
doi: 10.1007/978-3-030-30671-7_3
editor:
- first_name: Richa
full_name: Singh, Richa
last_name: Singh
- first_name: Mayank
full_name: Vatsa, Mayank
last_name: Vatsa
- first_name: Vishal M.
full_name: Patel, Vishal M.
last_name: Patel
- first_name: Nalini
full_name: Ratha, Nalini
last_name: Ratha
external_id:
arxiv:
- '1711.05139'
language:
- iso: eng
main_file_link:
- open_access: '1'
url: https://arxiv.org/abs/1711.05139
month: '01'
oa: 1
oa_version: Preprint
page: 33-49
publication: Domain Adaptation for Visual Understanding
publication_identifier:
isbn:
- '9783030306717'
publication_status: published
publisher: Springer Nature
quality_controlled: '1'
related_material:
record:
- id: '8331'
relation: dissertation_contains
status: deleted
- id: '8390'
relation: dissertation_contains
status: public
scopus_import: '1'
status: public
title: 'XGAN: Unsupervised image-to-image translation for many-to-many mappings'
type: book_chapter
user_id: 2DF688A6-F248-11E8-B48F-1D18A9856A87
year: '2020'
...
---
_id: '8390'
abstract:
- lang: eng
text: "Deep neural networks have established a new standard for data-dependent feature
extraction pipelines in the Computer Vision literature. Despite their remarkable
performance in the standard supervised learning scenario, i.e. when models are
trained with labeled data and tested on samples that follow a similar distribution,
neural networks have been shown to struggle with more advanced generalization
abilities, such as transferring knowledge across visually different domains, or
generalizing to new unseen combinations of known concepts. In this thesis we argue
that, in contrast to the usual black-box behavior of neural networks, leveraging
more structured internal representations is a promising direction\r\nfor tackling
such problems. In particular, we focus on two forms of structure. First, we tackle
modularity: We show that (i) compositional architectures are a natural tool for
modeling reasoning tasks, in that they efficiently capture their combinatorial
nature, which is key for generalizing beyond the compositions seen during training.
We investigate how to to learn such models, both formally and experimentally,
for the task of abstract visual reasoning. Then, we show that (ii) in some settings,
modularity allows us to efficiently break down complex tasks into smaller, easier,
modules, thereby improving computational efficiency; We study this behavior in
the context of generative models for colorization, as well as for small objects
detection. Secondly, we investigate the inherently layered structure of representations
learned by neural networks, and analyze its role in the context of transfer learning
and domain adaptation across visually\r\ndissimilar domains. "
acknowledged_ssus:
- _id: CampIT
- _id: ScienComp
acknowledgement: Last but not least, I would like to acknowledge the support of the
IST IT and scientific computing team for helping provide a great work environment.
alternative_title:
- ISTA Thesis
article_processing_charge: No
author:
- first_name: Amélie
full_name: Royer, Amélie
id: 3811D890-F248-11E8-B48F-1D18A9856A87
last_name: Royer
orcid: 0000-0002-8407-0705
citation:
ama: Royer A. Leveraging structure in Computer Vision tasks for flexible Deep Learning
models. 2020. doi:10.15479/AT:ISTA:8390
apa: Royer, A. (2020). Leveraging structure in Computer Vision tasks for flexible
Deep Learning models. Institute of Science and Technology Austria. https://doi.org/10.15479/AT:ISTA:8390
chicago: Royer, Amélie. “Leveraging Structure in Computer Vision Tasks for Flexible
Deep Learning Models.” Institute of Science and Technology Austria, 2020. https://doi.org/10.15479/AT:ISTA:8390.
ieee: A. Royer, “Leveraging structure in Computer Vision tasks for flexible Deep
Learning models,” Institute of Science and Technology Austria, 2020.
ista: Royer A. 2020. Leveraging structure in Computer Vision tasks for flexible
Deep Learning models. Institute of Science and Technology Austria.
mla: Royer, Amélie. Leveraging Structure in Computer Vision Tasks for Flexible
Deep Learning Models. Institute of Science and Technology Austria, 2020, doi:10.15479/AT:ISTA:8390.
short: A. Royer, Leveraging Structure in Computer Vision Tasks for Flexible Deep
Learning Models, Institute of Science and Technology Austria, 2020.
date_created: 2020-09-14T13:42:09Z
date_published: 2020-09-14T00:00:00Z
date_updated: 2023-10-16T10:04:02Z
day: '14'
ddc:
- '000'
degree_awarded: PhD
department:
- _id: ChLa
doi: 10.15479/AT:ISTA:8390
file:
- access_level: open_access
checksum: c914d2f88846032f3d8507734861b6ee
content_type: application/pdf
creator: dernst
date_created: 2020-09-14T13:39:14Z
date_updated: 2020-09-14T13:39:14Z
file_id: '8391'
file_name: 2020_Thesis_Royer.pdf
file_size: 30224591
relation: main_file
success: 1
- access_level: closed
checksum: ae98fb35d912cff84a89035ae5794d3c
content_type: application/x-zip-compressed
creator: dernst
date_created: 2020-09-14T13:39:17Z
date_updated: 2020-09-14T13:39:17Z
file_id: '8392'
file_name: thesis_sources.zip
file_size: 74227627
relation: main_file
file_date_updated: 2020-09-14T13:39:17Z
has_accepted_license: '1'
language:
- iso: eng
month: '09'
oa: 1
oa_version: Published Version
page: '197'
publication_identifier:
isbn:
- 978-3-99078-007-7
issn:
- 2663-337X
publication_status: published
publisher: Institute of Science and Technology Austria
related_material:
record:
- id: '7936'
relation: part_of_dissertation
status: public
- id: '7937'
relation: part_of_dissertation
status: public
- id: '8193'
relation: part_of_dissertation
status: public
- id: '8092'
relation: part_of_dissertation
status: public
- id: '911'
relation: part_of_dissertation
status: public
status: public
supervisor:
- first_name: Christoph
full_name: Lampert, Christoph
id: 40C20FD2-F248-11E8-B48F-1D18A9856A87
last_name: Lampert
orcid: 0000-0001-8622-7887
title: Leveraging structure in Computer Vision tasks for flexible Deep Learning models
tmp:
image: /images/cc_by_nc_sa.png
legal_code_url: https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode
name: Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC
BY-NC-SA 4.0)
short: CC BY-NC-SA (4.0)
type: dissertation
user_id: c635000d-4b10-11ee-a964-aac5a93f6ac1
year: '2020'
...
---
_id: '911'
abstract:
- lang: eng
text: We develop a probabilistic technique for colorizing grayscale natural images.
In light of the intrinsic uncertainty of this task, the proposed probabilistic
framework has numerous desirable properties. In particular, our model is able
to produce multiple plausible and vivid colorizations for a given grayscale image
and is one of the first colorization models to provide a proper stochastic sampling
scheme. Moreover, our training procedure is supported by a rigorous theoretical
framework that does not require any ad hoc heuristics and allows for efficient
modeling and learning of the joint pixel color distribution.We demonstrate strong
quantitative and qualitative experimental results on the CIFAR-10 dataset and
the challenging ILSVRC 2012 dataset.
article_processing_charge: No
author:
- first_name: Amélie
full_name: Royer, Amélie
id: 3811D890-F248-11E8-B48F-1D18A9856A87
last_name: Royer
orcid: 0000-0002-8407-0705
- first_name: Alexander
full_name: Kolesnikov, Alexander
id: 2D157DB6-F248-11E8-B48F-1D18A9856A87
last_name: Kolesnikov
- first_name: Christoph
full_name: Lampert, Christoph
id: 40C20FD2-F248-11E8-B48F-1D18A9856A87
last_name: Lampert
orcid: 0000-0001-8622-7887
citation:
ama: 'Royer A, Kolesnikov A, Lampert C. Probabilistic image colorization. In: BMVA
Press; 2017:85.1-85.12. doi:10.5244/c.31.85'
apa: 'Royer, A., Kolesnikov, A., & Lampert, C. (2017). Probabilistic image colorization
(p. 85.1-85.12). Presented at the BMVC: British Machine Vision Conference, London,
United Kingdom: BMVA Press. https://doi.org/10.5244/c.31.85'
chicago: Royer, Amélie, Alexander Kolesnikov, and Christoph Lampert. “Probabilistic
Image Colorization,” 85.1-85.12. BMVA Press, 2017. https://doi.org/10.5244/c.31.85.
ieee: 'A. Royer, A. Kolesnikov, and C. Lampert, “Probabilistic image colorization,”
presented at the BMVC: British Machine Vision Conference, London, United Kingdom,
2017, p. 85.1-85.12.'
ista: 'Royer A, Kolesnikov A, Lampert C. 2017. Probabilistic image colorization.
BMVC: British Machine Vision Conference, 85.1-85.12.'
mla: Royer, Amélie, et al. Probabilistic Image Colorization. BMVA Press,
2017, p. 85.1-85.12, doi:10.5244/c.31.85.
short: A. Royer, A. Kolesnikov, C. Lampert, in:, BMVA Press, 2017, p. 85.1-85.12.
conference:
end_date: 2017-09-07
location: London, United Kingdom
name: 'BMVC: British Machine Vision Conference'
start_date: 2017-09-04
date_created: 2018-12-11T11:49:09Z
date_published: 2017-09-01T00:00:00Z
date_updated: 2023-10-16T10:04:02Z
day: '01'
ddc:
- '000'
department:
- _id: ChLa
doi: 10.5244/c.31.85
ec_funded: 1
external_id:
arxiv:
- '1705.04258'
file:
- access_level: open_access
content_type: application/pdf
creator: dernst
date_created: 2020-08-10T07:14:33Z
date_updated: 2020-08-10T07:14:33Z
file_id: '8224'
file_name: 2017_BMVC_Royer.pdf
file_size: 1625363
relation: main_file
success: 1
file_date_updated: 2020-08-10T07:14:33Z
has_accepted_license: '1'
language:
- iso: eng
month: '09'
oa: 1
oa_version: Published Version
page: 85.1-85.12
project:
- _id: 2532554C-B435-11E9-9278-68D0E5697425
call_identifier: FP7
grant_number: '308036'
name: Lifelong Learning of Visual Scene Understanding
publication_identifier:
eisbn:
- 190172560X
publication_status: published
publisher: BMVA Press
publist_id: '6532'
quality_controlled: '1'
related_material:
record:
- id: '8390'
relation: dissertation_contains
status: public
scopus_import: '1'
status: public
title: Probabilistic image colorization
type: conference
user_id: 2DF688A6-F248-11E8-B48F-1D18A9856A87
year: '2017'
...