---
_id: '11929'
abstract:
- lang: eng
text: Broder et al.'s [3] shingling algorithm and Charikar's [4] random projection
based approach are considered "state-of-the-art" algorithms for finding near-duplicate
web pages. Both algorithms were either developed at or used by popular web search
engines. We compare the two algorithms on a very large scale, namely on a set
of 1.6B distinct web pages. The results show that neither of the algorithms works
well for finding near-duplicate pairs on the same site, while both achieve high
precision for near-duplicate pairs on different sites. Since Charikar's algorithm
finds more near-duplicate pairs on different sites, it achieves a better precision
overall, namely 0.50 versus 0.38 for Broder et al.'s algorithm. We present a combined
algorithm which achieves precision 0.79 with 79% of the recall of the other algorithms.
article_processing_charge: No
author:
- first_name: Monika H
full_name: Henzinger, Monika H
id: 540c9bbd-f2de-11ec-812d-d04a5be85630
last_name: Henzinger
orcid: 0000-0002-5008-6530
citation:
ama: 'Henzinger MH. Finding near-duplicate web pages: A large-scale evaluation of
algorithms. In: 29th Annual International ACM SIGIR Conference on Research
and Development in Information Retrieval. Association for Computing Machinery;
2006:284-291. doi:10.1145/1148170.1148222'
apa: 'Henzinger, M. H. (2006). Finding near-duplicate web pages: A large-scale evaluation
of algorithms. In 29th Annual International ACM SIGIR Conference on Research
and Development in Information Retrieval (pp. 284–291). Seattle, WA, United
States: Association for Computing Machinery. https://doi.org/10.1145/1148170.1148222'
chicago: 'Henzinger, Monika H. “Finding Near-Duplicate Web Pages: A Large-Scale
Evaluation of Algorithms.” In 29th Annual International ACM SIGIR Conference
on Research and Development in Information Retrieval, 284–91. Association
for Computing Machinery, 2006. https://doi.org/10.1145/1148170.1148222.'
ieee: 'M. H. Henzinger, “Finding near-duplicate web pages: A large-scale evaluation
of algorithms,” in 29th Annual International ACM SIGIR Conference on Research
and Development in Information Retrieval, Seattle, WA, United States, 2006,
pp. 284–291.'
ista: 'Henzinger MH. 2006. Finding near-duplicate web pages: A large-scale evaluation
of algorithms. 29th Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval. SIGIR: International Conference on Research
and Development in Information Retrieval, 284–291.'
mla: 'Henzinger, Monika H. “Finding Near-Duplicate Web Pages: A Large-Scale Evaluation
of Algorithms.” 29th Annual International ACM SIGIR Conference on Research
and Development in Information Retrieval, Association for Computing Machinery,
2006, pp. 284–91, doi:10.1145/1148170.1148222.'
short: M.H. Henzinger, in:, 29th Annual International ACM SIGIR Conference on Research
and Development in Information Retrieval, Association for Computing Machinery,
2006, pp. 284–291.
conference:
end_date: 2006-08-11
location: Seattle, WA, United States
name: 'SIGIR: International Conference on Research and Development in Information
Retrieval'
start_date: 2006-08-06
date_created: 2022-08-19T07:20:31Z
date_published: 2006-08-01T00:00:00Z
date_updated: 2023-02-17T13:49:03Z
day: '01'
doi: 10.1145/1148170.1148222
extern: '1'
language:
- iso: eng
month: '08'
oa_version: None
page: 284-291
publication: 29th Annual International ACM SIGIR Conference on Research and Development
in Information Retrieval
publication_status: published
publisher: Association for Computing Machinery
quality_controlled: '1'
scopus_import: '1'
status: public
title: 'Finding near-duplicate web pages: A large-scale evaluation of algorithms'
type: conference
user_id: 2DF688A6-F248-11E8-B48F-1D18A9856A87
year: '2006'
...
---
_id: '1462'
abstract:
- lang: eng
text: A Fourier transform technique is introduced for counting the number of solutions
of holomorphic moment map equations over a finite field. This technique in turn
gives information on Betti numbers of holomorphic symplectic quotients. As a consequence,
simple unified proofs are obtained for formulas of Poincaré polynomials of toric
hyperkähler varieties (recovering results of Bielawski-Dancer and Hausel-Sturmfels),
Poincaré polynomials of Hubert schemes of points and twisted Atiyah-Drinfeld-Hitchin-Manin
(ADHM) spaces of instantons on ℂ2 (recovering results of Nakajima-Yoshioka), and
Poincaré polynomials of all Nakajima quiver varieties. As an application, a proof
of a conjecture of Kac on the number of absolutely indecomposable representations
of a quiver is announced.
acknowledgement: This work was supported by a Royal Society University Research Fellowship,
National Science Foundation Grant DMS-0305505, an Alfred P. Sloan Research Fellowship,
and a Summer Research Assignment of the University of Texas at Austin.
author:
- first_name: Tamas
full_name: Tamas Hausel
id: 4A0666D8-F248-11E8-B48F-1D18A9856A87
last_name: Hausel
citation:
ama: Hausel T. Betti numbers of holomorphic symplectic quotients via arithmetic
Fourier transform. PNAS. 2006;103(16):6120-6124. doi:10.1073/pnas.0601337103
apa: Hausel, T. (2006). Betti numbers of holomorphic symplectic quotients via arithmetic
Fourier transform. PNAS. National Academy of Sciences. https://doi.org/10.1073/pnas.0601337103
chicago: Hausel, Tamás. “Betti Numbers of Holomorphic Symplectic Quotients via Arithmetic
Fourier Transform.” PNAS. National Academy of Sciences, 2006. https://doi.org/10.1073/pnas.0601337103.
ieee: T. Hausel, “Betti numbers of holomorphic symplectic quotients via arithmetic
Fourier transform,” PNAS, vol. 103, no. 16. National Academy of Sciences,
pp. 6120–6124, 2006.
ista: Hausel T. 2006. Betti numbers of holomorphic symplectic quotients via arithmetic
Fourier transform. PNAS. 103(16), 6120–6124.
mla: Hausel, Tamás. “Betti Numbers of Holomorphic Symplectic Quotients via Arithmetic
Fourier Transform.” PNAS, vol. 103, no. 16, National Academy of Sciences,
2006, pp. 6120–24, doi:10.1073/pnas.0601337103.
short: T. Hausel, PNAS 103 (2006) 6120–6124.
date_created: 2018-12-11T11:52:10Z
date_published: 2006-04-18T00:00:00Z
date_updated: 2021-01-12T06:50:55Z
day: '18'
doi: 10.1073/pnas.0601337103
extern: 1
intvolume: ' 103'
issue: '16'
main_file_link:
- open_access: '1'
url: http://arxiv.org/abs/math/0511163
month: '04'
oa: 1
page: 6120 - 6124
publication: PNAS
publication_status: published
publisher: National Academy of Sciences
publist_id: '5734'
quality_controlled: 0
status: public
title: Betti numbers of holomorphic symplectic quotients via arithmetic Fourier transform
type: journal_article
volume: 103
year: '2006'
...
---
_id: '1461'
abstract:
- lang: eng
text: This note proves combinatorially that the intersection pairing on the middle-dimensional
compactly supported cohomology of a toric hyperkähler variety is always definite,
providing a large number of non-trivial L 2 harmonic forms for toric hyperkähler
metrics on these varieties. This is motivated by a result of Hitchin about the
definiteness of the pairing of L 2 harmonic forms on complete hyperkähler manifolds
of linear growth.
acknowledgement: The first author was partly supported by NSF grant DMS-0072675. The
second author was partly supported by a VIGRE postdoc under NSF grant number 9983660
to Cornell University.
author:
- first_name: Tamas
full_name: Tamas Hausel
id: 4A0666D8-F248-11E8-B48F-1D18A9856A87
last_name: Hausel
- first_name: Edward
full_name: Swartz, Edward
last_name: Swartz
citation:
ama: Hausel T, Swartz E. Intersection forms of toric hyperkähler varieties. Proceedings
of the American Mathematical Society. 2006;134(8):2403-2409. doi:10.1090/S0002-9939-06-08248-7
apa: Hausel, T., & Swartz, E. (2006). Intersection forms of toric hyperkähler
varieties. Proceedings of the American Mathematical Society. American Mathematical
Society. https://doi.org/10.1090/S0002-9939-06-08248-7
chicago: Hausel, Tamás, and Edward Swartz. “Intersection Forms of Toric Hyperkähler
Varieties.” Proceedings of the American Mathematical Society. American
Mathematical Society, 2006. https://doi.org/10.1090/S0002-9939-06-08248-7.
ieee: T. Hausel and E. Swartz, “Intersection forms of toric hyperkähler varieties,”
Proceedings of the American Mathematical Society, vol. 134, no. 8. American
Mathematical Society, pp. 2403–2409, 2006.
ista: Hausel T, Swartz E. 2006. Intersection forms of toric hyperkähler varieties.
Proceedings of the American Mathematical Society. 134(8), 2403–2409.
mla: Hausel, Tamás, and Edward Swartz. “Intersection Forms of Toric Hyperkähler
Varieties.” Proceedings of the American Mathematical Society, vol. 134,
no. 8, American Mathematical Society, 2006, pp. 2403–09, doi:10.1090/S0002-9939-06-08248-7.
short: T. Hausel, E. Swartz, Proceedings of the American Mathematical Society 134
(2006) 2403–2409.
date_created: 2018-12-11T11:52:09Z
date_published: 2006-08-01T00:00:00Z
date_updated: 2021-01-12T06:50:54Z
day: '01'
doi: 10.1090/S0002-9939-06-08248-7
extern: 1
intvolume: ' 134'
issue: '8'
main_file_link:
- open_access: '1'
url: http://arxiv.org/abs/math/0306369
month: '08'
oa: 1
page: 2403 - 2409
publication: Proceedings of the American Mathematical Society
publication_status: published
publisher: American Mathematical Society
publist_id: '5733'
quality_controlled: 0
status: public
title: Intersection forms of toric hyperkähler varieties
type: journal_article
volume: 134
year: '2006'
...
---
_id: '1715'
abstract:
- lang: eng
text: 'Background: Cell-to-cell communication at the synapse involves synaptic transmission
as well as signaling mediated by growth factors, which provide developmental and
plasticity cues. There is evidence that a retrograde, presynaptic transforming
growth factor-β (TGF-β) signaling event regulates synapse development and function
in Drosophila. Results: Here we show that a postsynaptic TGF-β signaling event
occurs during larval development. The type I receptor Thick veins (Tkv) and the
R-Smad transcription factor Mothers-against-dpp (Mad) are localized postsynaptically
in the muscle. Furthermore, Mad phosphorylation occurs in regions facing the presynaptic
active zones of neurotransmitter release within the postsynaptic subsynaptic reticulum
(SSR). In order to monitor in real time the levels of TGF-β signaling in the synapse
during synaptic transmission, we have established a FRAP assay to measure Mad
nuclear import/export in the muscle. We show that Mad nuclear trafficking depends
on stimulation of the muscle. Conclusions: Our data suggest a mechanism linking
synaptic transmission and postsynaptic TGF-β signaling that may coordinate nerve-muscle
development and function.'
acknowledgement: This work was supported by the Max Planck Society, HFSP, and Deutsche
Forschungsgemeinschaft.
article_processing_charge: No
author:
- first_name: Veronika
full_name: Dudu, Veronika
last_name: Dudu
- first_name: Thomas
full_name: Bittig, Thomas
last_name: Bittig
- first_name: Eugeni
full_name: Entchev, Eugeni
last_name: Entchev
- first_name: Anna
full_name: Kicheva, Anna
id: 3959A2A0-F248-11E8-B48F-1D18A9856A87
last_name: Kicheva
orcid: 0000-0003-4509-4998
- first_name: Frank
full_name: Julicher, Frank
last_name: Julicher
- first_name: Marcos
full_name: González Gaitán, Marcos
last_name: González Gaitán
citation:
ama: Dudu V, Bittig T, Entchev E, Kicheva A, Julicher F, González Gaitán M. Postsynaptic
mad signaling at the Drosophila neuromuscular junction. Current Biology.
2006;16(7):625-635. doi:10.1016/j.cub.2006.02.061
apa: Dudu, V., Bittig, T., Entchev, E., Kicheva, A., Julicher, F., & González
Gaitán, M. (2006). Postsynaptic mad signaling at the Drosophila neuromuscular
junction. Current Biology. Cell Press. https://doi.org/10.1016/j.cub.2006.02.061
chicago: Dudu, Veronika, Thomas Bittig, Eugeni Entchev, Anna Kicheva, Frank Julicher,
and Marcos González Gaitán. “Postsynaptic Mad Signaling at the Drosophila Neuromuscular
Junction.” Current Biology. Cell Press, 2006. https://doi.org/10.1016/j.cub.2006.02.061.
ieee: V. Dudu, T. Bittig, E. Entchev, A. Kicheva, F. Julicher, and M. González Gaitán,
“Postsynaptic mad signaling at the Drosophila neuromuscular junction,” Current
Biology, vol. 16, no. 7. Cell Press, pp. 625–635, 2006.
ista: Dudu V, Bittig T, Entchev E, Kicheva A, Julicher F, González Gaitán M. 2006.
Postsynaptic mad signaling at the Drosophila neuromuscular junction. Current Biology.
16(7), 625–635.
mla: Dudu, Veronika, et al. “Postsynaptic Mad Signaling at the Drosophila Neuromuscular
Junction.” Current Biology, vol. 16, no. 7, Cell Press, 2006, pp. 625–35,
doi:10.1016/j.cub.2006.02.061.
short: V. Dudu, T. Bittig, E. Entchev, A. Kicheva, F. Julicher, M. González Gaitán,
Current Biology 16 (2006) 625–635.
date_created: 2018-12-11T11:53:37Z
date_published: 2006-04-04T00:00:00Z
date_updated: 2021-11-16T07:44:15Z
day: '04'
doi: 10.1016/j.cub.2006.02.061
extern: '1'
intvolume: ' 16'
issue: '7'
language:
- iso: eng
month: '04'
oa_version: None
page: 625 - 635
publication: Current Biology
publication_status: published
publisher: Cell Press
publist_id: '5416'
related_material:
link:
- relation: erratum
url: https://doi.org/10.1016/j.cub.2006.06.020
status: public
title: Postsynaptic mad signaling at the Drosophila neuromuscular junction
type: journal_article
user_id: 8b945eb4-e2f2-11eb-945a-df72226e66a9
volume: 16
year: '2006'
...
---
_id: '1745'
abstract:
- lang: eng
text: SiGe islands grown by deposition of 10 monolayers of Ge on Si(0 0 1) at 740
°C were investigated by using a combination of selective wet chemical etching
and atomic force microscopy. The used etchant, a solution consisting of ammonium
hydroxide and hydrogen peroxide, shows a high selectivity of Ge over SixGe1-x
and is characterized by relatively slow etching rates for Si-rich alloys. By performing
successive etching experiments on the same sample area, we are able to gain a
deeper insight into the lateral displacement the islands undergo during post growth
annealing.
author:
- first_name: Georgios
full_name: Georgios Katsaros
id: 38DB5788-F248-11E8-B48F-1D18A9856A87
last_name: Katsaros
- first_name: Armando
full_name: Rastelli, Armando
last_name: Rastelli
- first_name: Mathieu
full_name: Stoffel, Mathieu
last_name: Stoffel
- first_name: Giovanni
full_name: Isella, Giovanni
last_name: Isella
- first_name: Hans
full_name: Von Känel, Hans
last_name: Von Känel
- first_name: Alexander
full_name: Bittner, Alexander M
last_name: Bittner
- first_name: Jerry
full_name: Tersoff, Jerry
last_name: Tersoff
- first_name: Ulrich
full_name: Denker, Ulrich
last_name: Denker
- first_name: Oliver
full_name: Schmidt, Oliver G
last_name: Schmidt
- first_name: Giovanni
full_name: Costantini, Giovanni
last_name: Costantini
- first_name: Klaus
full_name: Kern, Klaus
last_name: Kern
citation:
ama: Katsaros G, Rastelli A, Stoffel M, et al. Investigating the lateral motion
of SiGe islands by selective chemical etching. Surface Science. 2006;600(12):2608-2613.
doi:10.1016/j.susc.2006.04.027
apa: Katsaros, G., Rastelli, A., Stoffel, M., Isella, G., Von Känel, H., Bittner,
A., … Kern, K. (2006). Investigating the lateral motion of SiGe islands by selective
chemical etching. Surface Science. Elsevier. https://doi.org/10.1016/j.susc.2006.04.027
chicago: Katsaros, Georgios, Armando Rastelli, Mathieu Stoffel, Giovanni Isella,
Hans Von Känel, Alexander Bittner, Jerry Tersoff, et al. “Investigating the Lateral
Motion of SiGe Islands by Selective Chemical Etching.” Surface Science.
Elsevier, 2006. https://doi.org/10.1016/j.susc.2006.04.027.
ieee: G. Katsaros et al., “Investigating the lateral motion of SiGe islands
by selective chemical etching,” Surface Science, vol. 600, no. 12. Elsevier,
pp. 2608–2613, 2006.
ista: Katsaros G, Rastelli A, Stoffel M, Isella G, Von Känel H, Bittner A, Tersoff
J, Denker U, Schmidt O, Costantini G, Kern K. 2006. Investigating the lateral
motion of SiGe islands by selective chemical etching. Surface Science. 600(12),
2608–2613.
mla: Katsaros, Georgios, et al. “Investigating the Lateral Motion of SiGe Islands
by Selective Chemical Etching.” Surface Science, vol. 600, no. 12, Elsevier,
2006, pp. 2608–13, doi:10.1016/j.susc.2006.04.027.
short: G. Katsaros, A. Rastelli, M. Stoffel, G. Isella, H. Von Känel, A. Bittner,
J. Tersoff, U. Denker, O. Schmidt, G. Costantini, K. Kern, Surface Science 600
(2006) 2608–2613.
date_created: 2018-12-11T11:53:47Z
date_published: 2006-06-15T00:00:00Z
date_updated: 2021-01-12T06:52:56Z
day: '15'
doi: 10.1016/j.susc.2006.04.027
extern: 1
intvolume: ' 600'
issue: '12'
month: '06'
page: 2608 - 2613
publication: Surface Science
publication_status: published
publisher: Elsevier
publist_id: '5379'
quality_controlled: 0
status: public
title: Investigating the lateral motion of SiGe islands by selective chemical etching
type: journal_article
volume: 600
year: '2006'
...